Search Results for author: Rongwei Lu

Retraining-free Model Quantization via One-Shot Weight-Coupling Learning

Conversely, mixed-precision quantization (MPQ) is advocated to compress the model effectively by allocating heterogeneous bit-widths to layers.

Assigning varying compression ratios to workers with distinct data distributions and volumes is thus a promising solution.
