no code implementations • 8 Apr 2024 • Qun Li, Yuan Meng, Chen Tang, Jiacheng Jiang, Zhi Wang
Quantization is a promising technique for reducing the bit-width of deep models to improve their runtime performance and storage efficiency, and has thus become a fundamental step for deployment.
1 code implementation • CVPR 2024 • Chen Tang, Yuan Meng, Jiacheng Jiang, Shuzhao Xie, Rongwei Lu, Xinzhu Ma, Zhi Wang, Wenwu Zhu
Conversely, mixed-precision quantization (MPQ) is advocated to compress the model effectively by allocating heterogeneous bit-widths to layers.
1 code implementation • 14 Feb 2022 • Rui Wu, Zhaopeng Qiu, Jiacheng Jiang, Guilin Qi, Xian Wu
Medication recommendation aims to provide a proper set of medicines according to patients' diagnoses, which is a critical task in clinics.