Search Results for author: Mengni Wang

Found 1 paper, 1 paper with code

Efficient Post-training Quantization with FP8 Formats

2 code implementations • 26 Sep 2023 • Haihao Shen, Naveen Mellempudi, Xin He, Qun Gao, Chang Wang, Mengni Wang

Recent advances in deep learning methods such as LLMs and Diffusion models have created a need for improved quantization methods that can meet the computational demands of these modern architectures while maintaining accuracy.

Image Classification • Language Modelling • +3
