Search Results for author: Jianchen Zhu

Found 5 papers, 0 papers with code

EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs

no code implementations5 Mar 2024 Hanlin Tang, Yifu Sun, Decheng Wu, Kai Liu, Jianchen Zhu, Zhanhui Kang

To our best knowledge, we are the first work that achieves almost lossless quantization performance for LLMs under a data-independent setting and our algorithm runs over 10 times faster than the data-dependent methods.

Data Free Quantization

E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity

no code implementations24 Oct 2023 Yun Li, Lin Niu, Xipeng Zhang, Kai Liu, Jianchen Zhu, Zhanhui Kang

Traditional pruning methods are known to be challenging to work in Large Language Models (LLMs) for Generative AI because of their unaffordable training process and large computational demands.

Language Modelling Large Language Model

MKQ-BERT: Quantized BERT with 4-bits Weights and Activations

no code implementations25 Mar 2022 Hanlin Tang, Xipeng Zhang, Kai Liu, Jianchen Zhu, Zhanhui Kang

In this work, we propose MKQ-BERT, which further improves the compression level and uses 4-bits for quantization.

Quantization

Spatial Sparse subspace clustering for Compressive Spectral imaging

no code implementations5 Nov 2019 Jianchen Zhu, Tong Zhang, Shengjie Zhao, Carlos Hinojosa, Zengli Liu, Gonzalo R. Arce

This paper aims at developing a clustering approach with spectral images directly from CASSI compressive measurements.

Clustering Image Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.