1 code implementation • 16 Feb 2024 • Xuan Shen, Zhenglun Kong, Changdi Yang, Zhaoyang Han, Lei Lu, Peiyan Dong, Cheng Lyu, Chih-hsiang Li, Xuehang Guo, Zhihao Shu, Wei Niu, Miriam Leeser, Pu Zhao, Yanzhi Wang
In this paper, we propose EdgeQAT, the Entropy and Distribution Guided QAT for the optimization of lightweight LLMs to achieve inference acceleration on Edge devices.
no code implementations • 26 Dec 2023 • Xuan Sheng, Zhicheng Li, Zhaoyang Han, Xiangmao Chang, Piji Li
Meanwhile, we conduct automatic evaluation and human inspection, which indicate the proposed method possesses good performance of stealthiness without bringing grammatical issues and altering the meaning of sentences.
no code implementations • 22 Nov 2022 • Xuan Sheng, Zhaoyang Han, Piji Li, Xiangmao Chang
Deep learning is becoming increasingly popular in real-life applications, especially in natural language processing (NLP).
no code implementations • 5 Sep 2022 • Yundi Shi, Piji Li, Changchun Yin, Zhaoyang Han, Lu Zhou, Zhe Liu
Therefore, in this paper, we propose a malicious prompt template construction method (\textbf{PromptAttack}) to probe the security performance of PLMs.