Search Results for author: Lixing Shen

Found 3 papers, 2 papers with code

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

1 code implementation20 Feb 2025 Tao Ji, Bin Guo, Yuanbin Wu, Qipeng Guo, Lixing Shen, Zhan Chen, Xipeng Qiu, Qi Zhang, Tao Gui

For example, the KV cache size of Llama2-7B is reduced by 92. 19%, with only a 0. 5% drop in LongBench performance.

Quantization

Two Step Joint Model for Drug Drug Interaction Extraction

no code implementations28 Aug 2020 Siliang Tang, Qi Zhang, Tianpeng Zheng, Mengdi Zhou, Zhan Chen, Lixing Shen, Xiang Ren, Yueting Zhuang, ShiLiang Pu, Fei Wu

When patients need to take medicine, particularly taking more than one kind of drug simultaneously, they should be alarmed that there possibly exists drug-drug interaction.

Decoder Drug–drug Interaction Extraction +5

Cannot find the paper you are looking for? You can Submit a new open access paper.