Search Results for author: Shuang Peng

Found 8 papers, 1 papers with code

EdgeInfinite: A Memory-Efficient Infinite-Context Transformer for Edge Devices

no code implementations28 Mar 2025 Jiyu Chen, Shuang Peng, Daxiong Luo, Fan Yang, Renshou Wu, Fangyuan Li, Xiaoxin Chen

Transformer-based large language models (LLMs) encounter challenges in processing long sequences on edge devices due to the quadratic complexity of attention mechanisms and growing memory demands from Key-Value (KV) cache.

One Filter to Deploy Them All: Robust Safety for Quadrupedal Navigation in Unknown Environments

no code implementations13 Dec 2024 Albert Lin, Shuang Peng, Somil Bansal

Through simulation studies and hardware experiments on a Unitree Go1 quadruped, we demonstrate that the proposed framework can automatically safeguard a wide range of hierarchical quadruped controllers, adapts to novel environments, and is robust to unmodeled dynamics without a priori access to the controllers or environments - hence, "One Filter to Deploy Them All".

All Optical Character Recognition (OCR)

FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization

no code implementations28 Feb 2024 Yi Zhang, Fei Yang, Shuang Peng, Fangyu Wang, Aimin Pan

The 4-bit matrix multiplication introduced in the FlattenQuant method can effectively address the compute-bound caused by large matrix calculation.

Quantization

Holmes: Towards Distributed Training Across Clusters with Heterogeneous NIC Environment

no code implementations6 Dec 2023 Fei Yang, Shuang Peng, Ning Sun, Fangyu Wang, Yuanyuan Wang, Fu Wu, Jiezhong Qiu, Aimin Pan

Large language models (LLMs) such as GPT-3, OPT, and LLaMA have demonstrated remarkable accuracy in a wide range of tasks.

Scheduling

Exploring Post-Training Quantization of Protein Language Models

1 code implementation30 Oct 2023 Shuang Peng, Fei Yang, Ning Sun, Sheng Chen, Yanfeng Jiang, Aimin Pan

In summary, our study introduces an innovative PTQ method for ProteinLMs, addressing specific quantization challenges and potentially leading to the development of more efficient ProteinLMs with significant implications for various protein-related applications.

Protein Structure Prediction Quantization

AdaCoach: A Virtual Coach for Training Customer Service Agents

no code implementations27 Apr 2022 Shuang Peng, Shuai Zhu, Minghui Yang, Haozhou Huang, Dan Liu, Zujie Wen, Xuelian Li, Biao Fan

With the development of online business, customer service agents gradually play a crucial role as an interface between the companies and their customers.

Dialogue Evaluation

A Dialogue-based Information Extraction System for Medical Insurance Assessment

no code implementations Findings (ACL) 2021 Shuang Peng, Mengdi Zhou, Minghui Yang, Haitao Mi, Shaosheng Cao, Zujie Wen, Teng Xu, Hongbin Wang, Lei Liu

In the Chinese medical insurance industry, the assessor's role is essential and requires significant efforts to converse with the claimant.

Cannot find the paper you are looking for? You can Submit a new open access paper.