Search Results for author: Chenguang Xi

Found 4 papers, 2 papers with code

Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF

no code implementations4 Mar 2024 Chen Zheng, Ke Sun, Hang Wu, Chenguang Xi, Xun Zhou

This process often leads to issues such as forgetting or a decrease in the base model's abilities.

ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers

no code implementations4 Jan 2024 Chen Zheng, Ke Sun, Da Tang, Yukun Ma, Yuyu Zhang, Chenguang Xi, Xun Zhou

The emergence of Large Language Models (LLMs) such as ChatGPT and LLaMA encounter limitations in domain-specific tasks, with these models often lacking depth and accuracy in specialized areas, and exhibiting a decrease in general capabilities when fine-tuned, particularly analysis ability in small sized models.

GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond

1 code implementation28 Sep 2023 Shen Zheng, Yuyu Zhang, Yijie Zhu, Chenguang Xi, Pengyang Gao, Xun Zhou, Kevin Chen-Chuan Chang

With the rapid advancement of large language models (LLMs), there is a pressing need for a comprehensive evaluation suite to assess their capabilities and limitations.

Benchmarking

Cannot find the paper you are looking for? You can Submit a new open access paper.