Search Results for author: Chenguang Xi

Found 4 papers, 2 papers with code

Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF

no code implementations • 4 Mar 2024 • Chen Zheng, Ke Sun, Hang Wu, Chenguang Xi, Xun Zhou

This process often leads to issues such as forgetting or a decrease in the base model's abilities.

ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers

no code implementations • 4 Jan 2024 • Chen Zheng, Ke Sun, Da Tang, Yukun Ma, Yuyu Zhang, Chenguang Xi, Xun Zhou

Large Language Models (LLMs) such as ChatGPT and LLaMA encounter limitations in domain-specific tasks: they often lack depth and accuracy in specialized areas, and their general capabilities decline when fine-tuned, particularly the analysis ability of small-sized models.

GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond

1 code implementation • 28 Sep 2023 • Shen Zheng, Yuyu Zhang, Yijie Zhu, Chenguang Xi, Pengyang Gao, Xun Zhou, Kevin Chen-Chuan Chang

With the rapid advancement of large language models (LLMs), there is a pressing need for a comprehensive evaluation suite to assess their capabilities and limitations.

Benchmarking

335

CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU

1 code implementation • 13 Apr 2022 • Zangwei Zheng, Pengtai Xu, Xuan Zou, Da Tang, Zhen Li, Chenguang Xi, Peng Wu, Leqi Zou, Yijie Zhu, Ming Chen, Xiangzhuo Ding, Fuzhao Xue, Ziheng Qin, Youlong Cheng, Yang You

Our experiments show that previous scaling rules fail in the training of CTR prediction neural networks.

Click-Through Rate Prediction

155
