Search Results for author: Yichuan Ding

Found 2 papers, 0 papers with code

Hummer: Towards Limited Competitive Preference Dataset

no code implementations19 May 2024 Li Jiang, Yusen Wu, Junwu Xiong, Jingqing Ruan, Yichuan Ding, Qingpei Guo, Zujie Wen, Jun Zhou, Xiaotie Deng

Preference datasets are essential for incorporating human preferences into pre-trained language models, playing a key role in the success of Reinforcement Learning from Human Feedback.

Cannot find the paper you are looking for? You can Submit a new open access paper.