Search Results for author: Winnie Chow

Found 1 papers, 1 papers with code

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

2 code implementations13 Apr 2023 Hanze Dong, Wei Xiong, Deepanshu Goyal, Yihan Zhang, Winnie Chow, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang

Utilizing a reward model and a sufficient number of samples, our approach selects the high-quality samples, discarding those that exhibit undesired behavior, and subsequently enhancing the model by fine-tuning on these filtered samples.

Ethics

Cannot find the paper you are looking for? You can Submit a new open access paper.