Search Results for author: Kaihe Xu

Found 2 papers, 1 paper with code

Reinforcement Learning with Token-level Feedback for Controllable Text Generation

1 code implementation · 18 Mar 2024 · Wendi Li, Wei Wei, Kaihe Xu, Wenfeng Xie, Dangyang Chen, Yu Cheng

To meet the requirements of real-world applications, it is essential to control generations of large language models (LLMs).

Attribute, reinforcement-learning, +3 more
