1 code implementation • 28 Mar 2024 • Hao Lang, Fei Huang, Yongbin Li
RLHF contains three steps, i. e., human preference collecting, reward learning, and policy optimization, which are usually performed serially.
1 code implementation • 17 Mar 2024 • Feifan Song, Bowen Yu, Hao Lang, Haiyang Yu, Fei Huang, Houfeng Wang, Yongbin Li
Additionally, the concept of diversity for prompts can be more complex than responses that are typically quantified by single digits.
no code implementations • 12 Oct 2023 • Yi Dai, Hao Lang, Kaisheng Zeng, Fei Huang, Yongbin Li
Out-of-distribution (OOD) detection is essential for reliable and trustworthy machine learning.
1 code implementation • 11 May 2023 • Yi Dai, Hao Lang, Yinhe Zheng, Fei Huang, Yongbin Li
A retrieve-then-rerank frame is further introduced to select in-context examples, which guild the LM to generate text that express knowledge for QA tasks.
1 code implementation • 11 May 2023 • Yi Dai, Hao Lang, Yinhe Zheng, Bowen Yu, Fei Huang, Yongbin Li
Specifically, we dedicate task-level prompts to capture task-specific knowledge to retain high LL performances and maintain instance-level prompts to learn knowledge shared across input samples to improve the model's generalization performance.
no code implementations • 5 May 2023 • Hao Lang, Yinhe Zheng, Binyuan Hui, Fei Huang, Yongbin Li
Out-of-Domain (OOD) intent detection is vital for practical dialogue systems, and it usually requires considering multi-turn dialogue contexts.
no code implementations • 5 May 2023 • Hao Lang, Yinhe Zheng, Yixuan Li, Jian Sun, Fei Huang, Yongbin Li
Out-of-distribution (OOD) detection is essential for the reliable and safe deployment of machine learning systems in the real world.
Out-of-Distribution Detection Out of Distribution (OOD) Detection +1
no code implementations • 10 Nov 2022 • Hao Lang, Yinhe Zheng, Jian Sun, Fei Huang, Luo Si, Yongbin Li
Out-of-Domain (OOD) intent detection is important for practical dialog systems.
no code implementations • 10 Jun 2019 • Hao Lang, Wen Wang
The RBSMA algorithm improves the test set accuracy by 7. 8% relative compared to the standard beam search.