no code implementations • 21 Mar 2024 • Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Shuaiqiang Wang, Chong Meng, Zhicong Cheng, Zhaochun Ren, Dawei Yin
Training is driven by self-rewards inferred from the model trained in the first stage, without relying on external human preference resources.
no code implementations • 27 Oct 2023 • Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Chong Meng, Shuaiqiang Wang, Zhicong Cheng, Zhaochun Ren, Dawei Yin
In this paper, we propose a novel self-detection method to detect which questions an LLM does not know and for which it is prone to generate nonfactual results.
no code implementations • 25 Oct 2023 • Yukun Zhao, Lingyong Yan, Weiwei Sun, Chong Meng, Shuaiqiang Wang, Zhicong Cheng, Zhaochun Ren, Dawei Yin
Dialogue assessment plays a critical role in the development of open-domain dialogue systems.