no code implementations • 16 Apr 2024 • Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu
Harnessing visual texts represents a burgeoning frontier in the evolution of language modeling.
1 code implementation • 11 Apr 2024 • Qingyi Liu, Yekun Chai, Shuohuan Wang, Yu Sun, Qiwei Peng, Keze Wang, Hua Wu
This paper presents GPTfluence, a novel approach that leverages a featurized simulation to assess the impact of training examples on the training dynamics of GPT models.
1 code implementation • 15 Sep 2022 • Yunyi Yang, Hong Ding, Qingyi Liu, Xiaojun Quan
This paper studies the exposure bias problem in task-oriented dialog systems, where the model's generated content over multiple turns drives the dialog context away from the ground-truth distribution at training time, introducing error propagation and damaging the robustness of the TOD system.