1 code implementation • 13 Nov 2023 • Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren
We further use the data generated by LINK to construct a dataset, Logic-Induced-Long-Tail (LINT), that can be used to evaluate downstream models on the long-tail distribution; LINT contains 108K knowledge statements spanning four domains.
no code implementations • 20 Dec 2022 • Howard Chen, Huihan Li, Danqi Chen, Karthik Narasimhan
We consider the task of text generation in language models with constraints specified in natural language.
2 code implementations • ACL 2022 • Huihan Li, Tianyu Gao, Manan Goenka, Danqi Chen
In this work, we conduct the first large-scale human evaluation of state-of-the-art conversational QA systems, where human evaluators converse with models and judge the correctness of their answers.