1 code implementation • 22 May 2025 • Zhepei Wei, Wenlin Yao, Yao Liu, Weizhi Zhang, Qin Lu, Liang Qiu, Changlong Yu, Puyang Xu, Chao Zhang, Bing Yin, Hyokun Yun, Lihong Li
While reinforcement learning (RL) has demonstrated remarkable success in enhancing large language models (LLMs), it has primarily focused on single-turn tasks such as solving math problems.
no code implementations • 17 Nov 2024 • Mohammad Kachuee, Sarthak Ahuja, Vaibhav Kumar, Puyang Xu, Xiaohu Liu
By conducting extensive experiments on a dataset covering complex and multi-tool scenarios, we show that leveraging LLMs for query generation improves the retrieval for in-domain (seen tools) and out-of-domain (unseen tools) settings.
no code implementations • 9 Mar 2023 • Ke Bai, Guoyin Wang, Jiwei Li, Sunghyun Park, Sungjin Lee, Puyang Xu, Ricardo Henao, Lawrence Carin
Open world classification is a task in natural language processing with key practical relevance and impact.
1 code implementation • 9 Sep 2022 • Yeon Seonwoo, Guoyin Wang, Changmin Seo, Sajal Choudhary, Jiwei Li, Xiang Li, Puyang Xu, Sunghyun Park, Alice Oh
In this work, we show that the semantic meaning of a sentence is also determined by nearest-neighbor sentences that are similar to the input sentence.
no code implementations • ACL 2018 • Puyang Xu, Qi Hu
We highlight a practical yet rarely discussed problem in dialogue state tracking (DST), namely handling unknown slot values.
no code implementations • NeurIPS 2011 • Asela Gunawardana, Christopher Meek, Puyang Xu
We introduce the Piecewise-Constant Conditional Intensity Model, a model for learning temporal dependencies in event streams.