1 code implementation • 17 Jul 2023 • Shunyu Yao, Howard Chen, Austin W. Hanjie, Runzhe Yang, Karthik Narasimhan
Text generation under constraints have seen increasing interests in natural language processing, especially with the rapidly improving capabilities of large language models.
1 code implementation • 18 Feb 2022 • Vishvak Murahari, Carlos E. Jimenez, Runzhe Yang, Karthik Narasimhan
In this paper, we introduce data multiplexing (DataMUX), a technique that enables deep neural networks to process multiple inputs simultaneously using a single compact representation.
1 code implementation • ACL 2021 • Runzhe Yang, Jingxiao Chen, Karthik Narasimhan
In this paper, we explore the ability to model and infer personality types of opponents, predict their responses, and use this information to adapt a dialog agent's high-level strategy in negotiation tasks.
3 code implementations • NeurIPS 2019 • Runzhe Yang, Xingyuan Sun, Karthik Narasimhan
We introduce a new algorithm for multi-objective reinforcement learning (MORL) with linear preferences, with the goal of enabling few-shot adaptation to new tasks.
Multi-Objective Reinforcement Learning reinforcement-learning
1 code implementation • 7 May 2018 • Junwen Bai, Zihang Lai, Runzhe Yang, Yexiang Xue, John Gregoire, Carla Gomes
We propose imitation refinement, a novel approach to refine imperfect input patterns, guided by a pre-trained classifier incorporating prior knowledge from simulated theoretical data, such that the refined patterns imitate the ideal data.
no code implementations • EMNLP 2017 • Cheng Chang, Runzhe Yang, Lu Chen, Xiang Zhou, Kai Yu
The key to building an evolvable dialogue system in real-world scenarios is to ensure an affordable on-line dialogue policy learning, which requires the on-line learning process to be safe, efficient and economical.
no code implementations • EMNLP 2017 • Lu Chen, Xiang Zhou, Cheng Chang, Runzhe Yang, Kai Yu
Hand-crafted rules and reinforcement learning (RL) are two popular choices to obtain dialogue policy.
no code implementations • EACL 2017 • Lu Chen, Runzhe Yang, Cheng Chang, Zihao Ye, Xiang Zhou, Kai Yu
On-line dialogue policy learning is the key for building evolvable conversational agent in real world scenarios.