Search Results for author: Runzhe Yang

Found 8 papers, 5 papers with code

COLLIE: Systematic Construction of Constrained Text Generation Tasks

1 code implementation • 17 Jul 2023 • Shunyu Yao, Howard Chen, Austin W. Hanjie, Runzhe Yang, Karthik Narasimhan

Text generation under constraints have seen increasing interests in natural language processing, especially with the rapidly improving capabilities of large language models.

Logical Reasoning Sentence +1

Paper
Code

DataMUX: Data Multiplexing for Neural Networks

1 code implementation • 18 Feb 2022 • Vishvak Murahari, Carlos E. Jimenez, Runzhe Yang, Karthik Narasimhan

In this paper, we introduce data multiplexing (DataMUX), a technique that enables deep neural networks to process multiple inputs simultaneously using a single compact representation.

Image Classification named-entity-recognition +5

Paper
Code

Improving Dialog Systems for Negotiation with Personality Modeling

1 code implementation • ACL 2021 • Runzhe Yang, Jingxiao Chen, Karthik Narasimhan

In this paper, we explore the ability to model and infer personality types of opponents, predict their responses, and use this information to adapt a dialog agent's high-level strategy in negotiation tasks.

Paper
Code

A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation

3 code implementations • NeurIPS 2019 • Runzhe Yang, Xingyuan Sun, Karthik Narasimhan

We introduce a new algorithm for multi-objective reinforcement learning (MORL) with linear preferences, with the goal of enabling few-shot adaptation to new tasks.

Multi-Objective Reinforcement Learning reinforcement-learning

238

Paper
Code

End-to-End Refinement Guided by Pre-trained Prototypical Classifier

1 code implementation • 7 May 2018 • Junwen Bai, Zihang Lai, Runzhe Yang, Yexiang Xue, John Gregoire, Carla Gomes

We propose imitation refinement, a novel approach to refine imperfect input patterns, guided by a pre-trained classifier incorporating prior knowledge from simulated theoretical data, such that the refined patterns imitate the ideal data.

Paper
Code

Affordable On-line Dialogue Policy Learning

no code implementations • EMNLP 2017 • Cheng Chang, Runzhe Yang, Lu Chen, Xiang Zhou, Kai Yu

The key to building an evolvable dialogue system in real-world scenarios is to ensure an affordable on-line dialogue policy learning, which requires the on-line learning process to be safe, efficient and economical.

Dialogue Management

Paper
Add Code

Agent-Aware Dropout DQN for Safe and Efficient On-line Dialogue Policy Learning

no code implementations • EMNLP 2017 • Lu Chen, Xiang Zhou, Cheng Chang, Runzhe Yang, Kai Yu

Hand-crafted rules and reinforcement learning (RL) are two popular choices to obtain dialogue policy.

Automatic Speech Recognition (ASR) Dialogue Management +3

Paper
Add Code

On-line Dialogue Policy Learning with Companion Teaching

no code implementations • EACL 2017 • Lu Chen, Runzhe Yang, Cheng Chang, Zihao Ye, Xiang Zhou, Kai Yu

On-line dialogue policy learning is the key for building evolvable conversational agent in real world scenarios.

Dialogue Management

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.