Search Results for author: Kaixiang Lin

Found 19 papers, 11 papers with code

Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk

no code implementations • 10 Jan 2024 • Dennis Ulmer, Elman Mansimov, Kaixiang Lin, Justin Sun, Xibin Gao, Yi Zhang

This metric is used to filter the generated conversational data that is fed back in LLM for training.

Paper
Add Code

Automated Few-shot Classification with Instruction-Finetuned Language Models

1 code implementation • 21 May 2023 • Rami Aly, Xingjian Shi, Kaixiang Lin, Aston Zhang, Andrew Gordon Wilson

We observe, in the context of classification tasks, that instruction finetuned language models exhibit remarkable prompt robustness, and we subsequently propose a simple method to eliminate the need for handcrafted prompts, named AuT-Few.

Classification Few-Shot Learning +1

Paper
Code

Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic

no code implementations • 8 Nov 2022 • Soumajyoti Sarkar, Kaixiang Lin, Sailik Sengupta, Leonard Lausen, Sheng Zha, Saab Mansour

While prior research studies have tried to adapt these multilingual models for dialectal variants of Arabic, it still remains a challenging problem owing to the lack of sufficient monolingual dialectal data and parallel translation data of such dialectal variants.

Avg Language Modelling +1

Paper
Add Code

CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning

no code implementations • 26 Aug 2022 • Vasu Sharma, Prasoon Goyal, Kaixiang Lin, Govind Thattai, Qiaozi Gao, Gaurav S. Sukhatme

We propose a multimodal (vision-and-language) benchmark for cooperative and heterogeneous multi-agent learning.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following

2 code implementations • 27 Feb 2022 • Xiaofeng Gao, Qiaozi Gao, Ran Gong, Kaixiang Lin, Govind Thattai, Gaurav S. Sukhatme

Language-guided Embodied AI benchmarks requiring an agent to navigate an environment and manipulate objects typically allow one-way communication: the human user gives a natural language command to the agent, and the agent can only follow the command passively.

Instruction Following Navigate

Paper
Code

Learning to Act with Affordance-Aware Multimodal Neural SLAM

1 code implementation • 24 Jan 2022 • Zhiwei Jia, Kaixiang Lin, Yizhou Zhao, Qiaozi Gao, Govind Thattai, Gaurav Sukhatme

With the proposed Affordance-aware Multimodal Neural SLAM (AMSLAM) approach, we obtain more than 40% improvement over prior published work on the ALFRED benchmark and set a new state-of-the-art generalization performance at a success rate of 23. 48% on the test unseen scenes.

Efficient Exploration Test unseen

Paper
Code

Learning Two-Step Hybrid Policy for Graph-Based Interpretable Reinforcement Learning

no code implementations • 21 Jan 2022 • Tongzhou Mu, Kaixiang Lin, Feiyang Niu, Govind Thattai

We present a two-step hybrid reinforcement learning (RL) policy that is designed to generate interpretable and robust hierarchical policies on the RL problem with graph-based input.

Decision Making reinforcement-learning +3

Paper
Add Code

LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

1 code implementation • 10 Nov 2021 • Yizhou Zhao, Kaixiang Lin, Zhiwei Jia, Qiaozi Gao, Govind Thattai, Jesse Thomason, Gaurav S. Sukhatme

However, current simulators for Embodied AI (EAI) challenges only provide simulated indoor scenes with a limited number of layouts.

Indoor Scene Synthesis Scene Generation

Paper
Code

Off-Policy Imitation Learning from Observations

1 code implementation • NeurIPS 2020 • Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

To further accelerate the learning procedure, we regulate the policy update with an inverse action model, which assists distribution matching from the perspective of mode-covering.

Imitation Learning

Paper
Code

Unsupervised Anomaly Detection by Robust Collaborative Autoencoders

2 code implementations • 1 Jan 2021 • Boyang Liu, Ding Wang, Kaixiang Lin, Pang-Ning Tan, Jiayu Zhou

Unsupervised anomaly detection plays a crucial role in many critical applications.

Unsupervised Anomaly Detection

282

Paper
Code

PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control

no code implementations • 24 Nov 2020 • Dong Chen, Kaian Chen. Zhaojian Li, Tianshu Chu, Rui Yao, Feng Qiu, Kaixiang Lin

Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generators (DGs), which is first formulated as a cooperative multi-agent reinforcement learning (MARL) problem.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Transfer Learning in Deep Reinforcement Learning: A Survey

no code implementations • 16 Sep 2020 • Zhuangdi Zhu, Kaixiang Lin, Anil K. Jain, Jiayu Zhou

Reinforcement learning is a learning paradigm for solving sequential decision-making problems.

Decision Making reinforcement-learning +2

Paper
Add Code

A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg

no code implementations • 11 Jul 2020 • Zhaonan Qu, Kaixiang Lin, Zhaojian Li, Jiayu Zhou, Zhengyuan Zhou

For strongly convex and convex problems, we also characterize the corresponding convergence rates for the Nesterov accelerated FedAvg algorithm, which are the first linear speedup guarantees for momentum variants of FedAvg in convex settings.

Distributed Optimization Federated Learning

Paper
Add Code

Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations

1 code implementation • 1 Apr 2020 • Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

SAIL bridges the advantages of IL and RL to reduce the sample complexity substantially, by effectively exploiting sup-optimal demonstrations and efficiently exploring the environment to surpass the demonstrated performance.

Continuous Control Imitation Learning +1

Paper
Code

Ranking Policy Gradient

1 code implementation • ICLR 2020 • Kaixiang Lin, Jiayu Zhou

Sample inefficiency is a long-lasting problem in reinforcement learning (RL).

Policy Gradient Methods Reinforcement Learning (RL)

Paper
Code

Differentially Private Generative Adversarial Network

2 code implementations • 19 Feb 2018 • Liyang Xie, Kaixiang Lin, Shu Wang, Fei Wang, Jiayu Zhou

Generative Adversarial Network (GAN) and its variants have recently attracted intensive research interests due to their elegant theoretical foundation and excellent empirical performance as generative models.

Generative Adversarial Network

Paper
Code

Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management

1 code implementation • 18 Feb 2018 • Kaixiang Lin, Renyu Zhao, Zhe Xu, Jiayu Zhou

Large-scale online ride-sharing platforms have substantially transformed our lives by reallocating transportation resources to alleviate traffic congestion and promote transportation efficiency.

Management Multi-agent Reinforcement Learning +3

184

Paper
Code

Improving Mild Cognitive Impairment Prediction via Reinforcement Learning and Dialogue Simulation

no code implementations • 18 Feb 2018 • Fengyi Tang, Kaixiang Lin, Ikechukwu Uchendu, Hiroko H. Dodge, Jiayu Zhou

Even though there is mild cognitive decline in MCI patients, they have normal overall cognition and thus is challenging to distinguish from normal aging.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Collaborative Deep Reinforcement Learning

1 code implementation • 19 Feb 2017 • Kaixiang Lin, Shu Wang, Jiayu Zhou

Motivated by human collaborative learning, in this paper we propose a collaborative deep reinforcement learning (CDRL) framework that performs adaptive knowledge transfer among heterogeneous learning agents.

Knowledge Distillation OpenAI Gym +3

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.