Search Results for author: Kaixiang Lin

Found 19 papers, 11 papers with code

Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk

no code implementations10 Jan 2024 Dennis Ulmer, Elman Mansimov, Kaixiang Lin, Justin Sun, Xibin Gao, Yi Zhang

This metric is used to filter the generated conversational data that is fed back in LLM for training.

Automated Few-shot Classification with Instruction-Finetuned Language Models

1 code implementation21 May 2023 Rami Aly, Xingjian Shi, Kaixiang Lin, Aston Zhang, Andrew Gordon Wilson

We observe, in the context of classification tasks, that instruction finetuned language models exhibit remarkable prompt robustness, and we subsequently propose a simple method to eliminate the need for handcrafted prompts, named AuT-Few.

Classification Few-Shot Learning +1

Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic

no code implementations8 Nov 2022 Soumajyoti Sarkar, Kaixiang Lin, Sailik Sengupta, Leonard Lausen, Sheng Zha, Saab Mansour

While prior research studies have tried to adapt these multilingual models for dialectal variants of Arabic, it still remains a challenging problem owing to the lack of sufficient monolingual dialectal data and parallel translation data of such dialectal variants.

Avg Language Modelling +1

DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following

2 code implementations27 Feb 2022 Xiaofeng Gao, Qiaozi Gao, Ran Gong, Kaixiang Lin, Govind Thattai, Gaurav S. Sukhatme

Language-guided Embodied AI benchmarks requiring an agent to navigate an environment and manipulate objects typically allow one-way communication: the human user gives a natural language command to the agent, and the agent can only follow the command passively.

Instruction Following Navigate

Learning to Act with Affordance-Aware Multimodal Neural SLAM

1 code implementation24 Jan 2022 Zhiwei Jia, Kaixiang Lin, Yizhou Zhao, Qiaozi Gao, Govind Thattai, Gaurav Sukhatme

With the proposed Affordance-aware Multimodal Neural SLAM (AMSLAM) approach, we obtain more than 40% improvement over prior published work on the ALFRED benchmark and set a new state-of-the-art generalization performance at a success rate of 23. 48% on the test unseen scenes.

Efficient Exploration Test unseen

Learning Two-Step Hybrid Policy for Graph-Based Interpretable Reinforcement Learning

no code implementations21 Jan 2022 Tongzhou Mu, Kaixiang Lin, Feiyang Niu, Govind Thattai

We present a two-step hybrid reinforcement learning (RL) policy that is designed to generate interpretable and robust hierarchical policies on the RL problem with graph-based input.

Decision Making reinforcement-learning +3

LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

1 code implementation10 Nov 2021 Yizhou Zhao, Kaixiang Lin, Zhiwei Jia, Qiaozi Gao, Govind Thattai, Jesse Thomason, Gaurav S. Sukhatme

However, current simulators for Embodied AI (EAI) challenges only provide simulated indoor scenes with a limited number of layouts.

Indoor Scene Synthesis Scene Generation

Off-Policy Imitation Learning from Observations

1 code implementation NeurIPS 2020 Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

To further accelerate the learning procedure, we regulate the policy update with an inverse action model, which assists distribution matching from the perspective of mode-covering.

Imitation Learning

PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control

no code implementations24 Nov 2020 Dong Chen, Kaian Chen. Zhaojian Li, Tianshu Chu, Rui Yao, Feng Qiu, Kaixiang Lin

Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generators (DGs), which is first formulated as a cooperative multi-agent reinforcement learning (MARL) problem.

Multi-agent Reinforcement Learning reinforcement-learning +1

A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg

no code implementations11 Jul 2020 Zhaonan Qu, Kaixiang Lin, Zhaojian Li, Jiayu Zhou, Zhengyuan Zhou

For strongly convex and convex problems, we also characterize the corresponding convergence rates for the Nesterov accelerated FedAvg algorithm, which are the first linear speedup guarantees for momentum variants of FedAvg in convex settings.

Distributed Optimization Federated Learning

Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations

1 code implementation1 Apr 2020 Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou

SAIL bridges the advantages of IL and RL to reduce the sample complexity substantially, by effectively exploiting sup-optimal demonstrations and efficiently exploring the environment to surpass the demonstrated performance.

Continuous Control Imitation Learning +1

Differentially Private Generative Adversarial Network

2 code implementations19 Feb 2018 Liyang Xie, Kaixiang Lin, Shu Wang, Fei Wang, Jiayu Zhou

Generative Adversarial Network (GAN) and its variants have recently attracted intensive research interests due to their elegant theoretical foundation and excellent empirical performance as generative models.

Generative Adversarial Network

Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management

1 code implementation18 Feb 2018 Kaixiang Lin, Renyu Zhao, Zhe Xu, Jiayu Zhou

Large-scale online ride-sharing platforms have substantially transformed our lives by reallocating transportation resources to alleviate traffic congestion and promote transportation efficiency.

Management Multi-agent Reinforcement Learning +3

Improving Mild Cognitive Impairment Prediction via Reinforcement Learning and Dialogue Simulation

no code implementations18 Feb 2018 Fengyi Tang, Kaixiang Lin, Ikechukwu Uchendu, Hiroko H. Dodge, Jiayu Zhou

Even though there is mild cognitive decline in MCI patients, they have normal overall cognition and thus is challenging to distinguish from normal aging.

reinforcement-learning Reinforcement Learning (RL)

Collaborative Deep Reinforcement Learning

1 code implementation19 Feb 2017 Kaixiang Lin, Shu Wang, Jiayu Zhou

Motivated by human collaborative learning, in this paper we propose a collaborative deep reinforcement learning (CDRL) framework that performs adaptive knowledge transfer among heterogeneous learning agents.

Knowledge Distillation OpenAI Gym +3

Cannot find the paper you are looking for? You can Submit a new open access paper.