Search Results for author: Grace Zhang

Found 6 papers, 3 papers with code

Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing

no code implementations • 1 Feb 2023 • Grace Zhang, Ayush Jain, Injune Hwang, Shao-Hua Sun, Joseph J. Lim

The ability to leverage shared behaviors between tasks is critical for sample-efficient multi-task reinforcement learning (MTRL).

reinforcement-learning • Reinforcement Learning (RL)

CoMPS: Continual Meta Policy Search

no code implementations • ICLR 2022 • Glen Berseth, Zhiwei Zhang, Grace Zhang, Chelsea Finn, Sergey Levine

Beyond simply transferring past experience to new tasks, our goal is to devise continual reinforcement learning algorithms that learn to learn, using their experience on previous tasks to learn new tasks more quickly.

Continual Learning • Continuous Control +5

Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding

1 code implementation • 1 Jul 2021 • Grace Zhang, Linghan Zhong, Youngwoon Lee, Joseph J. Lim

In this paper, we propose IDAPT, a novel policy transfer method with iterative "environment grounding" that alternates between (1) directly minimizing both visual and dynamics domain gaps by grounding the source environment in the target environment domains, and (2) training a policy on the grounded source environment.
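The two-step alternation described in the abstract can be sketched as a simple loop. This is a minimal skeleton, not the paper's implementation: every function below is a trivial stand-in stub, and the real method learns transformations that close the visual and dynamics gaps.

```python
# Hedged skeleton of IDAPT's alternation between environment grounding
# and policy training. All names here are illustrative placeholders.

def ground(source_env, target_env, policy):
    # Stand-in for step (1): ground the source environment in the
    # target's visual and dynamics domains using rollouts of `policy`.
    return {"source": source_env, "target": target_env}

def train(grounded_env, policy):
    # Stand-in for step (2): RL training on the grounded source env.
    return policy + 1

def idapt(source_env, target_env, iterations=3):
    policy = 0  # stand-in for an initialized policy
    for _ in range(iterations):
        grounded = ground(source_env, target_env, policy)  # step (1)
        policy = train(grounded, policy)                   # step (2)
    return policy

print(idapt("sim", "real"))  # 3 after three stub training rounds
```

The key design point the abstract emphasizes is that grounding and training alternate, so each grounding round sees the current policy's behavior rather than a fixed one.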

Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning

5 code implementations • 1 Oct 2019 • Xue Bin Peng, Aviral Kumar, Grace Zhang, Sergey Levine

In this paper, we aim to develop a simple and scalable reinforcement learning algorithm that uses standard supervised learning methods as subroutines.

Continuous Control • OpenAI Gym +3

Advantage Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning

no code implementations • 25 Sep 2019 • Xue Bin Peng, Aviral Kumar, Grace Zhang, Sergey Levine

In this paper, we aim to develop a simple and scalable reinforcement learning algorithm that uses standard supervised learning methods as subroutines.

Continuous Control • OpenAI Gym +3

MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies

1 code implementation • NeurIPS 2019 • Xue Bin Peng, Michael Chang, Grace Zhang, Pieter Abbeel, Sergey Levine

In this work, we propose multiplicative compositional policies (MCP), a method for learning reusable motor skills that can be composed to produce a range of complex behaviors.

Continuous Control
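The multiplicative composition in MCP multiplies primitive policies raised to state-dependent gating weights; for Gaussian primitives, the product is again a Gaussian in closed form. The sketch below shows that closed form for a 1-D action; the two primitives, their parameters, and the fixed gating weights are illustrative assumptions.

```python
import numpy as np

def compose(mus, sigmas, ws):
    """Product of Gaussian primitives pi_i(a|s)^{w_i(s)} is Gaussian:
    composite precision is the weight-scaled sum of primitive
    precisions, and the mean is the precision-weighted combination."""
    precisions = ws / sigmas**2                  # w_i / sigma_i^2
    var = 1.0 / precisions.sum(axis=0)           # composite variance
    mu = var * (precisions * mus).sum(axis=0)    # composite mean
    return mu, np.sqrt(var)

mus = np.array([[1.0], [-1.0]])   # two primitives, 1-D action
sigmas = np.array([[1.0], [1.0]])
ws = np.array([[0.5], [0.5]])     # gating weights from a (stubbed) gate
mu, sigma = compose(mus, sigmas, ws)
print(mu, sigma)  # equal weights and variances -> mean 0.0
```

Only the gating weights change with the state at composition time, which is what lets the same primitives be reused across the range of behaviors the abstract describes.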
