Search Results for author: Huazhe Xu

Found 60 papers, 33 papers with code

Natural Language Object Retrieval

1 code implementation • CVPR 2016 • Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell

In this paper, we address the task of natural language object retrieval, to localize a target object within a given image based on a natural language query of the object.

Ranked #12 on Referring Expression Comprehension on Talk2Car

Image Captioning Image Retrieval +4

113

Paper
Code

End-to-end Learning of Driving Models from Large-scale Video Datasets

2 code implementations • CVPR 2017 • Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell

Robust perception-action models should be learned from training data with diverse visual appearances and realistic behaviors, yet current approaches to deep visuomotor policy learning have been generally limited to in-situ models learned from a single vehicle or a simulation environment.

Scene Segmentation

218

Paper
Code

Reinforcement Learning from Imperfect Demonstrations

no code implementations • ICLR 2018 • Yang Gao, Huazhe Xu, Ji Lin, Fisher Yu, Sergey Levine, Trevor Darrell

We propose a unified reinforcement learning algorithm, Normalized Actor-Critic (NAC), that effectively normalizes the Q-function, reducing the Q-values of actions unseen in the demonstration data.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees

2 code implementations • ICLR 2019 • Yuping Luo, Huazhe Xu, Yuanzhi Li, Yuandong Tian, Trevor Darrell, Tengyu Ma

Model-based reinforcement learning (RL) is considered to be a promising approach to reduce the sample complexity that hinders model-free RL.

Continuous Control Model-based Reinforcement Learning +3

Paper
Code

Modular Architecture for StarCraft II with Deep Reinforcement Learning

no code implementations • 8 Nov 2018 • Dennis Lee, Haoran Tang, Jeffrey O. Zhang, Huazhe Xu, Trevor Darrell, Pieter Abbeel

We present a novel modular architecture for StarCraft II AI.

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Add Code

Disentangling Propagation and Generation for Video Prediction

1 code implementation • ICCV 2019 • Hang Gao, Huazhe Xu, Qi-Zhi Cai, Ruth Wang, Fisher Yu, Trevor Darrell

A dynamic scene has two types of elements: those that move fluidly and can be predicted from previous frames, and those which are disoccluded (exposed) and cannot be extrapolated.

Predict Future Video Frames

Paper
Code

Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

1 code implementation • ICLR 2020 • Yuping Luo, Huazhe Xu, Tengyu Ma

Imitation learning, followed by reinforcement learning algorithms, is a promising paradigm to solve complex control tasks sample-efficiently.

Imitation Learning reinforcement-learning +1

Paper
Code

Composable Semi-parametric Modelling for Long-range Motion Generation

no code implementations • 25 Sep 2019 • Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Trevor Darrell

Learning diverse and natural behaviors is one of the longstanding goal for creating intelligent characters in the animated world.

Paper
Add Code

Scoring-Aggregating-Planning: Learning task-agnostic priors from interactions and sparse rewards for zero-shot generalization

no code implementations • 25 Sep 2019 • Huazhe Xu, Boyuan Chen, Yang Gao, Trevor Darrell

In this paper, we propose Scoring-Aggregating-Planning (SAP), a framework that can learn task-agnostic semantics and dynamics priors from arbitrary quality interactions as well as the corresponding sparse rewards and then plan on unseen tasks in zero-shot condition.

Zero-shot Generalization

Paper
Add Code

Zero-shot Policy Learning with Spatial Temporal RewardDecomposition on Contingency-aware Observation

1 code implementation • 17 Oct 2019 • Huazhe Xu, Boyuan Chen, Yang Gao, Trevor Darrell

The agent is first presented with previous experiences in the training environment, along with task description in the form of trajectory-level sparse rewards.

Continuous Control Model Predictive Control +2

Paper
Code

Multi-Task Reinforcement Learning with Soft Modularization

1 code implementation • NeurIPS 2020 • Ruihan Yang, Huazhe Xu, Yi Wu, Xiaolong Wang

While training multiple tasks jointly allow the policies to share parameters across different tasks, the optimization problem becomes non-trivial: It remains unclear what parameters in the network should be reused across tasks, and how the gradients from different tasks may interfere with each other.

Ranked #1 on Meta-Learning on MT50

Meta-Learning Multi-Task Learning +2

100

Paper
Code

Video Prediction via Example Guidance

1 code implementation • ICML 2020 • Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Trevor Darrell

In video prediction tasks, one major challenge is to capture the multi-modal nature of future contents and dynamics.

Video Prediction

Paper
Code

Hierarchical Style-based Networks for Motion Synthesis

no code implementations • ECCV 2020 • Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Xiaolong Wang, Trevor Darrell

Generating diverse and natural human motion is one of the long-standing goals for creating intelligent characters in the animated world.

Motion Synthesis

Paper
Add Code

Multi-Agent Collaboration via Reward Attribution Decomposition

2 code implementations • 16 Oct 2020 • Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian

In this work, we propose Collaborative Q-learning (CollaQ) that achieves state-of-the-art performance in the StarCraft multi-agent challenge and supports ad hoc team play.

Dota 2 Multi-agent Reinforcement Learning +2

2,539

Paper
Code

Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes

1 code implementation • CVPR 2021 • Jiashun Wang, Huazhe Xu, Jingwei Xu, Sifei Liu, Xiaolong Wang

Synthesizing 3D human motion plays an important role in many graphics applications as well as understanding human activity.

Motion Synthesis

Paper
Code

BeBold: Exploration Beyond the Boundary of Explored Regions

2 code implementations • 15 Dec 2020 • Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian

In this paper, we analyze the pros and cons of each method and propose the regulated difference of inverse visitation counts as a simple but effective criterion for IR.

Efficient Exploration NetHack

932

Paper
Code

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

2 code implementations • ICLR 2021 • Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu

We propose a simple, general and effective technique, Reward Randomization for discovering diverse strategic policies in complex multi-agent games.

Paper
Code

Solving Compositional Reinforcement Learning Problems via Task Reduction

1 code implementation • ICLR 2021 • Yunfei Li, Yilin Wu, Huazhe Xu, Xiaolong Wang, Yi Wu

We propose a novel learning paradigm, Self-Imitation via Reduction (SIR), for solving compositional reinforcement learning problems.

Continuous Control reinforcement-learning +1

Paper
Code

PyTouch: A Machine Learning Library for Touch Processing

1 code implementation • 26 May 2021 • Mike Lambeta, Huazhe Xu, Jingwei Xu, Po-Wei Chou, Shaoxiong Wang, Trevor Darrell, Roberto Calandra

With the increased availability of rich tactile sensors, there is an equally proportional need for open-source and integrated software capable of efficiently and effectively processing raw touch measurements into high-level signals that can be used for control and decision-making.

BIG-bench Machine Learning Decision Making +1

224

Paper
Code

Towards Learning to Play Piano with Dexterous Hands and Touch

1 code implementation • 3 Jun 2021 • Huazhe Xu, Yuping Luo, Shaoxiong Wang, Trevor Darrell, Roberto Calandra

The virtuoso plays the piano with passion, poetry and extraordinary technical ability.

Reinforcement Learning (RL)

Paper
Code

DAIR: Disentangled Attention Intrinsic Regularization for Safe and Efficient Bimanual Manipulation

no code implementations • 10 Jun 2021 • Minghao Zhang, Pingcheng Jian, Yi Wu, Huazhe Xu, Xiaolong Wang

We address the problem of safely solving complex bimanual robot manipulation tasks with sparse rewards.

Robot Manipulation

Paper
Add Code

Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers

1 code implementation • ICLR 2022 • Ruihan Yang, Minghao Zhang, Nicklas Hansen, Huazhe Xu, Xiaolong Wang

Our key insight is that proprioceptive states only offer contact measurements for immediate reaction, whereas an agent equipped with visual sensory observations can learn to proactively maneuver environments with obstacles and uneven terrain by anticipating changes in the environment many steps ahead.

Reinforcement Learning (RL)

196

Paper
Code

Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification

1 code implementation • 22 Nov 2021 • Ling Pan, Longbo Huang, Tengyu Ma, Huazhe Xu

Conservatism has led to significant progress in offline reinforcement learning (RL) where an agent learns from pre-collected datasets.

Continuous Control Multi-agent Reinforcement Learning +3

Paper
Code

Multi-Person 3D Motion Prediction with Multi-Range Transformers

1 code implementation • NeurIPS 2021 • Jiashun Wang, Huazhe Xu, Medhini Narasimhan, Xiaolong Wang

Thus, instead of predicting each human pose trajectory in isolation, we introduce a Multi-Range Transformers model which contains of a local-range encoder for individual motion and a global-range encoder for social interactions.

Ranked #3 on Multi-Person Pose forecasting on Expi - common actions split

motion prediction Multi-Person Pose forecasting +1

Paper
Code

NovelD: A Simple yet Effective Exploration Criterion

1 code implementation • NeurIPS 2021 • Tianjun Zhang, Huazhe Xu, Xiaolong Wang, Yi Wu, Kurt Keutzer, Joseph E. Gonzalez, Yuandong Tian

We analyze NovelD thoroughly in MiniGrid and found that empirically it helps the agent explore the environment more uniformly with a focus on exploring beyond the boundary.

Efficient Exploration Montezuma's Revenge +1

Paper
Code

Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning

1 code implementation • 21 Feb 2022 • Zhecheng Yuan, Guozheng Ma, Yao Mu, Bo Xia, Bo Yuan, Xueqian Wang, Ping Luo, Huazhe Xu

One of the key challenges in visual Reinforcement Learning (RL) is to learn policies that can generalize to unseen environments.

Data Augmentation Reinforcement Learning (RL)

Paper
Code

RoboCraft: Learning to See, Simulate, and Shape Elasto-Plastic Objects with Graph Networks

no code implementations • 5 May 2022 • Haochen Shi, Huazhe Xu, Zhiao Huang, Yunzhu Li, Jiajun Wu

Our learned model-based planning framework is comparable to and sometimes better than human subjects on the tested tasks.

Model Predictive Control

Paper
Add Code

Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning

no code implementations • 24 Jun 2022 • Yunfei Li, Tian Gao, Jiaqi Yang, Huazhe Xu, Yi Wu

It has been a recent trend to leverage the power of supervised learning (SL) towards more effective reinforcement learning (RL) methods.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

USEEK: Unsupervised SE(3)-Equivariant 3D Keypoints for Generalizable Manipulation

no code implementations • 28 Sep 2022 • Zhengrong Xue, Zhecheng Yuan, Jiashun Wang, Xueqian Wang, Yang Gao, Huazhe Xu

Can a robot manipulate intra-category unseen objects in arbitrary poses with the help of a mere demonstration of grasping pose on a single object instance?

Keypoint Detection Object

Paper
Add Code

Extraneousness-Aware Imitation Learning

no code implementations • 4 Oct 2022 • Ray Chen Zheng, Kaizhe Hu, Zhecheng Yuan, Boyuan Chen, Huazhe Xu

To tackle this problem, we introduce Extraneousness-Aware Imitation Learning (EIL), a self-supervised approach that learns visuomotor policies from third-person demonstrations with extraneous subsequences.

Imitation Learning

Paper
Add Code

Simple Emergent Action Representations from Multi-Task Policy Training

no code implementations • 18 Oct 2022 • Pu Hua, Yubei Chen, Huazhe Xu

The low-level sensory and motor signals in deep reinforcement learning, which exist in high-dimensional spaces such as image observations or motor torques, are inherently challenging to understand or utilize directly for downstream tasks.

Paper
Add Code

Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation

no code implementations • 24 Oct 2022 • Linfeng Zhao, Huazhe Xu, Lawson L. S. Wong

To alleviate this issue, we propose to differentiate through the Bellman fixed-point equation to decouple forward and backward passes for Value Iteration Network and its variants, which enables constant backward cost (in planning horizon) and flexible forward budget and helps scale up to large tasks.

Visual Navigation

Paper
Add Code

E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance

no code implementations • 5 Dec 2022 • Can Chang, Ni Mu, Jiajun Wu, Ling Pan, Huazhe Xu

Specifically, we introduce Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance(E-MAPP), a novel framework that leverages parallel programs to guide multiple agents to efficiently accomplish goals that require planning over $10+$ stages.

Multi-agent Reinforcement Learning reinforcement-learning +2

Paper
Add Code

See, Hear, and Feel: Smart Sensory Fusion for Robotic Manipulation

no code implementations • 7 Dec 2022 • Hao Li, Yizhi Zhang, Junzhe Zhu, Shaoxiong Wang, Michelle A Lee, Huazhe Xu, Edward Adelson, Li Fei-Fei, Ruohan Gao, Jiajun Wu

Humans use all of their senses to accomplish different tasks in everyday activities.

Decision Making

Paper
Add Code

On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline

1 code implementation • 12 Dec 2022 • Nicklas Hansen, Zhecheng Yuan, Yanjie Ze, Tongzhou Mu, Aravind Rajeswaran, Hao Su, Huazhe Xu, Xiaolong Wang

In this paper, we examine the effectiveness of pre-training for visuo-motor control tasks.

Benchmarking Data Augmentation

Paper
Code

Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning

no code implementations • 17 Dec 2022 • Zhecheng Yuan, Zhengrong Xue, Bo Yuan, Xueqian Wang, Yi Wu, Yang Gao, Huazhe Xu

Hence, we propose Pre-trained Image Encoder for Generalizable visual reinforcement learning (PIE-G), a simple yet effective framework that can generalize to the unseen visual scenarios in a zero-shot manner.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Scene Synthesis from Human Motion

no code implementations • 4 Jan 2023 • Sifan Ye, Yixing Wang, Jiaman Li, Dennis Park, C. Karen Liu, Huazhe Xu, Jiajun Wu

Large-scale capture of human motion with diverse, complex scenes, while immensely useful, is often considered prohibitively costly.

Ranked #3 on 3D Semantic Scene Completion on PRO-teXt

2D Semantic Segmentation task 1 (8 classes) 3D Semantic Scene Completion +1

Paper
Add Code

Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function

no code implementations • 2 Feb 2023 • Ruijie Zheng, Xiyao Wang, Huazhe Xu, Furong Huang

To test this hypothesis, we devise two practical robust training mechanisms through computing the adversarial noise and regularizing the value network's spectral norm to directly regularize the Lipschitz condition of the value functions.

Model-based Reinforcement Learning

Paper
Add Code

Decision Transformer under Random Frame Dropping

1 code implementation • 3 Mar 2023 • Kaizhe Hu, Ray Chen Zheng, Yang Gao, Huazhe Xu

Typical RL methods usually require considerable online interaction data that are costly and unsafe to collect in the real world.

Offline RL

Paper
Code

Efficient bimanual handover and rearrangement via symmetry-aware actor-critic learning

1 code implementation • IEEE International Conference on Robotics and Automation (ICRA) 2023 • Yunfei Li;, Chaoyi Pan, Huazhe Xu, Xiaolong Wang, Yi Wu

We develop a symmetry-aware actor-critic framework that leverages the interchangeable roles of the two manipulators in the bimanual control setting to reduce the policy search space.

Reinforcement Learning (RL)

Paper
Code

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

no code implementations • 5 Jun 2023 • Tianying Ji, Yu Luo, Fuchun Sun, Xianyuan Zhan, Jianwei Zhang, Huazhe Xu

Learning high-quality Q-value functions plays a key role in the success of many modern off-policy deep reinforcement learning (RL) algorithms.

Continuous Control Reinforcement Learning (RL)

Paper
Add Code

TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning

1 code implementation • 22 Jun 2023 • Ruijie Zheng, Xiyao Wang, Yanchao Sun, Shuang Ma, Jieyu Zhao, Huazhe Xu, Hal Daumé III, Furong Huang

Despite recent progress in reinforcement learning (RL) from raw pixel data, sample inefficiency continues to present a substantial obstacle.

Continuous Control Contrastive Learning +3

Paper
Code

ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch

no code implementations • 29 Jun 2023 • Zhengrong Xue, Han Zhang, Jingwen Cheng, Zhengmao He, Yuanchen Ju, Changyi Lin, Gu Zhang, Huazhe Xu

We present ArrayBot, a distributed manipulation system consisting of a $16 \times 16$ array of vertically sliding pillars integrated with tactile sensors, which can simultaneously support, perceive, and manipulate the tabletop objects.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization

1 code implementation • NeurIPS 2023 • Zhecheng Yuan, Sizhe Yang, Pu Hua, Can Chang, Kaizhe Hu, Huazhe Xu

Visual Reinforcement Learning (Visual RL), coupled with high-dimensional observations, has consistently confronted the long-standing challenge of out-of-distribution generalization.

Out-of-Distribution Generalization reinforcement-learning

Paper
Code

OTAS: Unsupervised Boundary Detection for Object-Centric Temporal Action Segmentation

1 code implementation • 12 Sep 2023 • Yuerong Li, Zhengrong Xue, Huazhe Xu

In this paper, we explore the merits of local features by proposing the unsupervised framework of Object-centric Temporal Action Segmentation (OTAS).

Action Segmentation Boundary Detection +1

Paper
Code

GenSim: Generating Robotic Simulation Tasks via Large Language Models

1 code implementation • 2 Oct 2023 • Lirui Wang, Yiyang Ling, Zhecheng Yuan, Mohit Shridhar, Chen Bao, Yuzhe Qin, Bailin Wang, Huazhe Xu, Xiaolong Wang

Collecting large amounts of real-world interaction data to train general robotic policies is often prohibitively expensive, thus motivating the use of simulation data.

Code Generation

231

Paper
Code

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

no code implementations • 11 Oct 2023 • Xiyao Wang, Ruijie Zheng, Yanchao Sun, Ruonan Jia, Wichayaporn Wongkamjan, Huazhe Xu, Furong Huang

In this paper, we propose $\texttt{COPlanner}$, a planning-driven framework for model-based methods to address the inaccurately learned dynamics model problem with conservative model rollouts and optimistic environment exploration.

Continuous Control Model-based Reinforcement Learning +1

Paper
Add Code

DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization

2 code implementations • 30 Oct 2023 • Guowei Xu, Ruijie Zheng, Yongyuan Liang, Xiyao Wang, Zhecheng Yuan, Tianying Ji, Yu Luo, Xiaoyu Liu, Jiaxin Yuan, Pu Hua, Shuzhen Li, Yanjie Ze, Hal Daumé III, Furong Huang, Huazhe Xu

To quantify this inactivity, we adopt dormant ratio as a metric to measure inactivity in the RL agent's network.

Continuous Control reinforcement-learning +1

Paper
Code

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

1 code implementation • 31 Oct 2023 • Ruizhe Shi, Yuyao Liu, Yanjie Ze, Simon S. Du, Huazhe Xu

Offline reinforcement learning (RL) aims to find a near-optimal policy using pre-collected datasets.

Few-Shot Learning Offline RL +1

Paper
Code

Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization

no code implementations • 6 Nov 2023 • Kun Lei, Zhengmao He, Chenhao Lu, Kaizhe Hu, Yang Gao, Huazhe Xu

Owning to the alignment of objectives in two phases, the RL agent can transfer between offline and online learning seamlessly.

Reinforcement Learning (RL)

Paper
Add Code

Diffusion Reward: Learning Rewards via Conditional Video Diffusion

no code implementations • 21 Dec 2023 • Tao Huang, Guangqi Jiang, Yanjie Ze, Huazhe Xu

Learning rewards from expert videos offers an affordable and effective solution to specify the intended behaviors for reinforcement learning tasks.

Paper
Add Code

Generalizable Visual Reinforcement Learning with Segment Anything Model

1 code implementation • 28 Dec 2023 • Ziyu Wang, Yanjie Ze, Yifei Sun, Zhecheng Yuan, Huazhe Xu

Learning policies that can generalize to unseen environments is a fundamental challenge in visual reinforcement learning (RL).

Data Augmentation reinforcement-learning +1

Paper
Code

Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation

no code implementations • 15 Jan 2024 • Yuanchen Ju, Kaizhe Hu, Guowei Zhang, Gu Zhang, Mingrun Jiang, Huazhe Xu

The next step is to map the contact points of the retrieved objects to the new object.

Robot Manipulation Semantic correspondence

Paper
Add Code

DittoGym: Learning to Control Soft Shape-Shifting Robots

2 code implementations • 24 Jan 2024 • Suning Huang, Boyuan Chen, Huazhe Xu, Vincent Sitzmann

Inspired by nature and recent novel robot designs, we propose to go a step further and explore the novel reconfigurable robots, defined as robots that can change their morphology within their lifetime.

Reinforcement Learning (RL)

Paper
Code

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

1 code implementation • 9 Feb 2024 • Ruijie Zheng, Yongyuan Liang, Xiyao Wang, Shuang Ma, Hal Daumé III, Huazhe Xu, John Langford, Praveen Palanisamy, Kalyan Shankar Basu, Furong Huang

We present Premier-TACO, a multitask feature representation learning approach designed to improve few-shot policy learning efficiency in sequential decision-making tasks.

Computational Efficiency Continuous Control +4

Paper
Code

RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation

no code implementations • 22 Feb 2024 • Junting Chen, Yao Mu, Qiaojun Yu, Tianming Wei, Silang Wu, Zhecheng Yuan, Zhixuan Liang, Chao Yang, Kaipeng Zhang, Wenqi Shao, Yu Qiao, Huazhe Xu, Mingyu Ding, Ping Luo

To bridge this ``ideal-to-real'' gap, this paper presents \textbf{RobotScript}, a platform for 1) a deployable robot manipulation pipeline powered by code generation; and 2) a code generation benchmark for robot manipulation tasks in free-form natural language.

Code Generation Common Sense Reasoning +2

Paper
Add Code

ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

no code implementations • 22 Feb 2024 • Tianying Ji, Yongyuan Liang, Yan Zeng, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun, Huazhe Xu

The varying significance of distinct primitive behaviors during the policy learning process has been overlooked by prior model-free RL algorithms.

Continuous Control Efficient Exploration

Paper
Add Code

3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

1 code implementation • 6 Mar 2024 • Yanjie Ze, Gu Zhang, Kangning Zhang, Chenyuan Hu, Muhan Wang, Huazhe Xu

Imitation learning provides an efficient way to teach robots dexterous skills; however, learning complex skills robustly and generalizablely usually consumes large amounts of human demonstrations.

Imitation Learning

173

Paper
Code

RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation

no code implementations • 28 Mar 2024 • Chongkai Gao, Zhengrong Xue, Shuying Deng, Tianhai Liang, Siqi Yang, Lin Shao, Huazhe Xu

RiEMann learns a manipulation task from scratch with 5 to 10 demonstrations, generalizes to unseen SE(3) transformations and instances of target objects, resists visual interference of distracting objects, and follows the near real-time pose change of the target object.

Imitation Learning Object +3

Paper
Add Code

Learning Visual Quadrupedal Loco-Manipulation from Demonstrations

no code implementations • 29 Mar 2024 • Zhengmao He, Kun Lei, Yanjie Ze, Koushil Sreenath, Zhongyu Li, Huazhe Xu

Our approach is validated through simulations and real-world experiments, demonstrating the robot's ability to perform tasks that demand mobility and high precision, such as lifting a basket from the ground while moving, closing a dishwasher, pressing a button, and pushing a door.

Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.