Search Results for author: Yanjie Ze

Found 12 papers, 9 papers with code

Learning Visual Quadrupedal Loco-Manipulation from Demonstrations

no code implementations • 29 Mar 2024 • Zhengmao He, Kun Lei, Yanjie Ze, Koushil Sreenath, Zhongyu Li, Huazhe Xu

Our approach is validated through simulations and real-world experiments, demonstrating the robot's ability to perform tasks that demand mobility and high precision, such as lifting a basket from the ground while moving, closing a dishwasher, pressing a button, and pushing a door.

Reinforcement Learning (RL)

3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

1 code implementation • 6 Mar 2024 • Yanjie Ze, Gu Zhang, Kangning Zhang, Chenyuan Hu, Muhan Wang, Huazhe Xu

Imitation learning provides an efficient way to teach robots dexterous skills; however, learning complex skills robustly and generalizably usually requires large amounts of human demonstrations.

Imitation Learning

Generalizable Visual Reinforcement Learning with Segment Anything Model

1 code implementation • 28 Dec 2023 • Ziyu Wang, Yanjie Ze, Yifei Sun, Zhecheng Yuan, Huazhe Xu

Learning policies that can generalize to unseen environments is a fundamental challenge in visual reinforcement learning (RL).

Data Augmentation, reinforcement-learning

Diffusion Reward: Learning Rewards via Conditional Video Diffusion

no code implementations • 21 Dec 2023 • Tao Huang, Guangqi Jiang, Yanjie Ze, Huazhe Xu

Learning rewards from expert videos offers an affordable and effective solution to specify the intended behaviors for reinforcement learning tasks.

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

1 code implementation • 31 Oct 2023 • Ruizhe Shi, Yuyao Liu, Yanjie Ze, Simon S. Du, Huazhe Xu

Offline reinforcement learning (RL) aims to find a near-optimal policy using pre-collected datasets.

Few-Shot Learning, Offline RL

DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization

2 code implementations • 30 Oct 2023 • Guowei Xu, Ruijie Zheng, Yongyuan Liang, Xiyao Wang, Zhecheng Yuan, Tianying Ji, Yu Luo, Xiaoyu Liu, Jiaxin Yuan, Pu Hua, Shuzhen Li, Yanjie Ze, Hal Daumé III, Furong Huang, Huazhe Xu

To quantify this inactivity, we adopt dormant ratio as a metric to measure inactivity in the RL agent's network.

Continuous Control, reinforcement-learning

GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields

1 code implementation • 31 Aug 2023 • Yanjie Ze, Ge Yan, Yueh-Hua Wu, Annabella Macaluso, Yuying Ge, Jianglong Ye, Nicklas Hansen, Li Erran Li, Xiaolong Wang

To incorporate semantics in 3D, the reconstruction module utilizes a vision-language foundation model (e.g., Stable Diffusion) to distill rich semantic information into the deep 3D voxel.

Decision Making

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

1 code implementation • 19 Aug 2023 • Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li

Communication lays the foundation for cooperation in human society and in multi-agent reinforcement learning (MARL).

Multi-agent Reinforcement Learning, Privacy Preserving

On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline

1 code implementation • 12 Dec 2022 • Nicklas Hansen, Zhecheng Yuan, Yanjie Ze, Tongzhou Mu, Aravind Rajeswaran, Hao Su, Huazhe Xu, Xiaolong Wang

In this paper, we examine the effectiveness of pre-training for visuo-motor control tasks.

Benchmarking, Data Augmentation

Visual Reinforcement Learning with Self-Supervised 3D Representations

1 code implementation • 13 Oct 2022 • Yanjie Ze, Nicklas Hansen, Yinbo Chen, Mohit Jain, Xiaolong Wang

A prominent approach to visual Reinforcement Learning (RL) is to learn an internal state representation using self-supervised methods, which has the potential benefit of improved sample-efficiency and generalization through additional learning signal and inductive biases.

reinforcement-learning, Reinforcement Learning (RL)

Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization

no code implementations • 25 Jan 2022 • Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li

Temporal difference (TD) learning is a widely used method to evaluate policies in reinforcement learning.

OpenAI Gym

UKPGAN: A General Self-Supervised Keypoint Detector

1 code implementation • CVPR 2022 • Yang You, Wenhai Liu, Yanjie Ze, Yong-Lu Li, Weiming Wang, Cewu Lu

Keypoint detection is an essential component of object registration and alignment.

Keypoint Detection Object

