Search Results for author: Hongtao Wu

Found 18 papers, 11 papers with code

Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization

1 code implementation10 Oct 2024 Hongtao Wu, Yijun Yang, Angelica I Aviles-Rivero, Jingjing Ren, Sixiang Chen, Haoyu Chen, Lei Zhu

Specifically, we construct a real-world dataset with 85 snowy videos, and then present a Semi-supervised Video Desnowing Network (SemiVDN) equipped by a novel Distribution-driven Contrastive Regularization.

Ranked #2 on Snow Removal on RVSD (using extra training data)

Snow Removal

World Model-based Perception for Visual Legged Locomotion

no code implementations25 Sep 2024 Hang Lai, Jiahang Cao, Jiafeng Xu, Hongtao Wu, Yunfeng Lin, Tao Kong, Yong Yu, Weinan Zhang

To address this issue, traditional methods attempt to learn a teacher policy with access to privileged information first and then learn a student policy to imitate the teacher's behavior with visual input.

GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal-Conditioned Policy

1 code implementation26 Aug 2024 Peiyan Li, Hongtao Wu, Yan Huang, Chilam Cheang, Liang Wang, Tao Kong

During training, GR-MG samples goal images from trajectories and conditions on both the text and the goal image or solely on the image when text is not available.

 Ranked #1 on Zero-shot Generalization on CALVIN (using extra training data)

Few-Shot Learning Image Generation +1

RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining

1 code implementation31 Jul 2024 Hongtao Wu, Yijun Yang, Huihui Xu, Weiming Wang, Jinni Zhou, Lei Zhu

Recently, the linear-complexity operator of the state space models (SSMs) has contrarily facilitated efficient long-term temporal modeling, which is crucial for rain streaks and raindrops removal in videos.

Optical Flow Estimation Rain Removal +3

IRASim: Learning Interactive Real-Robot Action Simulators

no code implementations20 Jun 2024 Fangqi Zhu, Hongtao Wu, Song Guo, Yuxiao Liu, Chilam Cheang, Tao Kong

We hope that IRASim can serve as an effective and scalable approach to enhance robot learning in the real world.

Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal

1 code implementation CVPR 2024 Yijun Yang, Hongtao Wu, Angelica I. Aviles-Rivero, Yulun Zhang, Jing Qin, Lei Zhu

Although ViWS-Net is proposed to remove adverse weather conditions in videos with a single set of pre-trained weights, it is seriously blinded by seen weather at train-time and degenerates when coming to unseen weather during test-time.

Test-time Adaptation

Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation

3 code implementations20 Dec 2023 Hongtao Wu, Ya Jing, Chilam Cheang, Guangzeng Chen, Jiafeng Xu, Xinghang Li, Minghuan Liu, Hang Li, Tao Kong

In this paper, we extend the scope of this effectiveness by showing that visual robot manipulation can significantly benefit from large-scale video generative pre-training.

Ranked #5 on Zero-shot Generalization on CALVIN (using extra training data)

Robot Manipulation Zero-shot Generalization

Vision-Language Foundation Models as Effective Robot Imitators

no code implementations2 Nov 2023 Xinghang Li, Minghuan Liu, Hanbo Zhang, Cunjun Yu, Jie Xu, Hongtao Wu, Chilam Cheang, Ya Jing, Weinan Zhang, Huaping Liu, Hang Li, Tao Kong

We believe RoboFlamingo has the potential to be a cost-effective and easy-to-use solution for robotics manipulation, empowering everyone with the ability to fine-tune their own robotics policy.

Imitation Learning

Transporters with Visual Foresight for Solving Unseen Rearrangement Tasks

no code implementations22 Feb 2022 Hongtao Wu, Jikai Ye, Xin Meng, Chris Paxton, Gregory Chirikjian

We propose a visual foresight model for pick-and-place rearrangement manipulation which is able to learn efficiently.

Imitation Learning Multi-Task Learning +1

Put the Bear on the Chair! Intelligent Robot Interaction with Previously Unseen Chairs via Robot Imagination

no code implementations12 Aug 2021 Hongtao Wu, Xin Meng, Sipu Ruan, Gregory Chirikjian

Results show that our method enables the robot to autonomously seat the teddy bear on the 12 previously unseen chairs with a very high success rate.

Motion Planning

LSG-CPD: Coherent Point Drift with Local Surface Geometry for Point Cloud Registration

1 code implementation ICCV 2021 Weixiao Liu, Hongtao Wu, Gregory Chirikjian

In this paper, we propose a novel method called CPD with Local Surface Geometry (LSG-CPD) for rigid point cloud registration.

Point Cloud Registration

Can I Pour into It? Robot Imagining Open Containability Affordance of Previously Unseen Objects via Physical Simulations

1 code implementation5 Aug 2020 Hongtao Wu, Gregory S. Chirikjian

In this letter, we propose a novel method for robots to "imagine" the open containability affordance of a previously unseen object via physical simulations.

Binary Classification Classification +3

"Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer

1 code implementation25 Sep 2019 Andrew Hundt, Benjamin Killeen, Nicholas Greene, Hongtao Wu, Heeyeon Kwon, Chris Paxton, Gregory D. Hager

We are able to create real stacks in 100% of trials with 61% efficiency and real rows in 100% of trials with 59% efficiency by directly loading the simulation-trained model on the real robot with no additional real-world fine-tuning.

reinforcement-learning Reinforcement Learning +1

Is That a Chair? Imagining Affordances Using Simulations of an Articulated Human Body

1 code implementation17 Sep 2019 Hongtao Wu, Deven Misra, Gregory S. Chirikjian

In our method, the robot "imagines" the affordance of an arbitrarily oriented object as a chair by simulating a physical sitting interaction between an articulated human body and the object.

Object Physical Simulations

Cannot find the paper you are looking for? You can Submit a new open access paper.