1 code implementation • 10 Oct 2024 • Hongtao Wu, Yijun Yang, Angelica I Aviles-Rivero, Jingjing Ren, Sixiang Chen, Haoyu Chen, Lei Zhu
Specifically, we construct a real-world dataset with 85 snowy videos, and then present a Semi-supervised Video Desnowing Network (SemiVDN) equipped by a novel Distribution-driven Contrastive Regularization.
Ranked #2 on Snow Removal on RVSD (using extra training data)
no code implementations • 8 Oct 2024 • Chi-Lam Cheang, Guangzeng Chen, Ya Jing, Tao Kong, Hang Li, Yifeng Li, Yuxiao Liu, Hongtao Wu, Jiafeng Xu, Yichu Yang, Hanbo Zhang, Minzhao Zhu
We present GR-2, a state-of-the-art generalist robot agent for versatile and generalizable robot manipulation.
no code implementations • 25 Sep 2024 • Hang Lai, Jiahang Cao, Jiafeng Xu, Hongtao Wu, Yunfeng Lin, Tao Kong, Yong Yu, Weinan Zhang
To address this issue, traditional methods attempt to learn a teacher policy with access to privileged information first and then learn a student policy to imitate the teacher's behavior with visual input.
1 code implementation • 26 Aug 2024 • Peiyan Li, Hongtao Wu, Yan Huang, Chilam Cheang, Liang Wang, Tao Kong
During training, GR-MG samples goal images from trajectories and conditions on both the text and the goal image or solely on the image when text is not available.
Ranked #1 on Zero-shot Generalization on CALVIN (using extra training data)
1 code implementation • 31 Jul 2024 • Hongtao Wu, Yijun Yang, Huihui Xu, Weiming Wang, Jinni Zhou, Lei Zhu
Recently, the linear-complexity operator of the state space models (SSMs) has contrarily facilitated efficient long-term temporal modeling, which is crucial for rain streaks and raindrops removal in videos.
Ranked #1 on Video deraining on Video Waterdrop Removal Dataset
no code implementations • 20 Jun 2024 • Fangqi Zhu, Hongtao Wu, Song Guo, Yuxiao Liu, Chilam Cheang, Tao Kong
We hope that IRASim can serve as an effective and scalable approach to enhance robot learning in the real world.
1 code implementation • CVPR 2024 • Yijun Yang, Hongtao Wu, Angelica I. Aviles-Rivero, Yulun Zhang, Jing Qin, Lei Zhu
Although ViWS-Net is proposed to remove adverse weather conditions in videos with a single set of pre-trained weights, it is seriously blinded by seen weather at train-time and degenerates when coming to unseen weather during test-time.
3 code implementations • 20 Dec 2023 • Hongtao Wu, Ya Jing, Chilam Cheang, Guangzeng Chen, Jiafeng Xu, Xinghang Li, Minghuan Liu, Hang Li, Tao Kong
In this paper, we extend the scope of this effectiveness by showing that visual robot manipulation can significantly benefit from large-scale video generative pre-training.
Ranked #5 on Zero-shot Generalization on CALVIN (using extra training data)
no code implementations • 2 Nov 2023 • Xinghang Li, Minghuan Liu, Hanbo Zhang, Cunjun Yu, Jie Xu, Hongtao Wu, Chilam Cheang, Ya Jing, Weinan Zhang, Huaping Liu, Hang Li, Tao Kong
We believe RoboFlamingo has the potential to be a cost-effective and easy-to-use solution for robotics manipulation, empowering everyone with the ability to fine-tune their own robotics policy.
no code implementations • 7 Aug 2023 • Taozheng Yang, Ya Jing, Hongtao Wu, Jiafeng Xu, Kuankuan Sima, Guangzeng Chen, Qie Sima, Tao Kong
In this paper, we present a novel method for mobile manipulators to perform multiple contact-rich manipulation tasks.
1 code implementation • ICCV 2023 • Haoyu Chen, Jingjing Ren, Jinjin Gu, Hongtao Wu, Xuequan Lu, Haoming Cai, Lei Zhu
We also develop a deep learning framework for video snow removal.
Ranked #4 on Snow Removal on RVSD
no code implementations • 22 Feb 2022 • Hongtao Wu, Jikai Ye, Xin Meng, Chris Paxton, Gregory Chirikjian
We propose a visual foresight model for pick-and-place rearrangement manipulation which is able to learn efficiently.
no code implementations • 12 Aug 2021 • Hongtao Wu, Xin Meng, Sipu Ruan, Gregory Chirikjian
Results show that our method enables the robot to autonomously seat the teddy bear on the 12 previously unseen chairs with a very high success rate.
1 code implementation • ICCV 2021 • Weixiao Liu, Hongtao Wu, Gregory Chirikjian
In this paper, we propose a novel method called CPD with Local Surface Geometry (LSG-CPD) for rigid point cloud registration.
1 code implementation • 5 Aug 2020 • Hongtao Wu, Gregory S. Chirikjian
In this letter, we propose a novel method for robots to "imagine" the open containability affordance of a previously unseen object via physical simulations.
1 code implementation • ACL 2020 • Canwen Xu, Jiaxin Pei, Hongtao Wu, Yiyu Liu, Chenliang Li
Recently, large-scale datasets have vastly facilitated the development in nearly all domains of Natural Language Processing.
1 code implementation • 25 Sep 2019 • Andrew Hundt, Benjamin Killeen, Nicholas Greene, Hongtao Wu, Heeyeon Kwon, Chris Paxton, Gregory D. Hager
We are able to create real stacks in 100% of trials with 61% efficiency and real rows in 100% of trials with 59% efficiency by directly loading the simulation-trained model on the real robot with no additional real-world fine-tuning.
1 code implementation • 17 Sep 2019 • Hongtao Wu, Deven Misra, Gregory S. Chirikjian
In our method, the robot "imagines" the affordance of an arbitrarily oriented object as a chair by simulating a physical sitting interaction between an articulated human body and the object.