1 code implementation • 18 Dec 2024 • Tongzhou Mu, Zhaoyang Li, Stanisław Wiktor Strzelecki, Xiu Yuan, Yunchao Yao, Litian Liang, Hao Su
Visual reinforcement learning is a promising approach that directly trains policies from visual observations, although it faces challenges in sample efficiency and computational costs.
no code implementations • 18 Dec 2024 • Xiu Yuan, Tongzhou Mu, Stone Tao, Yunhao Fang, Mengke Zhang, Hao Su
Recent advancements in robot learning have used imitation learning with large models and extensive demonstrations to develop effective policies.
1 code implementation • 9 Dec 2024 • Arth Shukla, Stone Tao, Hao Su
To this end, we present MS-HAB, a holistic benchmark for low-level manipulation and in-home object rearrangement.
1 code implementation • 24 Oct 2024 • Hansheng Chen, Bokui Shen, Yulin Liu, Ruoxi Shi, Linqi Zhou, Connor Z. Lin, Jiayuan Gu, Hao Su, Gordon Wetzstein, Leonidas Guibas
Multi-view image diffusion models have significantly advanced open-domain 3D object generation.
no code implementations • 17 Oct 2024 • Shangzhe Li, Zhiao Huang, Hao Su
By employing a learned latent dynamics model and planning for control, our approach consistently achieves stable, expert-level performance in tasks with high-dimensional observation or action spaces and intricate dynamics.
1 code implementation • 1 Oct 2024 • Stone Tao, Fanbo Xiang, Arth Shukla, Yuzhe Qin, Xander Hinrichsen, Xiaodi Yuan, Chen Bao, Xinsong Lin, Yulin Liu, Tse-kai Chan, Yuan Gao, Xuanlin Li, Tongzhou Mu, Nan Xiao, Arnav Gurha, Zhiao Huang, Roberto Calandra, Rui Chen, Shan Luo, Hao Su
We introduce and open source ManiSkill3, the fastest state-visual GPU parallelized robotics simulator with contact-rich physics targeting generalizable manipulation.
no code implementations • 30 Aug 2024 • Xiaoshuai Zhang, Zhicheng Wang, Howard Zhou, Soham Ghosh, Danushen Gnanapragasam, Varun Jampani, Hao Su, Leonidas Guibas
To advance the state of the art in the creation of 3D foundation models, this paper introduces the ConDense framework for 3D pre-training utilizing existing pre-trained 2D networks and large-scale multi-view datasets.
no code implementations • 19 Aug 2024 • Chao Xu, Ang Li, Linghao Chen, Yulin Liu, Ruoxi Shi, Hao Su, Minghua Liu
The diffusion model is trained to jointly predict surrogate representations for camera poses and multi-view images of the object under known poses, integrating all information from the input sparse views.
no code implementations • 19 Aug 2024 • Minghua Liu, Chong Zeng, Xinyue Wei, Ruoxi Shi, Linghao Chen, Chao Xu, Mengqi Zhang, Zhaoning Wang, Xiaoshuai Zhang, Isabella Liu, Hongzhi Wu, Hao Su
The input normal maps can be predicted by 2D diffusion models, significantly aiding in the guidance and refinement of the geometry's learning.
1 code implementation • 28 Jun 2024 • Xinghao Wu, Xuefeng Liu, Jianwei Niu, Haolin Wang, Shaojie Tang, Guogang Zhu, Hao Su
Existing PFL methods primarily adopt a parameter partitioning approach, where the parameters of a model are designated as one of two types: parameters shared with other clients to extract general knowledge and parameters retained locally to learn client-specific knowledge.
1 code implementation • 25 Jun 2024 • Yuchen Zhou, Jiayuan Gu, Tung Yen Chiang, Fanbo Xiang, Hao Su
The development of 2D foundation models for image segmentation has been significantly advanced by the Segment Anything Model (SAM).
1 code implementation • 30 May 2024 • Guogang Zhu, Xuefeng Liu, Xinghao Wu, Shaojie Tang, Chao Tang, Jianwei Niu, Hao Su
Federated Semi-Supervised Learning (FSSL) leverages both labeled and unlabeled data on clients to collaboratively train a model. In FSSL, the heterogeneous data can introduce prediction bias into the model, causing the model's prediction to skew towards some certain classes.
no code implementations • 28 May 2024 • Nicklas Hansen, Jyothir S V, Vlad Sobal, Yann Lecun, Xiaolong Wang, Hao Su
Whole-body control for humanoids is challenging due to the high-dimensional nature of the problem, coupled with the inherent instability of a bipedal morphology.
no code implementations • 23 May 2024 • Yutao Feng, Yintong Shang, Xiang Feng, Lei Lan, Shandian Zhe, Tianjia Shao, Hongzhi Wu, Kun Zhou, Hao Su, Chenfanfu Jiang, Yin Yang
We present ElastoGen, a knowledge-driven AI model that generates physically accurate 4D elastodynamics.
1 code implementation • 9 May 2024 • Xuanlin Li, Kyle Hsu, Jiayuan Gu, Karl Pertsch, Oier Mees, Homer Rich Walke, Chuyuan Fu, Ishikaa Lunawat, Isabel Sieh, Sean Kirmani, Sergey Levine, Jiajun Wu, Chelsea Finn, Hao Su, Quan Vuong, Ted Xiao
We then employ these approaches to create SIMPLER, a collection of simulated environments for manipulation policy evaluation on common real robot setups.
1 code implementation • 6 May 2024 • Stone Tao, Arth Shukla, Tse-kai Chan, Hao Su
A forward curriculum is then used to accelerate the training of the initial policy to perform well on the full initial state distribution of the task and improve demonstration and sample efficiency.
no code implementations • 6 May 2024 • Kaiwen Jiang, Yang Fu, Mukund Varma T, Yash Belhe, Xiaolong Wang, Hao Su, Ravi Ramamoorthi
We also introduce a novel notion of an expected surface in Gaussian splatting, which is critical to our optimization.
1 code implementation • 26 Apr 2024 • Pengwei Xie, Rui Chen, Siang Chen, Yuzhe Qin, Fanbo Xiang, Tianyu Sun, Jing Xu, Guijin Wang, Hao Su
Manipulating unseen articulated objects through visual feedback is a critical but challenging task for real robots.
no code implementations • 25 Apr 2024 • Tongzhou Mu, Minghua Liu, Hao Su
The success of many RL techniques heavily relies on human-engineered dense rewards, which typically demand substantial domain expertise and extensive trial and error.
no code implementations • 18 Apr 2024 • Isabella Liu, Hao Su, Xiaolong Wang
We introduce the Gaussian-Mesh Anchoring, which encourages evenly distributed Gaussians, resulting better mesh reconstruction through mesh-guided densification and pruning on the deformed Gaussians.
no code implementations • 18 Apr 2024 • Xinyue Wei, Kai Zhang, Sai Bi, Hao Tan, Fujun Luan, Valentin Deschaintre, Kalyan Sunkavalli, Hao Su, Zexiang Xu
This allows for end-to-end mesh reconstruction by fine-tuning a pre-trained NeRF LRM with mesh rendering.
no code implementations • 11 Apr 2024 • Tongzhou Mu, Yijie Guo, Jie Xu, Ankit Goyal, Hao Su, Dieter Fox, Animesh Garg
Encouraged by the remarkable achievements of language and vision foundation models, developing generalist robotic agents through imitation learning, using large demonstration datasets, has become a prominent area of interest in robot learning.
no code implementations • CVPR 2024 • Mukund Varma T, Peihao Wang, Zhiwen Fan, Zhangyang Wang, Hao Su, Ravi Ramamoorthi
In recent years, there has been an explosion of 2D vision models for numerous tasks such as semantic segmentation, style transfer or scene editing, enabled by large-scale 2D image datasets.
1 code implementation • 18 Mar 2024 • Hansheng Chen, Ruoxi Shi, Yulin Liu, Bokui Shen, Jiayuan Gu, Gordon Wetzstein, Hao Su, Leonidas Guibas
Open-domain 3D object synthesis has been lagging behind image synthesis due to limited data and higher computational complexity.
no code implementations • CVPR 2024 • Jun Wang, Yuzhe Qin, Kaiming Kuang, Yigit Korkmaz, Akhilan Gurumoorthy, Hao Su, Xiaolong Wang
We introduce CyberDemo, a novel approach to robotic imitation learning that leverages simulated human demonstrations for real-world tasks.
no code implementations • 2 Jan 2024 • Piao Hu, Jiashuo Jiang, Guodong Lyu, Hao Su
When the model parameters are drawn from unknown non-stationary distributions and we are given machine-learned predictions of the distributions, we develop a new algorithm from our framework with a regret $O(W_T+\sqrt{T})$, where $W_T$ measures the total inaccuracy of the machine-learned predictions.
no code implementations • CVPR 2024 • Ruoxi Shi, Xinyue Wei, Cheng Wang, Hao Su
We present ZeroRF a novel per-scene optimization method addressing the challenge of sparse view 360deg reconstruction in neural field representations.
no code implementations • 14 Dec 2023 • Ruoxi Shi, Xinyue Wei, Cheng Wang, Hao Su
We present ZeroRF, a novel per-scene optimization method addressing the challenge of sparse view 360{\deg} reconstruction in neural field representations.
no code implementations • NeurIPS 2023 • Zhiao Huang, Feng Chen, Yewen Pu, Chunru Lin, Hao Su, Chuang Gan
Combining gradient-based trajectory optimization with differentiable physics simulation is an efficient technique for solving soft-body manipulation problems.
no code implementations • 9 Dec 2023 • Litian Liang, Liuyu Bian, Caiwei Xiao, Jialin Zhang, Linghao Chen, Isabella Liu, Fanbo Xiang, Zhiao Huang, Hao Su
Building robots that can automate labor-intensive tasks has long been the core motivation behind the advancements in computer vision and the robotics community.
1 code implementation • 5 Dec 2023 • Yuchen Zhou, Jiayuan Gu, Xuanlin Li, Minghua Liu, Yunhao Fang, Hao Su
Open-world 3D part segmentation is pivotal in diverse applications such as robotics and AR/VR.
no code implementations • CVPR 2024 • Minghua Liu, Ruoxi Shi, Linghao Chen, Zhuoyang Zhang, Chao Xu, Xinyue Wei, Hansheng Chen, Chong Zeng, Jiayuan Gu, Hao Su
Recent advancements in open-world 3D object generation have been remarkable, with image-to-3D methods offering superior fine-grained control over their text-to-3D counterparts.
no code implementations • 3 Nov 2023 • Jiayuan Gu, Sean Kirmani, Paul Wohlhart, Yao Lu, Montserrat Gonzalez Arenas, Kanishka Rao, Wenhao Yu, Chuyuan Fu, Keerthana Gopalakrishnan, Zhuo Xu, Priya Sundaresan, Peng Xu, Hao Su, Karol Hausman, Chelsea Finn, Quan Vuong, Ted Xiao
Generalization remains one of the most important desiderata for robust robot learning systems.
1 code implementation • 1 Nov 2023 • Zhan Ling, Yunhao Fang, Xuanlin Li, Tongzhou Mu, Mingu Lee, Reza Pourreza, Roland Memisevic, Hao Su
Large Language Models (LLMs) have achieved tremendous progress, yet they still often struggle with challenging reasoning problems.
2 code implementations • 25 Oct 2023 • Nicklas Hansen, Hao Su, Xiaolong Wang
TD-MPC is a model-based reinforcement learning (RL) algorithm that performs local trajectory optimization in the latent space of a learned implicit (decoder-free) world model.
1 code implementation • 23 Oct 2023 • Ruoxi Shi, Hansheng Chen, Zhuoyang Zhang, Minghua Liu, Chao Xu, Xinyue Wei, Linghao Chen, Chong Zeng, Hao Su
We report Zero123++, an image-conditioned diffusion model for generating 3D-consistent multi-view images from a single input view.
no code implementations • 28 Sep 2023 • Songlin Wei, Jiazhao Zhang, Yang Wang, Fanbo Xiang, Hao Su, He Wang
Existing works rely on the independence assumption of points in the radiance field or the pixels in input views to obtain tractable forms of the probability density function.
no code implementations • NeurIPS 2023 • Isabella Liu, Linghao Chen, Ziyang Fu, Liwen Wu, Haian Jin, Zhong Li, Chin Ming Ryan Wong, Yi Xu, Ravi Ramamoorthi, Zexiang Xu, Hao Su
We introduce OpenIllumination, a real-world dataset containing over 108K images of 64 objects with diverse materials, captured under 72 camera views and a large number of different illuminations.
1 code implementation • ICCV 2023 • Quankai Gao, Qiangeng Xu, Hao Su, Ulrich Neumann, Zexiang Xu
In contrast to TensoRF which uses a global tensor and focuses on their vector-matrix decomposition, we propose to utilize a cloud of local tensors and apply the classic CANDECOMP/PARAFAC (CP) decomposition to factorize each tensor into triple vectors that express local feature distributions along spatial axes and compactly encode a local neural field.
no code implementations • 20 Jul 2023 • Zhiao Huang, Litian Liang, Zhan Ling, Xuanlin Li, Chuang Gan, Hao Su
We then present a practical model-based RL method, called Reparameterized Policy Gradient (RPG), which leverages the multimodal policy parameterization and learned world model to achieve strong exploration capabilities and high data efficiency.
no code implementations • 19 Jul 2023 • Hao Su, Xuefeng Liu, Jianwei Niu, Ji Wan, Xinghao Wu
Unlike these studies, our 3Deformer is a non-training and common framework, which only requires supervision of readily-available semantic images, and is compatible with editing various objects unlimited by datasets.
no code implementations • 10 Jul 2023 • Yuzhe Qin, Wei Yang, Binghao Huang, Karl Van Wyk, Hao Su, Xiaolong Wang, Yu-Wei Chao, Dieter Fox
For real-world experiments, AnyTeleop can outperform a previous system that was designed for a specific robot hardware with a higher success rate, using the same robot.
1 code implementation • ICCV 2023 • Xuanlin Li, Yunhao Fang, Minghua Liu, Zhan Ling, Zhuowen Tu, Hao Su
Model distillation, the process of creating smaller, faster models that maintain the performance of larger models, is a promising direction towards the solution.
1 code implementation • NeurIPS 2023 • Minghua Liu, Chao Xu, Haian Jin, Linghao Chen, Mukund Varma T, Zexiang Xu, Hao Su
Single image 3D reconstruction is an important but challenging task that requires extensive knowledge of our natural world.
1 code implementation • 11 Jun 2023 • Zhan Ling, Yunchao Yao, Xuanlin Li, Hao Su
Recent studies on visual reinforcement learning (visual RL) have explored the use of 3D visual representations.
1 code implementation • NeurIPS 2023 • Zhan Ling, Yunhao Fang, Xuanlin Li, Zhiao Huang, Mingu Lee, Roland Memisevic, Hao Su
In light of this, we propose to decompose a reasoning verification process into a series of step-by-step subprocesses, each only receiving their necessary context and premises.
no code implementations • 28 May 2023 • Zhiwei Jia, Pradyumna Narayana, Arjun R. Akula, Garima Pruthi, Hao Su, Sugato Basu, Varun Jampani
Image ad understanding is a crucial task with wide real-world applications.
no code implementations • 26 May 2023 • Xinyue Wei, Fanbo Xiang, Sai Bi, Anpei Chen, Kalyan Sunkavalli, Zexiang Xu, Hao Su
We present a method for generating high-quality watertight manifold meshes from multi-view input images.
1 code implementation • NeurIPS 2023 • Minghua Liu, Ruoxi Shi, Kaiming Kuang, Yinhao Zhu, Xuanlin Li, Shizhong Han, Hong Cai, Fatih Porikli, Hao Su
Due to their alignment with CLIP embeddings, our learned shape representations can also be integrated with off-the-shelf CLIP-based models for various applications, such as point cloud captioning and point cloud-conditioned image generation.
Ranked #5 on Zero-shot 3D Point Cloud Classification on OmniObject3D (Pretrained on ShapeNet) (using extra training data)
no code implementations • 2 May 2023 • Linghao Chen, Yuzhe Qin, Xiaowei Zhou, Hao Su
Hand-eye calibration is a critical task in robotics, as it directly affects the efficacy of critical operations such as manipulation and grasping.
1 code implementation • CVPR 2023 • Haian Jin, Isabella Liu, Peijia Xu, Xiaoshuai Zhang, Songfang Han, Sai Bi, Xiaowei Zhou, Zexiang Xu, Hao Su
We propose TensoIR, a novel inverse rendering approach based on tensor factorization and neural fields.
1 code implementation • ICCV 2023 • Hansheng Chen, Jiatao Gu, Anpei Chen, Wei Tian, Zhuowen Tu, Lingjie Liu, Hao Su
3D-aware image synthesis encompasses a variety of tasks, such as scene generation and novel view synthesis from images.
1 code implementation • 3 Apr 2023 • Zhiwei Jia, Vineet Thumuluri, Fangchen Liu, Linghao Chen, Zhiao Huang, Hao Su
By grouping temporarily close and functionally similar actions into subskill-level demo segments, the observations at the segment boundaries constitute a chain of planning steps for the task, which we refer to as the chain-of-thought (CoT).
no code implementations • 27 Mar 2023 • Sizhe Li, Zhiao Huang, Tao Chen, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan
Reinforcement learning approaches for dexterous rigid object manipulation would struggle in this setting due to the complexity of physics interaction with deformable objects.
no code implementations • 23 Mar 2023 • Tongzhou Mu, Hao Su
Although reinforcement learning has seen tremendous success recently, this kind of trial-and-error learning can be impractical or inefficient in complex environments.
no code implementations • 10 Mar 2023 • Kaizhi Yang, Xiaoshuai Zhang, Zhiao Huang, Xuejin Chen, Zexiang Xu, Hao Su
Under the Lagrangian view, we parameterize the scene motion by tracking the trajectory of particles on objects.
no code implementations • CVPR 2023 • Xiaoshuai Zhang, Abhijit Kundu, Thomas Funkhouser, Leonidas Guibas, Hao Su, Kyle Genova
We address efficient and structure-aware 3D scene representation from images.
1 code implementation • 9 Feb 2023 • Jiayuan Gu, Fanbo Xiang, Xuanlin Li, Zhan Ling, Xiqiang Liu, Tongzhou Mu, Yihe Tang, Stone Tao, Xinyue Wei, Yunchao Yao, Xiaodi Yuan, Pengwei Xie, Zhiao Huang, Rui Chen, Hao Su
Generalizable manipulation skills, which can be composed to tackle long-horizon and complex daily chores, are one of the cornerstones of Embodied AI.
1 code implementation • 2 Feb 2023 • Anpei Chen, Zexiang Xu, Xinyue Wei, Siyu Tang, Hao Su, Andreas Geiger
Our experiments show that DiF leads to improvements in approximation quality, compactness, and training time when compared to previous fast reconstruction methods.
1 code implementation • 12 Dec 2022 • Nicklas Hansen, Zhecheng Yuan, Yanjie Ze, Tongzhou Mu, Aravind Rajeswaran, Hao Su, Huazhe Xu, Xiaolong Wang
In this paper, we examine the effectiveness of pre-training for visuo-motor control tasks.
1 code implementation • 12 Dec 2022 • Nicklas Hansen, Yixin Lin, Hao Su, Xiaolong Wang, Vikash Kumar, Aravind Rajeswaran
We identify key ingredients for leveraging demonstrations in model learning -- policy pretraining, targeted exploration, and oversampling of demonstration data -- which forms the three phases of our model-based RL framework.
Deep Reinforcement Learning Model-based Reinforcement Learning +2
2 code implementations • CVPR 2023 • Minghua Liu, Yinhao Zhu, Hong Cai, Shizhong Han, Zhan Ling, Fatih Porikli, Hao Su
Generalizable 3D part segmentation is important but challenging in vision and robotics.
no code implementations • 17 Nov 2022 • Yuzhe Qin, Binghao Huang, Zhao-Heng Yin, Hao Su, Xiaolong Wang
We empirically evaluate our method using an Allegro Hand to grasp novel objects in both simulation and real world.
1 code implementation • 19 Oct 2022 • Yifan Xu, Nicklas Hansen, ZiRui Wang, Yung-Chieh Chan, Hao Su, Zhuowen Tu
Reinforcement Learning (RL) algorithms can solve challenging control problems directly from image observations, but they often require millions of environment interactions to do so.
1 code implementation • 14 Oct 2022 • Stone Tao, Xiaochen Li, Tongzhou Mu, Zhiao Huang, Yuzhe Qin, Hao Su
In the abstract environment, complex dynamics such as physical manipulation are removed, making abstract trajectories easier to generate.
1 code implementation • 14 Oct 2022 • Minghua Liu, Xuanlin Li, Zhan Ling, Yangyan Li, Hao Su
We study how choices of input point cloud coordinate frames impact learning of manipulation skills from 3D point clouds.
no code implementations • 14 Oct 2022 • Minghua Liu, Yin Zhou, Charles R. Qi, Boqing Gong, Hao Su, Dragomir Anguelov
Our method co-designs an efficient labeling process with semi/weakly supervised learning and is applicable to nearly any 3D semantic segmentation backbones.
1 code implementation • 6 Sep 2022 • Jiayuan Gu, Devendra Singh Chaplot, Hao Su, Jitendra Malik
To tackle the entire task, prior work chains multiple stationary manipulation skills with a point-goal navigation skill, which are learned individually on subtasks.
1 code implementation • 26 Jun 2022 • Zhiwei Jia, Xuanlin Li, Zhan Ling, Shuang Liu, Yiran Wu, Hao Su
Generalization in deep reinforcement learning over unseen environment variations usually requires policy learning over a large set of diverse training variations.
1 code implementation • 5 May 2022 • Xinyue Wei, Minghua Liu, Zhan Ling, Hao Su
Approximate convex decomposition aims to decompose a 3D shape into a set of almost convex components, whose convex hulls can then be used to represent the input shape.
no code implementations • ICLR 2022 • Sizhe Li, Zhiao Huang, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan
Extensive experimental results suggest that: 1) on multi-stage tasks that are infeasible for the vanilla differentiable physics solver, our approach discovers contact points that efficiently guide the solver to completion; 2) on tasks where the vanilla solver performs sub-optimally or near-optimally, our contact point discovery method performs better than or on par with the manipulation performance obtained with handcrafted contact points.
no code implementations • 26 Apr 2022 • Yuzhe Qin, Hao Su, Xiaolong Wang
We propose to perform imitation learning for dexterous manipulation with multi-finger robot hand from human demonstrations, and transfer the policy to the real robot hand.
no code implementations • 21 Apr 2022 • Shuang Liu, Hao Su
We propose and analyze a kernelized version of Q-learning.
1 code implementation • CVPR 2022 • Xiaoshuai Zhang, Sai Bi, Kalyan Sunkavalli, Hao Su, Zexiang Xu
We demonstrate that NeRFusion achieves state-of-the-art quality on both large-scale indoor and small-scale object scenes, with substantially faster reconstruction than NeRF and other recent methods.
2 code implementations • 17 Mar 2022 • Anpei Chen, Zexiang Xu, Andreas Geiger, Jingyi Yu, Hao Su
We demonstrate that applying traditional CP decomposition -- that factorizes tensors into rank-one components with compact vectors -- in our framework leads to improvements over vanilla NeRF.
Ranked #3 on Novel View Synthesis on X3D
2 code implementations • 9 Mar 2022 • Nicklas Hansen, Xiaolong Wang, Hao Su
Data-driven model predictive control has two key advantages over model-free methods: a potential for improved sample efficiency through model learning, and better performance as computational budget for planning increases.
no code implementations • CVPR 2022 • Isabella Liu, Edward Yang, Jianyu Tao, Rui Chen, Xiaoshuai Zhang, Qing Ran, Zhu Liu, Hao Su
First, we demonstrate the transferability of our method to out-of-distribution real data by using a mixed domain learning strategy.
1 code implementation • 2 Dec 2021 • Xingchao Liu, Chengyue Gong, Lemeng Wu, Shujian Zhang, Hao Su, Qiang Liu
We approach text-to-image generation by combining the power of the retrained CLIP representation with an off-the-shelf image generator (GANs), optimizing in the latent space of GAN to find images that achieve maximum CLIP score with the given input text.
Ranked #46 on Text-to-Image Generation on MS COCO
1 code implementation • 10 Oct 2021 • Hao Su, Jianwei Niu, Xuefeng Liu, Jiahe Cui, Ji Wan
Manga is a fashionable Japanese-style comic form that is composed of black-and-white strokes and is generally displayed as raster images on digital devices.
3 code implementations • 30 Jul 2021 • Tongzhou Mu, Zhan Ling, Fanbo Xiang, Derek Yang, Xuanlin Li, Stone Tao, Zhiao Huang, Zhiwei Jia, Hao Su
Here we propose SAPIEN Manipulation Skill Benchmark (ManiSkill) to benchmark manipulation skills over diverse objects in a full-physics simulator.
3 code implementations • NeurIPS 2021 • Nicklas Hansen, Hao Su, Xiaolong Wang
Our method greatly improves stability and sample efficiency of ConvNets under augmentation, and achieves generalization results competitive with state-of-the-art methods for image-based RL in environments with unseen visuals.
1 code implementation • 29 Jun 2021 • Kaichun Mo, Yuzhe Qin, Fanbo Xiang, Hao Su, Leonidas Guibas
Contrary to the vast literature in modeling, perceiving, and understanding agent-object (e. g., human-object, hand-object, robot-object) interaction in computer vision and robotics, very few past works have studied the task of object-object interaction, which also plays an important role in robotic manipulation and planning tasks.
no code implementations • 28 Jun 2021 • Quan Vuong, Yuzhe Qin, Runlin Guo, Xiaolong Wang, Hao Su, Henrik Christensen
We propose a teleoperation system that uses a single RGB-D camera as the human motion capture device.
2 code implementations • NeurIPS 2021 • Raghav Kansal, Javier Duarte, Hao Su, Breno Orzari, Thiago Tomei, Maurizio Pierini, Mary Touranakou, Jean-Roch Vlimant, Dimitrios Gunopulos
We propose JetNet as a novel point-cloud-style dataset for the ML community to experiment with, and set MPGAN as a benchmark to improve upon for future generative models.
1 code implementation • CVPR 2021 • Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu, Tao Ren
Quick Response (QR) code is one of the most worldwide used two-dimensional codes.
1 code implementation • 28 May 2021 • Hao Su
One-shot learning is proposed to make a pretrained classifier workable on a new dataset based on one labeled samples from each pattern.
1 code implementation • ICCV 2021 • Yijia Weng, He Wang, Qiang Zhou, Yuzhe Qin, Yueqi Duan, Qingnan Fan, Baoquan Chen, Hao Su, Leonidas J. Guibas
For the first time, we propose a unified framework that can handle 9DoF pose tracking for novel rigid object instances as well as per-part pose tracking for articulated objects from known categories.
1 code implementation • ICLR 2021 • Zhiao Huang, Yuanming Hu, Tao Du, Siyuan Zhou, Hao Su, Joshua B. Tenenbaum, Chuang Gan
Experimental results suggest that 1) RL-based approaches struggle to solve most of the tasks efficiently; 2) gradient-based approaches, by optimizing open-loop control sequences with the built-in differentiable physics engine, can rapidly find a solution within tens of iterations, but still fall short on multi-stage tasks that require long-term planning.
2 code implementations • ICCV 2021 • Anpei Chen, Zexiang Xu, Fuqiang Zhao, Xiaoshuai Zhang, Fanbo Xiang, Jingyi Yu, Hao Su
We present MVSNeRF, a novel neural rendering approach that can efficiently reconstruct neural radiance fields for view synthesis.
1 code implementation • ICCV 2021 • Quan Meng, Anpei Chen, Haimin Luo, Minye Wu, Hao Su, Lan Xu, Xuming He, Jingyi Yu
We introduce GNeRF, a framework to marry Generative Adversarial Networks (GAN) with Neural Radiance Field (NeRF) reconstruction for the complex scenarios with unknown and even randomly initialized camera poses.
1 code implementation • CVPR 2021 • Fanbo Xiang, Zexiang Xu, Miloš Hašan, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Hao Su
We achieve this by introducing a 3D-to-2D texture mapping (or surface parameterization) network into volumetric representations.
1 code implementation • CVPR 2021 • Minghua Liu, Minhyuk Sung, Radomir Mech, Hao Su
Given a collection of 3D meshes of a category and their deformation handles (control points), our method learns a set of meta-handles for each shape, which are represented as combinations of the given handles.
no code implementations • 14 Dec 2020 • Antonio Di Lallo, Robin R. Murphy, Axel Krieger, Junxi Zhu, Russell H. Taylor, Hao Su
Medical robots can play an important role in mitigating the spread of infectious diseases and delivering quality care to patients during the COVID-19 pandemic.
Robotics
1 code implementation • ICCV 2021 • Zhiwei Jia, Bodi Yuan, Kangkang Wang, Hong Wu, David Clifford, Zhiqiang Yuan, Hao Su
Many applications of unpaired image-to-image translation require the input contents to be preserved semantically during translations.
1 code implementation • 4 Dec 2020 • Songfang Han, Jiayuan Gu, Kaichun Mo, Li Yi, Siyu Hu, Xuejin Chen, Hao Su
However, there remains a much more difficult and under-explored issue on how to generalize the learned skills over unseen object categories that have very different shape geometry distributions.
no code implementations • NeurIPS 2020 • Hao Tang, Zhiao Huang, Jiayuan Gu, Bao-liang Lu, Hao Su
Current graph neural networks (GNNs) lack generalizability with respect to scales (graph sizes, graph diameters, edge weights, etc..) when solving many graph analysis problems.
no code implementations • 16 Nov 2020 • Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu, Tao Ren
Quick Response (QR) code is one of the most worldwide used two-dimensional codes.~Traditional QR codes appear as random collections of black-and-white modules that lack visual semantics and aesthetic elements, which inspires the recent works to beautify the appearances of QR codes.
no code implementations • 3 Nov 2020 • Dhruv Batra, Angel X. Chang, Sonia Chernova, Andrew J. Davison, Jia Deng, Vladlen Koltun, Sergey Levine, Jitendra Malik, Igor Mordatch, Roozbeh Mottaghi, Manolis Savva, Hao Su
In the rearrangement task, the goal is to bring a given physical environment into a specified state.
no code implementations • NeurIPS 2020 • Tongzhou Mu, Jiayuan Gu, Zhiwei Jia, Hao Tang, Hao Su
We study how to learn a policy with compositional generalizability.
1 code implementation • 26 Oct 2020 • Hao Tang, Zhiao Huang, Jiayuan Gu, Bao-liang Lu, Hao Su
Current graph neural networks (GNNs) lack generalizability with respect to scales (graph sizes, graph diameters, edge weights, etc..) when solving many graph analysis problems.
1 code implementation • ICLR 2021 • Haotong Qin, Zhongang Cai, Mingyuan Zhang, Yifu Ding, Haiyu Zhao, Shuai Yi, Xianglong Liu, Hao Su
To alleviate the resource constraint for real-time point cloud applications that run on edge devices, in this paper we present BiPointNet, the first model binarization approach for efficient deep learning on point clouds.
no code implementations • 5 Oct 2020 • Shilin Zhu, Zexiang Xu, Tiancheng Sun, Alexandr Kuznetsov, Mark Meyer, Henrik Wann Jensen, Hao Su, Ravi Ramamoorthi
To fully make use of our deep neural network, we partition the scene space into an adaptive hierarchical grid, in which we apply our network to reconstruct high-quality sampling distributions for any local region in the scene.
no code implementations • ECCV 2020 • Jiayuan Gu, Wei-Chiu Ma, Sivabalan Manivasagam, Wenyuan Zeng, ZiHao Wang, Yuwen Xiong, Hao Su, Raquel Urtasun
3D shape completion for real data is important but challenging, since partial point clouds acquired by real-world sensors are usually sparse, noisy and unaligned.
1 code implementation • 29 Jul 2020 • You-Yi Jau, Rui Zhu, Hao Su, Manmohan Chandraker
Estimating relative camera poses from consecutive frames is a fundamental problem in visual odometry (VO) and simultaneous localization and mapping (SLAM), where classic methods consisting of hand-crafted features and sampling-based outlier rejection have been a dominant choice for over a decade.
no code implementations • 21 Jul 2020 • Tianwen Zhang, Xiaoling Zhang, Jun Shi, Shunjun Wei, Jianguo Wang, Jianwei Li, Hao Su, Yue Zhou
Huge imbalance of different scenes' sample numbers seriously reduces Synthetic Aperture Radar (SAR) ship detection accuracy.
1 code implementation • ECCV 2020 • Minghua Liu, Xiaoshuai Zhang, Hao Su
We are interested in reconstructing the mesh representation of object surfaces from point clouds.
1 code implementation • 7 Jul 2020 • Anpei Chen, Ruiyang Liu, Ling Xie, Zhang Chen, Hao Su, Jingyi Yu
To address this issue, we propose a SofGAN image generator to decouple the latent space of portraits into two subspaces: a geometry space and a texture space.
no code implementations • 12 Jun 2020 • He Wang, Zetian Jiang, Li Yi, Kaichun Mo, Hao Su, Leonidas J. Guibas
We further study how different evaluation metrics weigh the sampling pattern against the geometry and propose several perceptual metrics forming a sampling spectrum of metrics.
no code implementations • 25 Apr 2020 • Shilin Zhu, Zexiang Xu, Henrik Wann Jensen, Hao Su, Ravi Ramamoorthi
This network is easy to incorporate in many previous photon mapping methods (by simply swapping the kernel density estimator) and can produce high-quality reconstructions of complex global illumination effects like caustics with an order of magnitude fewer photons compared to previous photon mapping methods.
no code implementations • 22 Apr 2020 • Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Jiahe Cui, Ji Wan
Manga is a world popular comic form originated in Japan, which typically employs black-and-white stroke lines and geometric exaggeration to describe humans' appearances, poses, and actions.
1 code implementation • CVPR 2020 • Fanbo Xiang, Yuzhe Qin, Kaichun Mo, Yikuan Xia, Hao Zhu, Fangchen Liu, Minghua Liu, Hanxiao Jiang, Yifu Yuan, He Wang, Li Yi, Angel X. Chang, Leonidas J. Guibas, Hao Su
To achieve this task, a simulated environment with physically realistic simulation, sufficient articulated objects, and transferability to the real robot is indispensable.
1 code implementation • ICLR 2020 • Tiange Luo, Kaichun Mo, Zhiao Huang, Jiarui Xu, Siyu Hu, Li-Wei Wang, Hao Su
We address the problem of discovering 3D parts for objects in unseen categories.
no code implementations • 12 Feb 2020 • Shuang Liu, Hao Su
We give motivations and derive lower and upper bounds for such measures.
1 code implementation • CVPR 2020 • Shuo Cheng, Zexiang Xu, Shilin Zhu, Zhuwen Li, Li Erran Li, Ravi Ramamoorthi, Hao Su
In contrast, we propose adaptive thin volumes (ATVs); in an ATV, the depth hypothesis of each plane is spatially varying, which adapts to the uncertainties of previous per-pixel depth predictions.
Ranked #14 on 3D Reconstruction on DTU
1 code implementation • CVPR 2020 • Kaichun Mo, Paul Guerrero, Li Yi, Hao Su, Peter Wonka, Niloy Mitra, Leonidas J. Guibas
Learning to encode differences in the geometry and (topological) structure of the shapes of ordinary objects is key to generating semantically plausible variations of a given shape, transferring edits from one shape to another, and many other applications in 3D content creation.
1 code implementation • CVPR 2020 • Uday Kusupati, Shuo Cheng, Rui Chen, Hao Su
We couple the learning of a multi-view normal estimation module and a multi-view depth estimation module.
no code implementations • ICLR 2020 • Fangchen Liu, Zhan Ling, Tongzhou Mu, Hao Su
Consider an imitation learning problem that the imitator and the expert have different dynamics models.
1 code implementation • ICML 2020 • Zhiwei Jia, Hao Su
Recent advances in deep learning theory have evoked the study of generalizability across different local minima of deep neural networks (DNNs).
1 code implementation • 31 Oct 2019 • Yuzhe Qin, Rui Chen, Hao Zhu, Meng Song, Jing Xu, Hao Su
Grasping is among the most fundamental and long-lasting problems in robotics study.
no code implementations • 30 Sep 2019 • Maximilian Jaritz, Jiayuan Gu, Hao Su
Fusion of 2D images and 3D point clouds is important because information from dense images can enhance sparse point clouds.
no code implementations • NeurIPS 2020 • Jiachen Li, Quan Vuong, Shuang Liu, Minghua Liu, Kamil Ciosek, Keith Ross, Henrik Iskov Christensen, Hao Su
To perform well, the policy must infer the task identity from collected transitions by modelling its dependency on states, actions and rewards.
no code implementations • 25 Sep 2019 • Yueh-Hua Wu, Ting-Han Fan, Peter J. Ramadge, Hao Su
Based on the claim, we propose to learn the transition model by matching the distributions of multi-step rollouts sampled from the transition model and the real ones via WGAN.
Model-based Reinforcement Learning reinforcement-learning +2
no code implementations • 25 Sep 2019 • Quan Vuong, Shuang Liu, Minghua Liu, Kamil Ciosek, Hao Su, Henrik Iskov Christensen
Combining ideas from Batch RL and Meta RL, we propose tiMe, which learns distillation of multiple value functions and MDP embeddings from only existing data.
2 code implementations • 16 Aug 2019 • Daniel Liu, Ronald Yu, Hao Su
The importance of training robust neural network grows as 3D data is increasingly utilized in deep learning for vision tasks in robotics, drone control, and autonomous driving.
1 code implementation • NeurIPS 2019 • Zhiao Huang, Fangchen Liu, Hao Su
An agent that has well understood the environment should be able to apply its skills for any given goals, leading to the fundamental problem of learning the Universal Value Function Approximator (UVFA).
1 code implementation • ICCV 2019 • Rui Chen, Songfang Han, Jing Xu, Hao Su
More specifically, our method predicts the depth in a coarse-to-fine manner.
2 code implementations • 1 Aug 2019 • Kaichun Mo, Paul Guerrero, Li Yi, Hao Su, Peter Wonka, Niloy Mitra, Leonidas J. Guibas
We introduce StructureNet, a hierarchical graph network which (i) can directly encode shapes represented as such n-ary graphs; (ii) can be robustly trained on large and complex shape families; and (iii) can be used to generate a great diversity of realistic structured shape geometries.
1 code implementation • 28 Mar 2019 • Quan Vuong, Sharad Vikram, Hao Su, Sicun Gao, Henrik I. Christensen
A human-specified design choice in domain randomization is the form and parameters of the distribution of simulated environments.
2 code implementations • 12 Jan 2019 • Jun Gao, Chengcheng Tang, Vignesh Ganapathi-Subramanian, Jiahui Huang, Hao Su, Leonidas J. Guibas
Reconstruction of geometry based on different input modes, such as images or point clouds, has been instrumental in the development of computer aided design and computer graphics.
1 code implementation • 10 Jan 2019 • Daniel Liu, Ronald Yu, Hao Su
We present a preliminary evaluation of adversarial attacks on deep 3D point cloud classifiers, namely PointNet and PointNet++, by evaluating both white-box and black-box adversarial attacks that were proposed for 2D images and extending those attacks to reduce the perceptibility of the perturbations in 3D space.
no code implementations • 11 Dec 2018 • Yinghao Xu, Xin Dong, Yudian Li, Hao Su
To reduce memory footprint and run-time latency, techniques such as neural network pruning and binarization have been explored separately.
5 code implementations • CVPR 2019 • Kaichun Mo, Shilin Zhu, Angel X. Chang, Li Yi, Subarna Tripathi, Leonidas J. Guibas, Hao Su
We present PartNet: a consistent, large-scale dataset of 3D objects annotated with fine-grained, instance-level, and hierarchical 3D part information.
Ranked #3 on 3D Instance Segmentation on PartNet
1 code implementation • CVPR 2019 • Bo Sun, Nian-hsuan Tsai, Fangchen Liu, Ronald Yu, Hao Su
We propose an adversarial defense method that achieves state-of-the-art performance among attack-agnostic adversarial defense methods while also maintaining robustness to input resolution, scale of adversarial perturbation, and scale of dataset size.
no code implementations • CVPR 2019 • Xibin Song, Peng Wang, Dingfu Zhou, Rui Zhu, Chenye Guan, Yuchao Dai, Hao Su, Hongdong Li, Ruigang Yang
Specifically, we first segment each car with a pre-trained Mask R-CNN, and then regress towards its 3D pose and shape based on a deformable 3D car model with or without using semantic keypoints.
no code implementations • 4 Oct 2018 • Cheng Fu, Shilin Zhu, Hao Su, Ching-En Lee, Jishen Zhao
Thus there does exist redundancy that can be exploited to further reduce the amount of on-chip computations.
no code implementations • 27 Sep 2018 • Xingchao Liu, Tongzhou Mu, Hao Su
In this paper, we investigate the problem of transfer learning across environments with different dynamics while accomplishing the same task in the continuous control domain.
1 code implementation • 19 Sep 2018 • Li Yi, Haibin Huang, Difan Liu, Evangelos Kalogerakis, Hao Su, Leonidas Guibas
In this paper, we explore how the observation of different articulation states provides evidence for part structure and motion of 3D objects.
1 code implementation • CVPR 2019 • Shilin Zhu, Xin Dong, Hao Su
Binary neural networks (BNN) have been studied extensively since they run dramatically faster at lower memory and power consumption than floating-point networks, thanks to the efficiency of bit operations.
no code implementations • CVPR 2018 • Chuang Gan, Boqing Gong, Kun Liu, Hao Su, Leonidas J. Guibas
In addition, we also find that a progressive training strategy can foster a better neural network for the video recognition task than blindly pooling the distinct sources of geometry cues together.
1 code implementation • NeurIPS 2018 • Minhyuk Sung, Hao Su, Ronald Yu, Leonidas Guibas
Even though our shapes have independent discretizations and no functional correspondences are provided, the network is able to generate latent bases, in a consistent order, that reflect the shared semantic structure among the shapes.
no code implementations • CVPR 2018 • Hao Zhu, Hao Su, Peng Wang, Xun Cao, Ruigang Yang
We study how to synthesize novel views of human body from a single image.
2 code implementations • 5 Feb 2018 • Jingwei Huang, Hao Su, Leonidas Guibas
In this paper, we describe a robust algorithm for 2-Manifold generation of various kinds of ShapeNet Models.
Computational Geometry
67 code implementations • CVPR 2018 • Charles R. Qi, Wei Liu, Chenxia Wu, Hao Su, Leonidas J. Guibas
In this work, we study 3D object detection from RGB-D data in both indoor and outdoor scenes.
Ranked #1 on Object Localization on KITTI Pedestrians Easy
1 code implementation • 17 Oct 2017 • Li Yi, Lin Shao, Manolis Savva, Haibin Huang, Yang Zhou, Qirui Wang, Benjamin Graham, Martin Engelcke, Roman Klokov, Victor Lempitsky, Yuan Gan, Pengyu Wang, Kun Liu, Fenggen Yu, Panpan Shui, Bingyang Hu, Yan Zhang, Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Minki Jeong, Jaehoon Choi, Changick Kim, Angom Geetchandra, Narasimha Murthy, Bhargava Ramu, Bharadwaj Manda, M. Ramanathan, Gautam Kumar, P Preetham, Siddharth Srivastava, Swati Bhugra, Brejesh lall, Christian Haene, Shubham Tulsiani, Jitendra Malik, Jared Lafer, Ramsey Jones, Siyuan Li, Jie Lu, Shi Jin, Jingyi Yu, Qi-Xing Huang, Evangelos Kalogerakis, Silvio Savarese, Pat Hanrahan, Thomas Funkhouser, Hao Su, Leonidas Guibas
We introduce a large-scale 3D shape understanding benchmark using data and annotation from ShapeNet 3D object database.
1 code implementation • 6 Aug 2017 • Minhyuk Sung, Hao Su, Vladimir G. Kim, Siddhartha Chaudhuri, Leonidas Guibas
The combinatorial nature of part arrangements poses another challenge, since the retrieval network is not a function: several complements can be appropriate for the same input.
Graphics I.3.5
66 code implementations • NeurIPS 2017 • Charles R. Qi, Li Yi, Hao Su, Leonidas J. Guibas
By exploiting metric space distances, our network is able to learn local features with increasing contextual scales.
Ranked #2 on Semantic Segmentation on Toronto-3D L002
1 code implementation • CVPR 2017 • Jian Shi, Yue Dong, Hao Su, Stella X. Yu
Rendered with realistic environment maps, millions of synthetic images of objects and their corresponding albedo, shading, and specular ground-truth images are used to train an encoder-decoder CNN.
no code implementations • CVPR 2018 • Cewu Lu, Hao Su, Yongyi Lu, Li Yi, Chi-Keung Tang, Leonidas Guibas
Important high-level vision tasks such as human-object interaction, image captioning and robotic manipulation require rich semantic descriptions of objects at part level.
4 code implementations • CVPR 2017 • Haoqiang Fan, Hao Su, Leonidas Guibas
Our final solution is a conditional shape sampler, capable of predicting multiple plausible 3D point clouds from an input image.
Ranked #2 on 3D Reconstruction on Data3D−R2N2 (using extra training data)
3D Object Reconstruction From A Single Image 3D Reconstruction
no code implementations • CVPR 2017 • Li Yi, Hao Su, Xingwen Guo, Leonidas Guibas
To enable the prediction of vertex functions on them by convolutional neural networks, we resort to spectral CNN method that enables weight sharing by parameterizing kernels in the spectral domain spanned by graph laplacian eigenbases.
Ranked #57 on 3D Part Segmentation on ShapeNet-Part
110 code implementations • CVPR 2017 • Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas
Point cloud is an important type of geometric data structure.
Ranked #1 on 3D Face Reconstruction on !(()&&!|*|*|
4 code implementations • CVPR 2017 • Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, Jitendra Malik
We present a learning framework for abstracting complex shapes by learning to assemble objects using 3D volumetric primitives.
no code implementations • CVPR 2016 • Xianglong Liu, Xinjie Fan, Cheng Deng, Zhujin Li, Hao Su, DaCheng Tao
Despite its successful progress in classic point-to-point search, there are few studies regarding point-to-hyperplane search, which has strong practical capabilities of scaling up in many applications like active learning with SVMs.
1 code implementation • NeurIPS 2016 • Yangyan Li, Soeren Pirk, Hao Su, Charles R. Qi, Leonidas J. Guibas
Each field probing filter is a set of probing points --- sensors that perceive the space.
Ranked #5 on 3D Object Recognition on ModelNet40