Search Results for author: Hao Su

Found 95 papers, 51 papers with code

Approximate Convex Decomposition for 3D Meshes with Collision-Aware Concavity and Tree Search

no code implementations5 May 2022 Xinyue Wei, Minghua Liu, Zhan Ling, Hao Su

Approximate convex decomposition aims to decompose a 3D shape into a set of almost convex components, whose convex hulls can then be used to represent the input shape.

Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics

no code implementations ICLR 2022 Sizhe Li, Zhiao Huang, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan

Extensive experimental results suggest that: 1) on multi-stage tasks that are infeasible for the vanilla differentiable physics solver, our approach discovers contact points that efficiently guide the solver to completion; 2) on tasks where the vanilla solver performs sub-optimally or near-optimally, our contact point discovery method performs better than or on par with the manipulation performance obtained with handcrafted contact points.

From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation

no code implementations26 Apr 2022 Yuzhe Qin, Hao Su, Xiaolong Wang

We propose to perform imitation learning for dexterous manipulation with multi-finger robot hand from human demonstrations, and transfer the policy to the real robot hand.

Imitation Learning

Provably Efficient Kernelized Q-Learning

no code implementations21 Apr 2022 Shuang Liu, Hao Su

We propose and analyze a kernelized version of Q-learning.


NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction

no code implementations21 Mar 2022 Xiaoshuai Zhang, Sai Bi, Kalyan Sunkavalli, Hao Su, Zexiang Xu

We demonstrate that NeRFusion achieves state-of-the-art quality on both large-scale indoor and small-scale object scenes, with substantially faster reconstruction than NeRF and other recent methods.

3D Reconstruction Frame

TensoRF: Tensorial Radiance Fields

1 code implementation17 Mar 2022 Anpei Chen, Zexiang Xu, Andreas Geiger, Jingyi Yu, Hao Su

We demonstrate that applying traditional CP decomposition -- that factorizes tensors into rank-one components with compact vectors -- in our framework leads to improvements over vanilla NeRF.

Temporal Difference Learning for Model Predictive Control

1 code implementation9 Mar 2022 Nicklas Hansen, Xiaolong Wang, Hao Su

Data-driven model predictive control has two key advantages over model-free methods: a potential for improved sample efficiency through model learning, and better performance as computational budget for planning increases.

Continuous Control

ActiveZero: Mixed Domain Learning for Active Stereovision with Zero Annotation

no code implementations6 Dec 2021 Isabella Liu, Edward Yang, Jianyu Tao, Rui Chen, Xiaoshuai Zhang, Qing Ran, Zhu Liu, Hao Su

First, we demonstrate the transferability of our method to out-of-distribution real data by using a mixed domain learning strategy.

FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization

1 code implementation2 Dec 2021 Xingchao Liu, Chengyue Gong, Lemeng Wu, Shujian Zhang, Hao Su, Qiang Liu

We approach text-to-image generation by combining the power of the retrained CLIP representation with an off-the-shelf image generator (GANs), optimizing in the latent space of GAN to find images that achieve maximum CLIP score with the given input text.

Text to image generation Zero-Shot Text-to-Image Generation

Vectorization of Raster Manga by Deep Reinforcement Learning

no code implementations10 Oct 2021 Hao Su, Jianwei Niu, Xuefeng Liu, Jiahe Cui, Ji Wan

Manga is a popular Japanese-style comic form that consists of black-and-white stroke lines.


ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations

3 code implementations30 Jul 2021 Tongzhou Mu, Zhan Ling, Fanbo Xiang, Derek Yang, Xuanlin Li, Stone Tao, Zhiao Huang, Zhiwei Jia, Hao Su

Here we propose SAPIEN Manipulation Skill Benchmark (ManiSkill) to benchmark manipulation skills over diverse objects in a full-physics simulator.

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

2 code implementations NeurIPS 2021 Nicklas Hansen, Hao Su, Xiaolong Wang

Our method greatly improves stability and sample efficiency of ConvNets under augmentation, and achieves generalization results competitive with state-of-the-art methods for image-based RL in environments with unseen visuals.

Data Augmentation Q-Learning

O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning

1 code implementation29 Jun 2021 Kaichun Mo, Yuzhe Qin, Fanbo Xiang, Hao Su, Leonidas Guibas

Contrary to the vast literature in modeling, perceiving, and understanding agent-object (e. g., human-object, hand-object, robot-object) interaction in computer vision and robotics, very few past works have studied the task of object-object interaction, which also plays an important role in robotic manipulation and planning tasks.

Single RGB-D Camera Teleoperation for General Robotic Manipulation

no code implementations28 Jun 2021 Quan Vuong, Yuzhe Qin, Runlin Guo, Xiaolong Wang, Hao Su, Henrik Christensen

We propose a teleoperation system that uses a single RGB-D camera as the human motion capture device.


Particle Cloud Generation with Message Passing Generative Adversarial Networks

2 code implementations NeurIPS 2021 Raghav Kansal, Javier Duarte, Hao Su, Breno Orzari, Thiago Tomei, Maurizio Pierini, Mary Touranakou, Jean-Roch Vlimant, Dimitrios Gunopulos

We propose JetNet as a novel point-cloud-style dataset for the ML community to experiment with, and set MPGAN as a benchmark to improve upon for future generative models.

One-shot Learning with Absolute Generalization

1 code implementation28 May 2021 Hao Su

One-shot learning is proposed to make a pretrained classifier workable on a new dataset based on one labeled samples from each pattern.

One-Shot Learning

CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds

1 code implementation ICCV 2021 Yijia Weng, He Wang, Qiang Zhou, Yuzhe Qin, Yueqi Duan, Qingnan Fan, Baoquan Chen, Hao Su, Leonidas J. Guibas

For the first time, we propose a unified framework that can handle 9DoF pose tracking for novel rigid object instances as well as per-part pose tracking for articulated objects from known categories.

Frame Pose Tracking

PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics

1 code implementation ICLR 2021 Zhiao Huang, Yuanming Hu, Tao Du, Siyuan Zhou, Hao Su, Joshua B. Tenenbaum, Chuang Gan

Experimental results suggest that 1) RL-based approaches struggle to solve most of the tasks efficiently; 2) gradient-based approaches, by optimizing open-loop control sequences with the built-in differentiable physics engine, can rapidly find a solution within tens of iterations, but still fall short on multi-stage tasks that require long-term planning.

GNeRF: GAN-based Neural Radiance Field without Posed Camera

1 code implementation ICCV 2021 Quan Meng, Anpei Chen, Haimin Luo, Minye Wu, Hao Su, Lan Xu, Xuming He, Jingyi Yu

We introduce GNeRF, a framework to marry Generative Adversarial Networks (GAN) with Neural Radiance Field (NeRF) reconstruction for the complex scenarios with unknown and even randomly initialized camera poses.

Novel View Synthesis

MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo

2 code implementations ICCV 2021 Anpei Chen, Zexiang Xu, Fuqiang Zhao, Xiaoshuai Zhang, Fanbo Xiang, Jingyi Yu, Hao Su

We present MVSNeRF, a novel neural rendering approach that can efficiently reconstruct neural radiance fields for view synthesis.

Neural Rendering

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

1 code implementation CVPR 2021 Fanbo Xiang, Zexiang Xu, Miloš Hašan, Yannick Hold-Geoffroy, Kalyan Sunkavalli, Hao Su

We achieve this by introducing a 3D-to-2D texture mapping (or surface parameterization) network into volumetric representations.

Neural Rendering

DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes with Biharmonic Coordinates

1 code implementation CVPR 2021 Minghua Liu, Minhyuk Sung, Radomir Mech, Hao Su

Given a collection of 3D meshes of a category and their deformation handles (control points), our method learns a set of meta-handles for each shape, which are represented as combinations of the given handles.

Medical Robots for Infectious Diseases: Lessons and Challenges from the COVID-19 Pandemic

no code implementations14 Dec 2020 Antonio Di Lallo, Robin R. Murphy, Axel Krieger, Junxi Zhu, Russell H. Taylor, Hao Su

Medical robots can play an important role in mitigating the spread of infectious diseases and delivering quality care to patients during the COVID-19 pandemic.


Compositionally Generalizable 3D Structure Prediction

1 code implementation4 Dec 2020 Songfang Han, Jiayuan Gu, Kaichun Mo, Li Yi, Siyu Hu, Xuejin Chen, Hao Su

However, there remains a much more difficult and under-explored issue on how to generalize the learned skills over unseen object categories that have very different shape geometry distributions.

3D Shape Reconstruction Translation

Towards Scale-Invariant Graph-related Problem Solving by Iterative Homogeneous GNNs

no code implementations NeurIPS 2020 Hao Tang, Zhiao Huang, Jiayuan Gu, Bao-liang Lu, Hao Su

Current graph neural networks (GNNs) lack generalizability with respect to scales (graph sizes, graph diameters, edge weights, etc..) when solving many graph analysis problems.

An End-to-end Method for Producing Scanning-robust Stylized QR Codes

no code implementations16 Nov 2020 Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu, Tao Ren

Quick Response (QR) code is one of the most worldwide used two-dimensional codes.~Traditional QR codes appear as random collections of black-and-white modules that lack visual semantics and aesthetic elements, which inspires the recent works to beautify the appearances of QR codes.

Style Transfer

Towards Scale-Invariant Graph-related Problem Solving by Iterative Homogeneous Graph Neural Networks

no code implementations26 Oct 2020 Hao Tang, Zhiao Huang, Jiayuan Gu, Bao-liang Lu, Hao Su

Current graph neural networks (GNNs) lack generalizability with respect to scales (graph sizes, graph diameters, edge weights, etc..) when solving many graph analysis problems.

BiPointNet: Binary Neural Network for Point Clouds

1 code implementation ICLR 2021 Haotong Qin, Zhongang Cai, Mingyuan Zhang, Yifu Ding, Haiyu Zhao, Shuai Yi, Xianglong Liu, Hao Su

To alleviate the resource constraint for real-time point cloud applications that run on edge devices, in this paper we present BiPointNet, the first model binarization approach for efficient deep learning on point clouds.


Photon-Driven Neural Path Guiding

no code implementations5 Oct 2020 Shilin Zhu, Zexiang Xu, Tiancheng Sun, Alexandr Kuznetsov, Mark Meyer, Henrik Wann Jensen, Hao Su, Ravi Ramamoorthi

To fully make use of our deep neural network, we partition the scene space into an adaptive hierarchical grid, in which we apply our network to reconstruct high-quality sampling distributions for any local region in the scene.

Weakly-supervised 3D Shape Completion in the Wild

no code implementations ECCV 2020 Jiayuan Gu, Wei-Chiu Ma, Sivabalan Manivasagam, Wenyuan Zeng, ZiHao Wang, Yuwen Xiong, Hao Su, Raquel Urtasun

3D shape completion for real data is important but challenging, since partial point clouds acquired by real-world sensors are usually sparse, noisy and unaligned.

Point Cloud Registration Pose Estimation

Deep Keypoint-Based Camera Pose Estimation with Geometric Constraints

1 code implementation29 Jul 2020 You-Yi Jau, Rui Zhu, Hao Su, Manmohan Chandraker

Estimating relative camera poses from consecutive frames is a fundamental problem in visual odometry (VO) and simultaneous localization and mapping (SLAM), where classic methods consisting of hand-crafted features and sampling-based outlier rejection have been a dominant choice for over a decade.

Pose Estimation Simultaneous Localization and Mapping +1

Balance Scene Learning Mechanism for Offshore and Inshore Ship Detection in SAR Images

no code implementations21 Jul 2020 Tianwen Zhang, Xiaoling Zhang, Jun Shi, Shunjun Wei, Jianguo Wang, Jianwei Li, Hao Su, Yue Zhou

Huge imbalance of different scenes' sample numbers seriously reduces Synthetic Aperture Radar (SAR) ship detection accuracy.

SofGAN: A Portrait Image Generator with Dynamic Styling

1 code implementation7 Jul 2020 Anpei Chen, Ruiyang Liu, Ling Xie, Zhang Chen, Hao Su, Jingyi Yu

To address this issue, we propose a SofGAN image generator to decouple the latent space of portraits into two subspaces: a geometry space and a texture space.

2D Semantic Segmentation Image Generation +1

Rethinking Sampling in 3D Point Cloud Generative Adversarial Networks

no code implementations12 Jun 2020 He Wang, Zetian Jiang, Li Yi, Kaichun Mo, Hao Su, Leonidas J. Guibas

We further study how different evaluation metrics weigh the sampling pattern against the geometry and propose several perceptual metrics forming a sampling spectrum of metrics.

Deep Photon Mapping

no code implementations25 Apr 2020 Shilin Zhu, Zexiang Xu, Henrik Wann Jensen, Hao Su, Ravi Ramamoorthi

This network is easy to incorporate in many previous photon mapping methods (by simply swapping the kernel density estimator) and can produce high-quality reconstructions of complex global illumination effects like caustics with an order of magnitude fewer photons compared to previous photon mapping methods.

Denoising Density Estimation

MangaGAN: Unpaired Photo-to-Manga Translation Based on The Methodology of Manga Drawing

no code implementations22 Apr 2020 Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Jiahe Cui, Ji Wan

Manga is a world popular comic form originated in Japan, which typically employs black-and-white stroke lines and geometric exaggeration to describe humans' appearances, poses, and actions.


SAPIEN: A SimulAted Part-based Interactive ENvironment

1 code implementation CVPR 2020 Fanbo Xiang, Yuzhe Qin, Kaichun Mo, Yikuan Xia, Hao Zhu, Fangchen Liu, Minghua Liu, Hanxiao Jiang, Yifu Yuan, He Wang, Li Yi, Angel X. Chang, Leonidas J. Guibas, Hao Su

To achieve this task, a simulated environment with physically realistic simulation, sufficient articulated objects, and transferability to the real robot is indispensable.

Regret Bounds for Discounted MDPs

no code implementations12 Feb 2020 Shuang Liu, Hao Su

We give motivations and derive lower and upper bounds for such measures.


Deep Stereo using Adaptive Thin Volume Representation with Uncertainty Awareness

1 code implementation CVPR 2020 Shuo Cheng, Zexiang Xu, Shilin Zhu, Zhuwen Li, Li Erran Li, Ravi Ramamoorthi, Hao Su

In contrast, we propose adaptive thin volumes (ATVs); in an ATV, the depth hypothesis of each plane is spatially varying, which adapts to the uncertainties of previous per-pixel depth predictions.

3D Reconstruction Point Clouds

StructEdit: Learning Structural Shape Variations

1 code implementation CVPR 2020 Kaichun Mo, Paul Guerrero, Li Yi, Hao Su, Peter Wonka, Niloy Mitra, Leonidas J. Guibas

Learning to encode differences in the geometry and (topological) structure of the shapes of ordinary objects is key to generating semantically plausible variations of a given shape, transferring edits from one shape to another, and many other applications in 3D content creation.

Normal Assisted Stereo Depth Estimation

1 code implementation CVPR 2020 Uday Kusupati, Shuo Cheng, Rui Chen, Hao Su

We couple the learning of a multi-view normal estimation module and a multi-view depth estimation module.

Stereo Depth Estimation

State Alignment-based Imitation Learning

no code implementations ICLR 2020 Fangchen Liu, Zhan Ling, Tongzhou Mu, Hao Su

Consider an imitation learning problem that the imitator and the expert have different dynamics models.

Imitation Learning reinforcement-learning

Information-Theoretic Local Minima Characterization and Regularization

1 code implementation ICML 2020 Zhiwei Jia, Hao Su

Recent advances in deep learning theory have evoked the study of generalizability across different local minima of deep neural networks (DNNs).

Learning Theory

S4G: Amodal Single-view Single-Shot SE(3) Grasp Detection in Cluttered Scenes

no code implementations31 Oct 2019 Yuzhe Qin, Rui Chen, Hao Zhu, Meng Song, Jing Xu, Hao Su

Grasping is among the most fundamental and long-lasting problems in robotics study.

Multi-view PointNet for 3D Scene Understanding

no code implementations30 Sep 2019 Maximilian Jaritz, Jiayuan Gu, Hao Su

Fusion of 2D images and 3D point clouds is important because information from dense images can enhance sparse point clouds.

3D Instance Segmentation 3D Semantic Segmentation +1

Pre-training as Batch Meta Reinforcement Learning with tiMe

no code implementations25 Sep 2019 Quan Vuong, Shuang Liu, Minghua Liu, Kamil Ciosek, Hao Su, Henrik Iskov Christensen

Combining ideas from Batch RL and Meta RL, we propose tiMe, which learns distillation of multiple value functions and MDP embeddings from only existing data.

Meta Reinforcement Learning reinforcement-learning

Model Imitation for Model-Based Reinforcement Learning

no code implementations25 Sep 2019 Yueh-Hua Wu, Ting-Han Fan, Peter J. Ramadge, Hao Su

Based on the claim, we propose to learn the transition model by matching the distributions of multi-step rollouts sampled from the transition model and the real ones via WGAN.

Model-based Reinforcement Learning reinforcement-learning

Adversarial shape perturbations on 3D point clouds

2 code implementations16 Aug 2019 Daniel Liu, Ronald Yu, Hao Su

The importance of training robust neural network grows as 3D data is increasingly utilized in deep learning for vision tasks in robotics, drone control, and autonomous driving.

3D Point Cloud Classification Autonomous Driving +1

Mapping State Space using Landmarks for Universal Goal Reaching

1 code implementation NeurIPS 2019 Zhiao Huang, Fangchen Liu, Hao Su

An agent that has well understood the environment should be able to apply its skills for any given goals, leading to the fundamental problem of learning the Universal Value Function Approximator (UVFA).

Point-Based Multi-View Stereo Network

1 code implementation ICCV 2019 Rui Chen, Songfang Han, Jing Xu, Hao Su

More specifically, our method predicts the depth in a coarse-to-fine manner.

StructureNet: Hierarchical Graph Networks for 3D Shape Generation

2 code implementations1 Aug 2019 Kaichun Mo, Paul Guerrero, Li Yi, Hao Su, Peter Wonka, Niloy Mitra, Leonidas J. Guibas

We introduce StructureNet, a hierarchical graph network which (i) can directly encode shapes represented as such n-ary graphs; (ii) can be robustly trained on large and complex shape families; and (iii) can be used to generate a great diversity of realistic structured shape geometries.

3D Shape Generation

How to pick the domain randomization parameters for sim-to-real transfer of reinforcement learning policies?

1 code implementation28 Mar 2019 Quan Vuong, Sharad Vikram, Hao Su, Sicun Gao, Henrik I. Christensen

A human-specified design choice in domain randomization is the form and parameters of the distribution of simulated environments.

DeepSpline: Data-Driven Reconstruction of Parametric Curves and Surfaces

2 code implementations12 Jan 2019 Jun Gao, Chengcheng Tang, Vignesh Ganapathi-Subramanian, Jiahui Huang, Hao Su, Leonidas J. Guibas

Reconstruction of geometry based on different input modes, such as images or point clouds, has been instrumental in the development of computer aided design and computer graphics.

Extending Adversarial Attacks and Defenses to Deep 3D Point Cloud Classifiers

1 code implementation10 Jan 2019 Daniel Liu, Ronald Yu, Hao Su

We present a preliminary evaluation of adversarial attacks on deep 3D point cloud classifiers, namely PointNet and PointNet++, by evaluating both white-box and black-box adversarial attacks that were proposed for 2D images and extending those attacks to reduce the perceptibility of the perturbations in 3D space.

3D Object Classification General Classification +1

A Main/Subsidiary Network Framework for Simplifying Binary Neural Network

no code implementations11 Dec 2018 Yinghao Xu, Xin Dong, Yudian Li, Hao Su

To reduce memory footprint and run-time latency, techniques such as neural network pruning and binarization have been explored separately.

Binarization Image Classification +1

Adversarial Defense by Stratified Convolutional Sparse Coding

no code implementations CVPR 2019 Bo Sun, Nian-hsuan Tsai, Fangchen Liu, Ronald Yu, Hao Su

We propose an adversarial defense method that achieves state-of-the-art performance among attack-agnostic adversarial defense methods while also maintaining robustness to input resolution, scale of adversarial perturbation, and scale of dataset size.

Adversarial Defense

ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving

no code implementations CVPR 2019 Xibin Song, Peng Wang, Dingfu Zhou, Rui Zhu, Chenye Guan, Yuchao Dai, Hao Su, Hongdong Li, Ruigang Yang

Specifically, we first segment each car with a pre-trained Mask R-CNN, and then regress towards its 3D pose and shape based on a deformable 3D car model with or without using semantic keypoints.

3D Car Instance Understanding Autonomous Driving

Towards Fast and Energy-Efficient Binarized Neural Network Inference on FPGA

no code implementations4 Oct 2018 Cheng Fu, Shilin Zhu, Hao Su, Ching-En Lee, Jishen Zhao

Thus there does exist redundancy that can be exploited to further reduce the amount of on-chip computations.

Transfer Value or Policy? A Value-centric Framework Towards Transferrable Continuous Reinforcement Learning

no code implementations27 Sep 2018 Xingchao Liu, Tongzhou Mu, Hao Su

In this paper, we investigate the problem of transfer learning across environments with different dynamics while accomplishing the same task in the continuous control domain.

Continuous Control reinforcement-learning +1

Deep Part Induction from Articulated Object Pairs

1 code implementation19 Sep 2018 Li Yi, Haibin Huang, Difan Liu, Evangelos Kalogerakis, Hao Su, Leonidas Guibas

In this paper, we explore how the observation of different articulation states provides evidence for part structure and motion of 3D objects.

Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?

1 code implementation CVPR 2019 Shilin Zhu, Xin Dong, Hao Su

Binary neural networks (BNN) have been studied extensively since they run dramatically faster at lower memory and power consumption than floating-point networks, thanks to the efficiency of bit operations.

Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning

no code implementations CVPR 2018 Chuang Gan, Boqing Gong, Kun Liu, Hao Su, Leonidas J. Guibas

In addition, we also find that a progressive training strategy can foster a better neural network for the video recognition task than blindly pooling the distinct sources of geometry cues together.

Action Recognition Frame +5

Deep Functional Dictionaries: Learning Consistent Semantic Structures on 3D Models from Functions

1 code implementation NeurIPS 2018 Minhyuk Sung, Hao Su, Ronald Yu, Leonidas Guibas

Even though our shapes have independent discretizations and no functional correspondences are provided, the network is able to generate latent bases, in a consistent order, that reflect the shared semantic structure among the shapes.

Robust Watertight Manifold Surface Generation Method for ShapeNet Models

2 code implementations5 Feb 2018 Jingwei Huang, Hao Su, Leonidas Guibas

In this paper, we describe a robust algorithm for 2-Manifold generation of various kinds of ShapeNet Models.

Computational Geometry

ComplementMe: Weakly-Supervised Component Suggestions for 3D Modeling

1 code implementation6 Aug 2017 Minhyuk Sung, Hao Su, Vladimir G. Kim, Siddhartha Chaudhuri, Leonidas Guibas

The combinatorial nature of part arrangements poses another challenge, since the retrieval network is not a function: several complements can be appropriate for the same input.

Graphics I.3.5

Learning Non-Lambertian Object Intrinsics across ShapeNet Categories

1 code implementation CVPR 2017 Jian Shi, Yue Dong, Hao Su, Stella X. Yu

Rendered with realistic environment maps, millions of synthetic images of objects and their corresponding albedo, shading, and specular ground-truth images are used to train an encoder-decoder CNN.

Beyond Holistic Object Recognition: Enriching Image Understanding with Part States

no code implementations CVPR 2018 Cewu Lu, Hao Su, Yongyi Lu, Li Yi, Chi-Keung Tang, Leonidas Guibas

Important high-level vision tasks such as human-object interaction, image captioning and robotic manipulation require rich semantic descriptions of objects at part level.

Human-Object Interaction Detection Image Captioning +1

SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation

no code implementations CVPR 2017 Li Yi, Hao Su, Xingwen Guo, Leonidas Guibas

To enable the prediction of vertex functions on them by convolutional neural networks, we resort to spectral CNN method that enables weight sharing by parameterizing kernels in the spectral domain spanned by graph laplacian eigenbases.

3D Part Segmentation

A Point Set Generation Network for 3D Object Reconstruction from a Single Image

4 code implementations CVPR 2017 Haoqiang Fan, Hao Su, Leonidas Guibas

Our final solution is a conditional shape sampler, capable of predicting multiple plausible 3D point clouds from an input image.

Ranked #2 on 3D Reconstruction on Data3D−R2N2 (using extra training data)

3D Object Reconstruction From A Single Image 3D Reconstruction

Learning Shape Abstractions by Assembling Volumetric Primitives

3 code implementations CVPR 2017 Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, Jitendra Malik

We present a learning framework for abstracting complex shapes by learning to assemble objects using 3D volumetric primitives.

Multilinear Hyperplane Hashing

no code implementations CVPR 2016 Xianglong Liu, Xinjie Fan, Cheng Deng, Zhujin Li, Hao Su, DaCheng Tao

Despite its successful progress in classic point-to-point search, there are few studies regarding point-to-hyperplane search, which has strong practical capabilities of scaling up in many applications like active learning with SVMs.

Active Learning Quantization

Volumetric and Multi-View CNNs for Object Classification on 3D Data

2 code implementations CVPR 2016 Charles R. Qi, Hao Su, Matthias Niessner, Angela Dai, Mengyuan Yan, Leonidas J. Guibas

Empirical results from these two types of CNNs exhibit a large gap, indicating that existing volumetric CNN architectures and approaches are unable to fully exploit the power of 3D representations.

3D Object Recognition 3D Point Cloud Classification +1

3D-Assisted Feature Synthesis for Novel Views of an Object

no code implementations ICCV 2015 Hao Su, Fan Wang, Eric Yi, Leonidas J. Guibas

Comparing two images from different views has been a long-standing challenging problem in computer vision, as visual features are not stable under large view point changes.

Image Retrieval

Density Estimation via Discrepancy

no code implementations23 Sep 2015 Kun Yang, Hao Su, Wing Hung Wang

Given i. i. d samples from some unknown continuous density on hyper-rectangle $[0, 1]^d$, we attempt to learn a piecewise constant function that approximates this underlying density non-parametrically.

Density Estimation

Pose Estimation Based on 3D Models

no code implementations20 Jun 2015 Chuiwen Ma, Hao Su, Liang Shi

In this paper, we proposed a pose estimation system based on rendered image training set, which predicts the pose of objects in real image, with knowledge of object category and tight bounding box.

General Classification Multi-class Classification +1

3D-Assisted Image Feature Synthesis for Novel Views of an Object

no code implementations26 Nov 2014 Hao Su, Fan Wang, Li Yi, Leonidas Guibas

In this paper, given a single input image of an object, we synthesize new features for other views of the same object.

Image Retrieval

ImageNet Large Scale Visual Recognition Challenge

9 code implementations1 Sep 2014 Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, Li Fei-Fei

The ImageNet Large Scale Visual Recognition Challenge is a benchmark in object category classification and detection on hundreds of object categories and millions of images.

General Classification Image Classification +2

Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification

no code implementations NeurIPS 2010 Li-Jia Li, Hao Su, Li Fei-Fei, Eric P. Xing

Robust low-level image features have been proven to be effective representations for a variety of visual recognition tasks such as object recognition and scene classification; but pixels, or even local image patches, carry little semantic meanings.

General Classification Object Recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.