Search Results for author: QiXing Huang

Found 36 papers, 20 papers with code

Scalable Semidefinite Relaxation for Maximum A Posterior Estimation

no code implementations • 19 May 2014 • Qixing Huang, Yuxin Chen, Leonidas Guibas

Maximum a posteriori (MAP) inference over discrete Markov random fields is a fundamental task spanning a wide spectrum of real-world applications, which is known to be NP-hard for general graphs.

Paper
Add Code

Single-View Reconstruction via Joint Analysis of Image and Shape Collections

1 code implementation • ACM Transactions on Graphics 2015 • Qixing Huang, Hai Wang, Vladlen Koltun

We present an approach to automatic 3D reconstruction of objects depicted in Web images.

3D Reconstruction

Paper
Code

Dense Correspondences between Human Bodies via Learning Transformation Synchronization on Graphs

no code implementations • NeurIPS 2020 • Xiangru Huang, Haitao Yang, Etienne Vouga, QiXing Huang

We introduce an approach for establishing dense correspondences between partial scans of human models and a complete template model.

Paper
Add Code

MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions

1 code implementation • CVPR 2022 • Zhenpei Yang, Zhile Ren, Qi Shan, QiXing Huang

Deep learning has made significant impacts on multi-view stereo systems.

121

Paper
Code

HPNet: Deep Primitive Segmentation Using Hybrid Representations

1 code implementation • ICCV 2021 • Siming Yan, Zhenpei Yang, Chongyang Ma, Haibin Huang, Etienne Vouga, QiXing Huang

This paper introduces HPNet, a novel deep-learning approach for segmenting a 3D shape represented as a point cloud into primitive patches.

Clustering Segmentation

Paper
Code

Attention-guided Temporally Coherent Video Object Matting

1 code implementation • 24 May 2021 • Yunke Zhang, Chi Wang, Miaomiao Cui, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Hujun Bao, QiXing Huang, Weiwei Xu

Experimental results show that our method can generate high-quality alpha mattes for various videos featuring appearance change, occlusion, and fast motion.

Image Matting Object +4

Paper
Code

StruMonoNet: Structure-Aware Monocular 3D Prediction

no code implementations • CVPR 2021 • Zhenpei Yang, Li Erran Li, QiXing Huang

Monocular 3D prediction is one of the fundamental problems in 3D vision.

Paper
Add Code

Learning View Selection for 3D Scenes

no code implementations • CVPR 2021 • Yifan Sun, QiXing Huang, Dun-Yu Hsiao, Li Guan, Gang Hua

Efficient 3D space sampling to represent an underlying3D object/scene is essential for 3D vision, robotics, and be-yond.

Paper
Add Code

ARAPReg: An As-Rigid-As Possible Regularization Loss for Learning Deformable Shape Generators

1 code implementation • ICCV 2021 • QiXing Huang, Xiangru Huang, Bo Sun, Zaiwei Zhang, Junfeng Jiang, Chandrajit Bajaj

Our approach builds on an approximation of the as-rigid-as possible (or ARAP) deformation energy.

3D Shape Generation 3D Shape Reconstruction

149

Paper
Code

Scene Synthesis via Uncertainty-Driven Attribute Synchronization

1 code implementation • ICCV 2021 • Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, QiXing Huang

This task is challenging because 3D scenes exhibit diverse patterns, ranging from continuous ones, such as object sizes and the relative poses between pairs of shapes, to discrete patterns, such as occurrence and co-occurrence of objects with symmetrical relationships.

Attribute

Paper
Code

InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition

1 code implementation • CVPR 2022 • Hyung-gun Chi, Myoung Hoon Ha, Seunggeun Chi, Sang Wan Lee, QiXing Huang, Karthik Ramani

Human skeleton-based action recognition offers a valuable means to understand the intricacies of human behavior because it can handle the complex relationships between physical constraints and intention.

Ranked #7 on Skeleton Based Action Recognition on N-UCLA

Action Recognition Representation Learning +1

106

Paper
Code

Implicit Autoencoder for Point-Cloud Self-Supervised Representation Learning

1 code implementation • ICCV 2023 • Siming Yan, Zhenpei Yang, Haoxiang Li, Chen Song, Li Guan, Hao Kang, Gang Hua, QiXing Huang

The most popular and accessible 3D representation, i. e., point clouds, involves discrete samples of the underlying continuous 3D surface.

Ranked #5 on 3D Point Cloud Linear Classification on ModelNet40 (using extra training data)

3D Point Cloud Classification 3D Point Cloud Linear Classification +3

Paper
Code

E-CIR: Event-Enhanced Continuous Intensity Recovery

1 code implementation • CVPR 2022 • Chen Song, QiXing Huang, Chandrajit Bajaj

A camera begins to sense light the moment we press the shutter button.

Deblurring

Paper
Code

FvOR: Robust Joint Shape and Pose Optimization for Few-view Object Reconstruction

1 code implementation • CVPR 2022 • Zhenpei Yang, Zhile Ren, Miguel Angel Bautista, Zaiwei Zhang, Qi Shan, QiXing Huang

In this paper, we present FvOR, a learning-based object reconstruction method that predicts accurate 3D models given a few images with noisy input poses.

Object Reconstruction Pose Estimation

Paper
Code

HM3D-ABO: A Photo-realistic Dataset for Object-centric Multi-view 3D Reconstruction

1 code implementation • 24 Jun 2022 • Zhenpei Yang, Zaiwei Zhang, QiXing Huang

Reconstructing 3D objects is an important computer vision task that has wide application in AR/VR.

3D Reconstruction Multi-View 3D Reconstruction +3

Paper
Code

PatchRD: Detail-Preserving Shape Completion by Learning Patch Retrieval and Deformation

1 code implementation • 24 Jul 2022 • Bo Sun, Vladimir G. Kim, Noam Aigerman, QiXing Huang, Siddhartha Chaudhuri

Our key insight is to copy and deform patches from the partial input to complete missing regions.

Retrieval

Paper
Code

Neural Volumetric Mesh Generator

no code implementations • 6 Oct 2022 • Yan Zheng, Lemeng Wu, Xingchao Liu, Zhen Chen, Qiang Liu, QiXing Huang

We first propose a diffusion-based generative model to tackle this problem by generating voxelized shapes with close-to-reality outlines and structures.

Paper
Add Code

Pose Synchronization Under Multiple Pair-Wise Relative Poses

no code implementations • CVPR 2023 • Yifan Sun, QiXing Huang

The first step performs diffusion and clustering to compute the candidate poses of the input objects.

Paper
Add Code

DeblurSR: Event-Based Motion Deblurring Under the Spiking Representation

1 code implementation • 15 Mar 2023 • Chen Song, Chandrajit Bajaj, QiXing Huang

We additionally show that our approach easily extends to video super-resolution when combined with recent advances in implicit neural representation.

Deblurring Video Super-Resolution

Paper
Code

LiDAR-Based 3D Object Detection via Hybrid 2D Semantic Scene Generation

1 code implementation • 4 Apr 2023 • Haitao Yang, Zaiwei Zhang, Xiangru Huang, Min Bai, Chen Song, Bo Sun, Li Erran Li, QiXing Huang

Bird's-Eye View (BEV) features are popular intermediate scene representations shared by the 3D backbone and the detector head in LiDAR-based object detectors.

3D Object Detection object-detection +1

Paper
Code

3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining

no code implementations • 14 Apr 2023 • Siming Yan, YuQi Yang, YuXiao Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu, QiXing Huang

Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision.

Paper
Add Code

GenCorres: Consistent Shape Matching via Coupled Implicit-Explicit Shape Generative Models

1 code implementation • 20 Apr 2023 • Haitao Yang, Xiangru Huang, Bo Sun, Chandrajit Bajaj, QiXing Huang

GenCorres addresses this issue by learning an implicit generator from the input shapes, which provides intermediate shapes between two arbitrary shapes.

Paper
Code

Jigsaw: Learning to Assemble Multiple Fractured Objects

1 code implementation • NeurIPS 2023 • Jiaxin Lu, Yifan Sun, QiXing Huang

Our framework consists of four components: (1) front-end point feature extractor with attention layers, (2) surface segmentation to separate fracture and original parts, (3) multi-parts matching to find correspondences among fracture surface points, and (4) robust global alignment to recover the global poses of the pieces.

Paper
Code

Multi-View Representation is What You Need for Point-Cloud Pre-Training

no code implementations • 5 Jun 2023 • Siming Yan, Chen Song, Youkang Kong, QiXing Huang

Different from the popular practice of predicting 2D features first and then obtaining 3D features through dimensionality lifting, our approach directly uses a 3D network for feature extraction.

3D Object Detection 3D Shape Classification +4

Paper
Add Code

Detector-Free Structure from Motion

1 code implementation • 27 Jun 2023 • Xingyi He, Jiaming Sun, Yifan Wang, Sida Peng, QiXing Huang, Hujun Bao, Xiaowei Zhou

We propose a new detector-free SfM framework to draw benefits from the recent success of detector-free matchers to avoid the early determination of keypoints, while solving the multi-view inconsistency issue of detector-free matchers.

Keypoint Detection

397

Paper
Code

Rapid Flood Inundation Forecast Using Fourier Neural Operator

no code implementations • 29 Jul 2023 • Alexander Y. Sun, Zhi Li, Wonhyun Lee, QiXing Huang, Bridget R. Scanlon, Clint Dawson

Flood inundation forecast provides critical information for emergency planning before and during flood events.

Depth Estimation Depth Prediction

Paper
Add Code

Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On

no code implementations • 8 Aug 2023 • Daiheng Gao, Xu Chen, Xindi Zhang, Qi Wang, Ke Sun, Bang Zhang, Liefeng Bo, QiXing Huang

Since traditional warping-based texture generation methods require a significant number of control points to be manually selected for each type of garment, which can be a time-consuming and tedious process.

Texture Synthesis Virtual Try-on

Paper
Add Code

What is the Best Automated Metric for Text to Motion Generation?

no code implementations • 19 Sep 2023 • Jordan Voas, Yili Wang, QiXing Huang, Raymond Mooney

Our findings indicate that none of the metrics currently used for this task show even a moderate correlation with human judgments on a sample level.

Paper
Add Code

LEAP: Liberate Sparse-view 3D Modeling from Camera Poses

1 code implementation • 2 Oct 2023 • Hanwen Jiang, Zhenyu Jiang, Yue Zhao, QiXing Huang

Are camera poses necessary for multi-view 3D modeling?

Novel View Synthesis

147

Paper
Code

InfoGCN++: Learning Representation by Predicting the Future for Online Human Skeleton-based Action Recognition

2 code implementations • 16 Oct 2023 • Seunggeun Chi, Hyung-gun Chi, QiXing Huang, Karthik Ramani

To overcome this barrier, we introduce InfoGCN++, an innovative extension of InfoGCN, explicitly developed for online skeleton-based action recognition.

Action Recognition Skeleton Based Action Recognition

Paper
Code

Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers

no code implementations • 21 Nov 2023 • Bo Sun, QiXing Huang, Xiangru Huang

In the experiments, our method significantly outperform existing approaches in 3D semantic segmentation on several public benchmarks, such as Waymo Open Dataset, SemanticKITTI and ScanNetV2.

3D Semantic Segmentation Segmentation

Paper
Add Code

UGG: Unified Generative Grasping

1 code implementation • 28 Nov 2023 • Jiaxin Lu, Hao Kang, Haoxiang Li, Bo Liu, Yiding Yang, QiXing Huang, Gang Hua

Generation-based methods that generate grasping postures conditioned on the object can often produce diverse grasping, but they are insufficient for high grasping success due to lack of discriminative information.

Grasp Generation Object

530

Paper
Code

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

no code implementations • 9 Feb 2024 • Siming Yan, Min Bai, Weifeng Chen, Xiong Zhou, QiXing Huang, Li Erran Li

By combining natural language understanding, generation capabilities, and breadth of knowledge of large language models with image perception, recent large vision language models (LVLMs) have shown unprecedented visual reasoning capabilities.

Hallucination Natural Language Understanding +2

Paper
Add Code

VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

no code implementations • 18 Mar 2024 • Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang

Images from video generative models are more suitable for multi-view generation because the underlying network architecture that generates them employs a temporal module to enforce frame consistency.

Denoising

Paper
Add Code

An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models

no code implementations • 22 Mar 2024 • Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Zilong Dong, Liefeng Bo, QiXing Huang

In particular, the third and fourth stages are iterated, with the cuts obtained in the fourth stage encouraging non-rigid alignment in the third stage to focus on regions close to the cuts.

Paper
Add Code

Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition

no code implementations • 3 Apr 2024 • Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang

This paper enables high-fidelity, transferable NeRF editing by frequency decomposition.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.