Search Results for author: QiXing Huang

Found 36 papers, 20 papers with code

Scalable Semidefinite Relaxation for Maximum A Posterior Estimation

no code implementations19 May 2014 Qixing Huang, Yuxin Chen, Leonidas Guibas

Maximum a posteriori (MAP) inference over discrete Markov random fields is a fundamental task spanning a wide spectrum of real-world applications, which is known to be NP-hard for general graphs.

Dense Correspondences between Human Bodies via Learning Transformation Synchronization on Graphs

no code implementations NeurIPS 2020 Xiangru Huang, Haitao Yang, Etienne Vouga, QiXing Huang

We introduce an approach for establishing dense correspondences between partial scans of human models and a complete template model.

HPNet: Deep Primitive Segmentation Using Hybrid Representations

1 code implementation ICCV 2021 Siming Yan, Zhenpei Yang, Chongyang Ma, Haibin Huang, Etienne Vouga, QiXing Huang

This paper introduces HPNet, a novel deep-learning approach for segmenting a 3D shape represented as a point cloud into primitive patches.

Clustering Segmentation

Attention-guided Temporally Coherent Video Object Matting

1 code implementation24 May 2021 Yunke Zhang, Chi Wang, Miaomiao Cui, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Hujun Bao, QiXing Huang, Weiwei Xu

Experimental results show that our method can generate high-quality alpha mattes for various videos featuring appearance change, occlusion, and fast motion.

Image Matting Object +4

Learning View Selection for 3D Scenes

no code implementations CVPR 2021 Yifan Sun, QiXing Huang, Dun-Yu Hsiao, Li Guan, Gang Hua

Efficient 3D space sampling to represent an underlying3D object/scene is essential for 3D vision, robotics, and be-yond.

Scene Synthesis via Uncertainty-Driven Attribute Synchronization

1 code implementation ICCV 2021 Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, QiXing Huang

This task is challenging because 3D scenes exhibit diverse patterns, ranging from continuous ones, such as object sizes and the relative poses between pairs of shapes, to discrete patterns, such as occurrence and co-occurrence of objects with symmetrical relationships.

Attribute

InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition

1 code implementation CVPR 2022 Hyung-gun Chi, Myoung Hoon Ha, Seunggeun Chi, Sang Wan Lee, QiXing Huang, Karthik Ramani

Human skeleton-based action recognition offers a valuable means to understand the intricacies of human behavior because it can handle the complex relationships between physical constraints and intention.

Action Recognition Representation Learning +1

FvOR: Robust Joint Shape and Pose Optimization for Few-view Object Reconstruction

1 code implementation CVPR 2022 Zhenpei Yang, Zhile Ren, Miguel Angel Bautista, Zaiwei Zhang, Qi Shan, QiXing Huang

In this paper, we present FvOR, a learning-based object reconstruction method that predicts accurate 3D models given a few images with noisy input poses.

Object Reconstruction Pose Estimation

Neural Volumetric Mesh Generator

no code implementations6 Oct 2022 Yan Zheng, Lemeng Wu, Xingchao Liu, Zhen Chen, Qiang Liu, QiXing Huang

We first propose a diffusion-based generative model to tackle this problem by generating voxelized shapes with close-to-reality outlines and structures.

Pose Synchronization Under Multiple Pair-Wise Relative Poses

no code implementations CVPR 2023 Yifan Sun, QiXing Huang

The first step performs diffusion and clustering to compute the candidate poses of the input objects.

DeblurSR: Event-Based Motion Deblurring Under the Spiking Representation

1 code implementation15 Mar 2023 Chen Song, Chandrajit Bajaj, QiXing Huang

We additionally show that our approach easily extends to video super-resolution when combined with recent advances in implicit neural representation.

Deblurring Video Super-Resolution

LiDAR-Based 3D Object Detection via Hybrid 2D Semantic Scene Generation

1 code implementation4 Apr 2023 Haitao Yang, Zaiwei Zhang, Xiangru Huang, Min Bai, Chen Song, Bo Sun, Li Erran Li, QiXing Huang

Bird's-Eye View (BEV) features are popular intermediate scene representations shared by the 3D backbone and the detector head in LiDAR-based object detectors.

3D Object Detection object-detection +1

3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining

no code implementations14 Apr 2023 Siming Yan, YuQi Yang, YuXiao Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu, QiXing Huang

Masked autoencoders (MAE) have recently been introduced to 3D self-supervised pretraining for point clouds due to their great success in NLP and computer vision.

GenCorres: Consistent Shape Matching via Coupled Implicit-Explicit Shape Generative Models

1 code implementation20 Apr 2023 Haitao Yang, Xiangru Huang, Bo Sun, Chandrajit Bajaj, QiXing Huang

GenCorres addresses this issue by learning an implicit generator from the input shapes, which provides intermediate shapes between two arbitrary shapes.

Jigsaw: Learning to Assemble Multiple Fractured Objects

1 code implementation NeurIPS 2023 Jiaxin Lu, Yifan Sun, QiXing Huang

Our framework consists of four components: (1) front-end point feature extractor with attention layers, (2) surface segmentation to separate fracture and original parts, (3) multi-parts matching to find correspondences among fracture surface points, and (4) robust global alignment to recover the global poses of the pieces.

Multi-View Representation is What You Need for Point-Cloud Pre-Training

no code implementations5 Jun 2023 Siming Yan, Chen Song, Youkang Kong, QiXing Huang

Different from the popular practice of predicting 2D features first and then obtaining 3D features through dimensionality lifting, our approach directly uses a 3D network for feature extraction.

3D Object Detection 3D Shape Classification +4

Detector-Free Structure from Motion

1 code implementation27 Jun 2023 Xingyi He, Jiaming Sun, Yifan Wang, Sida Peng, QiXing Huang, Hujun Bao, Xiaowei Zhou

We propose a new detector-free SfM framework to draw benefits from the recent success of detector-free matchers to avoid the early determination of keypoints, while solving the multi-view inconsistency issue of detector-free matchers.

Keypoint Detection

Rapid Flood Inundation Forecast Using Fourier Neural Operator

no code implementations29 Jul 2023 Alexander Y. Sun, Zhi Li, Wonhyun Lee, QiXing Huang, Bridget R. Scanlon, Clint Dawson

Flood inundation forecast provides critical information for emergency planning before and during flood events.

Depth Estimation Depth Prediction

Cloth2Tex: A Customized Cloth Texture Generation Pipeline for 3D Virtual Try-On

no code implementations8 Aug 2023 Daiheng Gao, Xu Chen, Xindi Zhang, Qi Wang, Ke Sun, Bang Zhang, Liefeng Bo, QiXing Huang

Since traditional warping-based texture generation methods require a significant number of control points to be manually selected for each type of garment, which can be a time-consuming and tedious process.

Texture Synthesis Virtual Try-on

What is the Best Automated Metric for Text to Motion Generation?

no code implementations19 Sep 2023 Jordan Voas, Yili Wang, QiXing Huang, Raymond Mooney

Our findings indicate that none of the metrics currently used for this task show even a moderate correlation with human judgments on a sample level.

InfoGCN++: Learning Representation by Predicting the Future for Online Human Skeleton-based Action Recognition

2 code implementations16 Oct 2023 Seunggeun Chi, Hyung-gun Chi, QiXing Huang, Karthik Ramani

To overcome this barrier, we introduce InfoGCN++, an innovative extension of InfoGCN, explicitly developed for online skeleton-based action recognition.

Action Recognition Skeleton Based Action Recognition

Instance-aware 3D Semantic Segmentation powered by Shape Generators and Classifiers

no code implementations21 Nov 2023 Bo Sun, QiXing Huang, Xiangru Huang

In the experiments, our method significantly outperform existing approaches in 3D semantic segmentation on several public benchmarks, such as Waymo Open Dataset, SemanticKITTI and ScanNetV2.

3D Semantic Segmentation Segmentation

UGG: Unified Generative Grasping

1 code implementation28 Nov 2023 Jiaxin Lu, Hao Kang, Haoxiang Li, Bo Liu, Yiding Yang, QiXing Huang, Gang Hua

Generation-based methods that generate grasping postures conditioned on the object can often produce diverse grasping, but they are insufficient for high grasping success due to lack of discriminative information.

Grasp Generation Object

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling

no code implementations9 Feb 2024 Siming Yan, Min Bai, Weifeng Chen, Xiong Zhou, QiXing Huang, Li Erran Li

By combining natural language understanding, generation capabilities, and breadth of knowledge of large language models with image perception, recent large vision language models (LVLMs) have shown unprecedented visual reasoning capabilities.

Hallucination Natural Language Understanding +2

VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

no code implementations18 Mar 2024 Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang

Images from video generative models are more suitable for multi-view generation because the underlying network architecture that generates them employs a temporal module to enforce frame consistency.

Denoising

An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-to-Image Models

no code implementations22 Mar 2024 Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Zilong Dong, Liefeng Bo, QiXing Huang

In particular, the third and fourth stages are iterated, with the cuts obtained in the fourth stage encouraging non-rigid alignment in the third stage to focus on regions close to the cuts.

Cannot find the paper you are looking for? You can Submit a new open access paper.