Search Results for author: Jiahui Huang

Found 35 papers, 17 papers with code

Learning to Track Any Points from Human Motion

no code implementations8 Jul 2025 Inès Hyeonsu Kim, Seokju Cho, Jahyeok Koo, Junghyun Park, Jiahui Huang, Joon-Young Lee, Seungryong Kim

Human motion, with its inherent complexities, such as non-rigid deformations, articulated movements, clothing distortions, and frequent occlusions caused by limbs or other individuals, provides a rich and challenging source of supervision that is crucial for training robust and generalizable point trackers.

Optical Flow Estimation Point Tracking

Seurat: From Moving Points to Depth

1 code implementation CVPR 2025 Seokju Cho, Jiahui Huang, Seungryong Kim, Joon-Young Lee

Accurate depth estimation from monocular videos remains challenging due to ambiguities inherent in single-view geometry, as crucial depth cues like stereopsis are absent.

Depth Estimation Point Tracking

VideoPanda: Video Panoramic Diffusion with Multi-view Attention

no code implementations15 Apr 2025 Kevin Xie, Amirmojtaba Sabour, Jiahui Huang, Despoina Paschalidou, Greg Klar, Umar Iqbal, Sanja Fidler, Xiaohui Zeng

High resolution panoramic video content is paramount for immersive experiences in Virtual Reality, but is non-trivial to collect as it requires specialized equipment and intricate camera setups.

Video Generation

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

1 code implementation CVPR 2025 Xuanchi Ren, Tianchang Shen, Jiahui Huang, Huan Ling, Yifan Lu, Merlin Nimier-David, Thomas Müller, Alexander Keller, Sanja Fidler, Jun Gao

Our results demonstrate more precise camera control than prior work, as well as state-of-the-art results in sparse-view novel view synthesis, even in challenging settings such as driving scenes and monocular dynamic video.

Novel View Synthesis Video Generation

SCube: Instant Large-Scale Scene Reconstruction using VoxSplats

no code implementations26 Oct 2024 Xuanchi Ren, Yifan Lu, Hanxue Liang, Zhangjie Wu, Huan Ling, Mike Chen, Sanja Fidler, Francis Williams, Jiahui Huang

We present SCube, a novel method for reconstructing large-scale 3D scenes (geometry, appearance, and semantics) from a sparse set of posed images.

3D Reconstruction Scene Generation

OmniRe: Omni Urban Scene Reconstruction

1 code implementation29 Aug 2024 Ziyu Chen, Jiawei Yang, Jiahui Huang, Riccardo de Lutio, Janick Martinez Esturo, Boris Ivanovic, Or Litany, Zan Gojcic, Sanja Fidler, Marco Pavone, Li Song, Yue Wang

We introduce OmniRe, a comprehensive system for efficiently creating high-fidelity digital twins of dynamic real-world scenes from on-device logs.

3DGS

Local All-Pair Correspondence for Point Tracking

2 code implementations22 Jul 2024 Seokju Cho, Jiahui Huang, Jisu Nam, Honggyu An, Seungryong Kim, Joon-Young Lee

We introduce LocoTrack, a highly accurate and efficient model designed for the task of tracking any point (TAP) across video sequences.

All Point Tracking

Approximately Piecewise E(3) Equivariant Point Networks

no code implementations13 Feb 2024 Matan Atzmon, Jiahui Huang, Francis Williams, Or Litany

Integrating a notion of symmetry into point cloud neural networks is a provably effective way to improve their generalization capability.

Uncertainty Quantification

FlowTrack: Revisiting Optical Flow for Long-Range Dense Tracking

no code implementations CVPR 2024 Seokju Cho, Jiahui Huang, Seungryong Kim, Joon-Young Lee

In the domain of video tracking existing methods often grapple with a trade-off between spatial density and temporal range.

Optical Flow Estimation

XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

1 code implementation CVPR 2024 Xuanchi Ren, Jiahui Huang, Xiaohui Zeng, Ken Museth, Sanja Fidler, Francis Williams

We present XCube (abbreviated as $\mathcal{X}^3$), a novel generative model for high-resolution sparse 3D voxel grids with arbitrary attributes.

3D Shape Generation Scene Generation +1

INVE: Interactive Neural Video Editing

no code implementations15 Jul 2023 Jiahui Huang, Leonid Sigal, Kwang Moo Yi, Oliver Wang, Joon-Young Lee

We present Interactive Neural Video Editing (INVE), a real-time video editing solution, which can assist the video editing process by consistently propagating sparse frame edits to the entire video clip.

Video Editing

Neural Kernel Surface Reconstruction

1 code implementation CVPR 2023 Jiahui Huang, Zan Gojcic, Matan Atzmon, Or Litany, Sanja Fidler, Francis Williams

We present a novel method for reconstructing a 3D implicit surface from a large-scale, sparse, and noisy point cloud.

Surface Reconstruction

Revisiting Acceptability Judgements

1 code implementation23 May 2023 Hai Hu, Ziyin Zhang, Weifang Huang, Jackie Yan-Ki Lai, Aini Li, Yina Patterson, Jiahui Huang, Peng Zhang, Chien-Jer Charles Lin, Rui Wang

We introduce CoLAC - Corpus of Linguistic Acceptability in Chinese, the first large-scale acceptability dataset for a non-Indo-European language.

Cross-Lingual Transfer Linguistic Acceptability

DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion

no code implementations ICCV 2023 Kiyohiro Nakayama, Mikaela Angelina Uy, Jiahui Huang, Shi-Min Hu, Ke Li, Leonidas J Guibas

We propose a factorization that models independent part style and part configuration distributions and presents a novel cross-diffusion network that enables us to generate coherent and plausible shapes under our proposed factorization.

Point Cloud Generation

FedSDG-FS: Efficient and Secure Feature Selection for Vertical Federated Learning

no code implementations21 Feb 2023 Anran Li, Hongyi Peng, Lan Zhang, Jiahui Huang, Qing Guo, Han Yu, Yang Liu

Vertical Federated Learning (VFL) enables multiple data owners, each holding a different subset of features about largely overlapping sets of data sample(s), to jointly train a useful global model.

Feature Importance feature selection +1

Dynamic 3D Scene Analysis by Point Cloud Accumulation

1 code implementation25 Jul 2022 Shengyu Huang, Zan Gojcic, Jiahui Huang, Andreas Wieser, Konrad Schindler

Compared to state-of-the-art scene flow estimators, our proposed approach aims to align all 3D points in a common reference frame correctly accumulating the points on the individual objects.

Autonomous Vehicles Semantic Segmentation +1

CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-scale Indoor Scene

no code implementations25 Nov 2021 Haoxiang Chen, Jiahui Huang, Tai-Jiang Mu, Shi-Min Hu

We present CIRCLE, a framework for large-scale scene completion and geometric refinement based on local implicit signed distance functions.

Multiway Non-rigid Point Cloud Registration via Learned Functional Map Synchronization

1 code implementation25 Nov 2021 Jiahui Huang, Tolga Birdal, Zan Gojcic, Leonidas J. Guibas, Shi-Min Hu

We present SyNoRiM, a novel way to jointly register multiple non-rigid shapes by synchronizing the maps relating learned functions defined on the point clouds.

Point Cloud Registration

Layered Controllable Video Generation

no code implementations24 Nov 2021 Jiahui Huang, Yuhe Jin, Kwang Moo Yi, Leonid Sigal

In the first stage, with the rich set of losses and dynamic foreground size prior, we learn how to separate the frame into foreground and background layers and, conditioned on these layers, how to generate the next frame using VQ-VAE generator.

Video Generation

Subdivision-Based Mesh Convolution Networks

1 code implementation4 Jun 2021 Shi-Min Hu, Zheng-Ning Liu, Meng-Hao Guo, Jun-Xiong Cai, Jiahui Huang, Tai-Jiang Mu, Ralph R. Martin

Meshes with arbitrary connectivity can be remeshed to have Loop subdivision sequence connectivity via self-parameterization, making SubdivNet a general approach.

3D Classification Pose Estimation

MultiBodySync: Multi-Body Segmentation and Motion Estimation via 3D Scan Synchronization

1 code implementation CVPR 2021 Jiahui Huang, He Wang, Tolga Birdal, Minhyuk Sung, Federica Arrigoni, Shi-Min Hu, Leonidas Guibas

We present MultiBodySync, a novel, end-to-end trainable multi-body motion segmentation and rigid registration framework for multiple input 3D point clouds.

Motion Estimation Motion Segmentation +1

DI-Fusion: Online Implicit 3D Reconstruction with Deep Priors

1 code implementation CVPR 2021 Jiahui Huang, Shi-Sheng Huang, Haoxuan Song, Shi-Min Hu

Previous online 3D dense reconstruction methods struggle to achieve the balance between memory storage and surface quality, largely due to the usage of stagnant underlying geometry representation, such as TSDF (truncated signed distance functions) or surfels, without any knowledge of the scene priors.

3D Reconstruction

Duality Diagram Similarity: a generic framework for initialization selection in task transfer learning

2 code implementations ECCV 2020 Kshitij Dwivedi, Jiahui Huang, Radoslaw Martin Cichy, Gemma Roig

In this paper, we tackle an open research question in transfer learning, which is selecting a model initialization to achieve high performance on a new task, given several pre-trained models.

Model Selection Semantic Segmentation +1

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings

no code implementations CVPR 2020 Jiahui Huang, Sheng Yang, Tai-Jiang Mu, Shi-Min Hu

We present ClusterVO, a stereo Visual Odometry which simultaneously clusters and estimates the motion of both ego and surrounding rigid clusters/objects.

Autonomous Driving Clustering +3

Shallow2Deep: Indoor Scene Modeling by Single Image Understanding

no code implementations22 Feb 2020 Yinyu Nie, Shihui Guo, Jian Chang, Xiaoguang Han, Jiahui Huang, Shi-Min Hu, Jian Jun Zhang

Particularly, we design a shallow-to-deep architecture on the basis of convolutional networks for semantic scene understanding and modeling.

3D geometry global-optimization +2

ClusterSLAM: A SLAM Backend for Simultaneous Rigid Body Clustering and Motion Estimation

no code implementations ICCV 2019 Jiahui Huang, Sheng Yang, Zishuo Zhao, Yu-Kun Lai, Shi-Min Hu

We present a practical backend for stereo visual SLAM which can simultaneously discover individual rigid bodies and compute their motions in dynamic environments.

Clustering Motion Estimation

Deep Anchored Convolutional Neural Networks

no code implementations22 Apr 2019 Jiahui Huang, Kshitij Dwivedi, Gemma Roig

Convolutional Neural Networks (CNNs) have been proven to be extremely successful at solving computer vision tasks.

DeepSpline: Data-Driven Reconstruction of Parametric Curves and Surfaces

2 code implementations12 Jan 2019 Jun Gao, Chengcheng Tang, Vignesh Ganapathi-Subramanian, Jiahui Huang, Hao Su, Leonidas J. Guibas

Reconstruction of geometry based on different input modes, such as images or point clouds, has been instrumental in the development of computer aided design and computer graphics.

Cannot find the paper you are looking for? You can Submit a new open access paper.