Search Results for author: Binbin Huang

Found 14 papers, 7 papers with code

GenFusion: Closing the Loop between Reconstruction and Generation via Videos

no code implementations27 Mar 2025 Sibo Wu, Congrong Xu, Binbin Huang, Andreas Geiger, Anpei Chen

Moreover, we propose a cyclical fusion pipeline that iteratively adds restoration frames from the generative model to the training set, enabling progressive expansion and addressing the viewpoint saturation limitations seen in previous reconstruction and generation pipelines.

3D Generation 3D Reconstruction +2

Quadratic Gaussian Splatting for Efficient and Detailed Surface Reconstruction

no code implementations25 Nov 2024 Ziyu Zhang, Binbin Huang, Hanqing Jiang, Liyang Zhou, Xiaojun Xiang, Shunhan Shen

Recently, 3D Gaussian Splatting (3DGS) has attracted attention for its superior rendering quality and speed over Neural Radiance Fields (NeRF).

3DGS NeRF +1

GeoFormer: Learning Point Cloud Completion with Tri-Plane Integrated Transformer

1 code implementation13 Aug 2024 Jinpeng Yu, Binbin Huang, Yuxuan Zhang, Huaxia Li, Xu Tang, Shenghua Gao

In this paper, we introduce a GeoFormer that simultaneously enhances the global geometric structure of the points and improves the local details.

Point Cloud Completion

Surfel-based Gaussian Inverse Rendering for Fast and Relightable Dynamic Human Reconstruction from Monocular Video

no code implementations21 Jul 2024 Yiqun Zhao, Chenming Wu, Binbin Huang, YiHao Zhi, Chen Zhao, Jingdong Wang, Shenghua Gao

Efficient and accurate reconstruction of a relightable, dynamic clothed human avatar from a monocular video is crucial for the entertainment industry.

Disentanglement Inverse Rendering

Continual Learning for Temporal-Sensitive Question Answering

no code implementations17 Jul 2024 Wanqi Yang, Yunqiu Xu, Yanda Li, Kunze Wang, Binbin Huang, Ling Chen

In this study, we explore an emerging research area of Continual Learning for Temporal Sensitive Question Answering (CLTSQA).

Continual Learning Contrastive Learning +1

2D Gaussian Splatting for Geometrically Accurate Radiance Fields

2 code implementations26 Mar 2024 Binbin Huang, Zehao Yu, Anpei Chen, Andreas Geiger, Shenghua Gao

3D Gaussian Splatting (3DGS) has recently revolutionized radiance field reconstruction, achieving high quality novel view synthesis and fast rendering speed without baking.

3DGS Novel View Synthesis

Mip-Splatting: Alias-free 3D Gaussian Splatting

1 code implementation CVPR 2024 Zehao Yu, Anpei Chen, Binbin Huang, Torsten Sattler, Andreas Geiger

Recently, 3D Gaussian Splatting has demonstrated impressive novel view synthesis results, reaching high fidelity and efficiency.

Novel View Synthesis

TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding

1 code implementation6 Nov 2023 Shuo Wang, Jing Li, Zibo Zhao, Dongze Lian, Binbin Huang, Xiaomei Wang, Zhengxin Li, Shenghua Gao

Holistic scene understanding includes semantic segmentation, surface normal estimation, object boundary detection, depth estimation, etc.

Boundary Detection Depth Estimation +5

Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction

no code implementations21 Apr 2023 Binbin Huang, Xingyue Peng, Siyuan Shen, Suan Xia, Ruiqian Li, Yanhua Yu, Yuehan Wang, Shenghua Gao, Wenzheng Chen, Shiying Li, Jingyi Yu

The core of our method is to put the object nearby diffuse walls and augment the LOS scan in the front view with the NLOS scans from the surrounding walls, which serve as virtual ``mirrors'' to trap lights toward the object.

NeRF Object

3D-aware Image Generation using 2D Diffusion Models

no code implementations ICCV 2023 Jianfeng Xiang, Jiaolong Yang, Binbin Huang, Xin Tong

In this paper, we introduce a novel 3D-aware image generation method that leverages 2D diffusion models.

Image Generation

PREF: Phasorial Embedding Fields for Compact Neural Representations

1 code implementation26 May 2022 Binbin Huang, Xinhao Yan, Anpei Chen, Shenghua Gao, Jingyi Yu

We present an efficient frequency-based neural representation termed PREF: a shallow MLP augmented with a phasor volume that covers significant border spectra than previous Fourier feature mapping or Positional Encoding.

Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding

1 code implementation CVPR 2021 Binbin Huang, Dongze Lian, Weixin Luo, Shenghua Gao

Then we combine the contextual information from the landmark feature convolution module with the target's visual features for grounding.

Descriptive Object +1

Cannot find the paper you are looking for? You can Submit a new open access paper.