Search Results for author: Hujun Bao

Found 92 papers, 44 papers with code

Learning Human Motion from Monocular Videos via Cross-Modal Manifold Alignment

no code implementations • 15 Apr 2024 • Shuaiying Hou, Hongyu Tao, Junheng Fang, Changqing Zou, Hujun Bao, Weiwei Xu

Learning 3D human motion from 2D inputs is a fundamental task in the realms of computer vision and computer graphics.

Paper
Add Code

GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

no code implementations • 2 Apr 2024 • Chong Bao, yinda zhang, Yuan Li, Xiyu Zhang, Bangbang Yang, Hujun Bao, Marc Pollefeys, Guofeng Zhang, Zhaopeng Cui

Recently, we have witnessed the explosive growth of various volumetric representations in modeling animatable head avatars.

Paper
Add Code

CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field

1 code implementation • 24 Mar 2024 • Jiarui Hu, Xianhao Chen, Boyin Feng, Guanglin Li, Liangjing Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

Recently neural radiance fields (NeRF) have been widely exploited as 3D representations for dense simultaneous localization and mapping (SLAM).

Novel View Synthesis Simultaneous Localization and Mapping

Paper
Code

Vox-Fusion++: Voxel-based Neural Implicit Dense Tracking and Mapping with Multi-maps

no code implementations • 19 Mar 2024 • Hongjia Zhai, Hai Li, Xingrui Yang, Gan Huang, Yuhang Ming, Hujun Bao, Guofeng Zhang

In this paper, we introduce Vox-Fusion++, a multi-maps-based robust dense tracking and mapping system that seamlessly fuses neural implicit representations with traditional volumetric fusion techniques.

Paper
Add Code

3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

no code implementations • 14 Mar 2024 • Frank Zhang, Yibo Zhang, Quan Zheng, Rui Ma, Wei Hua, Hujun Bao, Weiwei Xu, Changqing Zou

Text-driven 3D scene generation techniques have made rapid progress in recent years.

Scene Generation

Paper
Add Code

Boosting Image Restoration via Priors from Pre-trained Models

no code implementations • 11 Mar 2024 • Xiaogang Xu, Shu Kong, Tao Hu, Zhe Liu, Hujun Bao

Pre-trained models with large-scale training data, such as CLIP and Stable Diffusion, have demonstrated remarkable performance in various high-level computer vision tasks such as image understanding and generation from language descriptions.

Deblurring Denoising +2

Paper
Add Code

Multi-View Neural 3D Reconstruction of Micro-/Nanostructures with Atomic Force Microscopy

1 code implementation • 21 Jan 2024 • Shuo Chen, Mao Peng, Yijin Li, Bing-Feng Ju, Hujun Bao, Yuan-Liu Chen, Guofeng Zhang

However, conventional AFM scanning struggles to reconstruct complex 3D micro-/nanostructures precisely due to limitations such as incomplete sample topography capturing and tip-sample convolution artifacts.

3D Reconstruction Surface Reconstruction

Paper
Code

Neural Rendering and Its Hardware Acceleration: A Review

no code implementations • 6 Jan 2024 • Xinkai Yan, Jieting Xu, Yuchi Huo, Hujun Bao

On this basis, we analyze the common requirements of neural rendering pipeline for hardware acceleration and the characteristics of the current hardware acceleration architecture, and then discuss the design challenges of neural rendering processor architecture.

3D Reconstruction Inverse Rendering +2

Paper
Add Code

PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields

no code implementations • 17 Dec 2023 • Boming Zhao, Luwei Yang, Mao Mao, Hujun Bao, Zhaopeng Cui

In this paper, we propose a novel visual localization framework, \ie, PNeRFLoc, based on a unified point-based representation.

Data Augmentation Neural Rendering +3

Paper
Add Code

EasyVolcap: Accelerating Neural Volumetric Video Research

1 code implementation • 11 Dec 2023 • Zhen Xu, Tao Xie, Sida Peng, Haotong Lin, Qing Shuai, Zhiyuan Yu, Guangzhao He, Jiaming Sun, Hujun Bao, Xiaowei Zhou

Volumetric video is a technology that digitally records dynamic events such as artistic performances, sporting events, and remote conversations.

496

Paper
Code

Holistic Inverse Rendering of Complex Facade via Aerial 3D Scanning

no code implementations • 20 Nov 2023 • Zixuan Xie, Rengan Xie, Rong Li, Kai Huang, Pengju Qiao, Jingsen Zhu, Xu Yin, Qi Ye, Wei Hua, Yuchi Huo, Hujun Bao

In this work, we use multi-view aerial images to reconstruct the geometry, lighting, and material of facades using neural signed distance fields (SDFs).

Benchmarking Inverse Rendering +2

Paper
Add Code

RD-VIO: Robust Visual-Inertial Odometry for Mobile Augmented Reality in Dynamic Environments

1 code implementation • 23 Oct 2023 • Jinyu Li, Xiaokun Pan, Gan Huang, Ziyang Zhang, Nan Wang, Hujun Bao, Guofeng Zhang

In this work, we design a novel visual-inertial odometry (VIO) system called RD-VIO to handle both of these two problems.

367

Paper
Code

4K4D: Real-Time 4D View Synthesis at 4K Resolution

no code implementations • 17 Oct 2023 • Zhen Xu, Sida Peng, Haotong Lin, Guangzhao He, Jiaming Sun, Yujun Shen, Hujun Bao, Xiaowei Zhou

Experiments show that our representation can be rendered at over 400 FPS on the DNA-Rendering dataset at 1080p resolution and 80 FPS on the ENeRF-Outdoor dataset at 4K resolution using an RTX 4090 GPU, which is 30x faster than previous methods and achieves the state-of-the-art rendering quality.

Paper
Add Code

FuseSR: Super Resolution for Real-time Rendering through Efficient Multi-resolution Fusion

no code implementations • 15 Oct 2023 • Zhihua Zhong, Jingsen Zhu, Yuxin Dai, Chuankun Zheng, Yuchi Huo, Guanlin Chen, Hujun Bao, Rui Wang

To mitigate this problem, one of the most popular solutions is to render images at a low resolution to reduce rendering overhead, and then manage to accurately upsample the low-resolution rendered image to the target resolution, a. k. a.

4k Super-Resolution

Paper
Add Code

Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes

no code implementations • 12 Oct 2023 • Haotong Lin, Sida Peng, Zhen Xu, Tao Xie, Xingyi He, Hujun Bao, Xiaowei Zhou

This paper aims to tackle the challenge of dynamic view synthesis from multi-view videos.

Novel View Synthesis

Paper
Add Code

Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models

no code implementations • ICCV 2023 • Huaijin Pi, Sida Peng, Minghui Yang, Xiaowei Zhou, Hujun Bao

This paper presents a novel approach to generating the 3D motion of a human interacting with a target object, with a focus on solving the challenge of synthesizing long-range and diverse motions, which could not be fulfilled by existing auto-regressive models or path planning-based methods.

Human-Object Interaction Detection

Paper
Add Code

Depth Completion with Multiple Balanced Bases and Confidence for Dense Monocular SLAM

no code implementations • 8 Sep 2023 • Weijian Xie, Guanyi Chu, Quanhao Qian, Yihao Yu, Hai Li, Danpeng Chen, Shangjin Zhai, Nan Wang, Hujun Bao, Guofeng Zhang

In this paper, we propose a novel method that integrates a light-weight depth completion network into a sparse SLAM system using a multi-basis depth representation, so that dense mapping can be performed online even on a mobile phone.

Depth Completion

Paper
Add Code

Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor

no code implementations • ICCV 2023 • Xinyang Liu, Yijin Li, Yanbin Teng, Hujun Bao, Guofeng Zhang, yinda zhang, Zhaopeng Cui

Specifically, we propose a multi-modal implicit scene representation that supports rendering both the signals from the RGB camera and light-weight ToF sensor which drives the optimization by comparing with the raw sensor inputs.

Pose Tracking

Paper
Add Code

Graph-based Asynchronous Event Processing for Rapid Object Recognition

no code implementations • ICCV 2021 • Yijin Li, Han Zhou, Bangbang Yang, Ye Zhang, Zhaopeng Cui, Hujun Bao, Guofeng Zhang

Different from traditional video cameras, event cameras capture asynchronous events stream in which each event encodes pixel location, trigger time, and the polarity of the brightness changes.

graph construction Object Recognition

Paper
Add Code

Relightable and Animatable Neural Avatar from Sparse-View Video

no code implementations • 15 Aug 2023 • Zhen Xu, Sida Peng, Chen Geng, Linzhan Mou, Zihan Yan, Jiaming Sun, Hujun Bao, Xiaowei Zhou

Based on the HDQ algorithm, we leverage sphere tracing to efficiently estimate the surface intersection and light visibility.

Inverse Rendering

Paper
Add Code

Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing

no code implementations • 7 Aug 2023 • Junyi Zeng, Chong Bao, Rui Chen, Zilong Dong, Guofeng Zhang, Hujun Bao, Zhaopeng Cui

Recently, Neural Radiance Fields (NeRF) has exhibited significant success in novel view synthesis, surface reconstruction, etc.

Neural Rendering Novel View Synthesis +1

Paper
Add Code

Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields

no code implementations • 24 Jul 2023 • Shangzhan Zhang, Sida Peng, Yinji ShenTu, Qing Shuai, Tianrun Chen, Kaicheng Yu, Hujun Bao, Xiaowei Zhou

We extensively evaluate our approach on various scenes and show that our approach achieves spatially and temporally consistent editing results.

Paper
Add Code

Detector-Free Structure from Motion

1 code implementation • 27 Jun 2023 • Xingyi He, Jiaming Sun, Yifan Wang, Sida Peng, QiXing Huang, Hujun Bao, Xiaowei Zhou

We propose a new detector-free SfM framework to draw benefits from the recent success of detector-free matchers to avoid the early determination of keypoints, while solving the multi-view inconsistency issue of detector-free matchers.

Keypoint Detection

397

Paper
Code

Learning Human Mesh Recovery in 3D Scenes

no code implementations • CVPR 2023 • Zehong Shen, Zhi Cen, Sida Peng, Qing Shuai, Hujun Bao, Xiaowei Zhou

We present a novel method for recovering the absolute pose and shape of a human in a pre-scanned scene given a single image.

Human Mesh Recovery

Paper
Add Code

AutoRecon: Automated 3D Object Discovery and Reconstruction

no code implementations • CVPR 2023 • Yuang Wang, Xingyi He, Sida Peng, Haotong Lin, Hujun Bao, Xiaowei Zhou

A fully automated object reconstruction pipeline is crucial for digital content creation.

3D Reconstruction Object +2

Paper
Add Code

Representing Volumetric Videos as Dynamic MLP Maps

no code implementations • CVPR 2023 • Sida Peng, Yunzhi Yan, Qing Shuai, Hujun Bao, Xiaowei Zhou

This paper introduces a novel representation of volumetric videos for real-time view synthesis of dynamic scenes.

Paper
Add Code

CF-Font: Content Fusion for Few-shot Font Generation

1 code implementation • CVPR 2023 • Chi Wang, Min Zhou, Tiezheng Ge, Yuning Jiang, Hujun Bao, Weiwei Xu

Content and style disentanglement is an effective way to achieve few-shot font generation.

Disentanglement Font Generation

101

Paper
Code

SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field

1 code implementation • CVPR 2023 • Chong Bao, yinda zhang, Bangbang Yang, Tianxing Fan, Zesong Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

Despite the great success in 2D editing using user-friendly tools, such as Photoshop, semantic strokes, or even text prompts, similar capabilities in 3D areas are still limited, either relying on 3D modeling skills or allowing editing within only a few categories.

123

Paper
Code

PATS: Patch Area Transportation with Subdivision for Local Feature Matching

no code implementations • CVPR 2023 • Junjie Ni, Yijin Li, Zhaoyang Huang, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

However, estimating scale differences between these patches is non-trivial since the scale differences are determined by both relative camera poses and scene structures, and thus spatially varying over image pairs.

Graph Matching Optical Flow Estimation +2

Paper
Add Code

I$^2$-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations • 14 Mar 2023 • Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

In this work, we present I$^2$-SDF, a new method for intrinsic indoor scene reconstruction and editing using differentiable Monte Carlo raytracing on neural signed distance fields (SDFs).

Indoor Scene Reconstruction Novel View Synthesis

Paper
Add Code

BlinkFlow: A Dataset to Push the Limits of Event-based Optical Flow Estimation

no code implementations • 14 Mar 2023 • Yijin Li, Zhaoyang Huang, Shuo Chen, Xiaoyu Shi, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

BlinkSim consists of a configurable rendering engine and a flexible engine for event data simulation.

Event-based Optical Flow Optical Flow Estimation

Paper
Add Code

Perceiving Unseen 3D Objects by Poking the Objects

no code implementations • 26 Feb 2023 • Linghao Chen, Yunzhou Song, Hujun Bao, Xiaowei Zhou

We present a novel approach to interactive 3D object perception for robots.

3D Reconstruction Robotic Grasping

Paper
Add Code

Learning Neural Volumetric Representations of Dynamic Humans in Minutes

1 code implementation • CVPR 2023 • Chen Geng, Sida Peng, Zhen Xu, Hujun Bao, Xiaowei Zhou

In this paper, we propose a novel method for learning neural volumetric videos of dynamic humans from sparse view videos in minutes with competitive visual quality.

145

Paper
Code

EC-SfM: Efficient Covisibility-based Structure-from-Motion for Both Sequential and Unordered Images

1 code implementation • 21 Feb 2023 • Zhichao Ye, Chong Bao, Xin Zhou, Haomin Liu, Hujun Bao, Guofeng Zhang

Based on this general image connection, we propose a unified framework to efficiently reconstruct sequential images, unordered images, and the mixture of these two.

152

Paper
Code

OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models

no code implementations • 18 Jan 2023 • Xingyi He, Jiaming Sun, Yuang Wang, Di Huang, Hujun Bao, Xiaowei Zhou

We propose a new method for object pose estimation without CAD models.

Keypoint Detection Object

Paper
Add Code

I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations • CVPR 2023 • Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

Further, we propose to decompose the neural radiance field into spatially-varying material of the scene as a neural field through surface-based, differentiable Monte Carlo raytracing and emitter semantic segmentations, which enables physically based and photorealistic scene relighting and editing applications.

Indoor Scene Reconstruction Novel View Synthesis

Paper
Add Code

DPS-Net: Deep Polarimetric Stereo Depth Estimation

no code implementations • ICCV 2023 • Chaoran Tian, Weihong Pan, Zimo Wang, Mao Mao, Guofeng Zhang, Hujun Bao, Ping Tan, Zhaopeng Cui

Stereo depth estimation usually struggles to deal with textureless scenes for both traditional and learning-based methods due to the inherent dependence on image correspondence matching.

Stereo Depth Estimation

Paper
Add Code

Improving Feature-based Visual Localization by Geometry-Aided Matching

1 code implementation • 16 Nov 2022 • Hailin Yu, Youji Feng, Weicai Ye, Mingxuan Jiang, Hujun Bao, Guofeng Zhang

We apply GAM to a new hierarchical visual localization pipeline and show that GAM can effectively improve the robustness and accuracy of localization.

3D Feature Matching Pose Estimation +1

172

Paper
Code

Learning-based Inverse Rendering of Complex Indoor Scenes with Differentiable Monte Carlo Raytracing

no code implementations • 6 Nov 2022 • Jingsen Zhu, Fujun Luan, Yuchi Huo, Zihao Lin, Zhihua Zhong, Dianbing Xi, Jiaxiang Zheng, Rui Tang, Hujun Bao, Rui Wang

Indoor scenes typically exhibit complex, spatially-varying appearance from global illumination, making inverse rendering a challenging ill-posed problem.

Inverse Rendering

Paper
Add Code

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

1 code implementation • ICCV 2023 • Weicai Ye, Shuo Chen, Chong Bao, Hujun Bao, Marc Pollefeys, Zhaopeng Cui, Guofeng Zhang

Existing inverse rendering combined with neural rendering methods can only perform editable novel view synthesis on object-specific scenes, while we present intrinsic neural radiance fields, dubbed IntrinsicNeRF, which introduce intrinsic decomposition into the NeRF-based neural rendering method and can extend its application to room-scale scenes.

Clustering Inverse Rendering +2

176

Paper
Code

DELTAR: Depth Estimation from a Light-weight ToF Sensor and RGB Image

no code implementations • 27 Sep 2022 • Yijin Li, Xinyang Liu, Wenqi Dong, Han Zhou, Hujun Bao, Guofeng Zhang, yinda zhang, Zhaopeng Cui

Light-weight time-of-flight (ToF) depth sensors are small, cheap, low-energy and have been massively deployed on mobile devices for the purposes like autofocus, obstacle detection, etc.

3D Reconstruction Depth Completion +2

Paper
Add Code

Vox-Surf: Voxel-based Implicit Surface Representation

1 code implementation • 21 Aug 2022 • Hai Li, Xingrui Yang, Hongjia Zhai, Yuqian Liu, Hujun Bao, Guofeng Zhang

Virtual content creation and interaction play an important role in modern 3D applications such as AR and VR.

valid

Paper
Code

NeuMesh: Learning Disentangled Neural Mesh-based Implicit Field for Geometry and Texture Editing

no code implementations • 25 Jul 2022 • Bangbang Yang, Chong Bao, Junyi Zeng, Hujun Bao, yinda zhang, Zhaopeng Cui, Guofeng Zhang

Very recently neural implicit rendering techniques have been rapidly evolved and shown great advantages in novel view synthesis and 3D scene reconstruction.

3D Scene Reconstruction Neural Rendering +1

Paper
Add Code

QuickPose: Real-time Multi-view Multi-person Pose Estimation in Crowded Scenes

no code implementations • SIGGRAPH 2022 • Zhize Zhou, Qing Shuai, Yize Wang, Qi Fang, Xiaopeng Ji, Fashuai Li, Hujun Bao, Xiaowei Zhou

The key challenge of this problem is to efficiently match 2D observations across multiple views.

Ranked #2 on 3D Multi-Person Pose Estimation on Shelf

2D Pose Estimation 3D Multi-Person Pose Estimation +1

Paper
Add Code

DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM

1 code implementation • 18 Jul 2022 • Weicai Ye, Xingyuan Yu, Xinyue Lan, Yuhang Ming, Jinyu Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

We present a novel dual-flow representation of scene motion that decomposes the optical flow into a static flow field caused by the camera motion and another dynamic flow field caused by the objects' movements in the scene.

Pose Estimation Simultaneous Localization and Mapping

109

Paper
Code

Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation

no code implementations • 14 Jul 2022 • Boming Zhao, Bangbang Yang, Zhenyang Li, Zuoyue Li, Guofeng Zhang, Jiashu Zhao, Dawei Yin, Zhaopeng Cui, Hujun Bao

Expanding an existing tourist photo from a partially captured scene to a full scene is one of the desired experiences for photography applications.

Paper
Add Code

PVO: Panoptic Visual Odometry

1 code implementation • CVPR 2023 • Weicai Ye, Xinyue Lan, Shuo Chen, Yuhang Ming, Xingyuan Yu, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

We present PVO, a novel panoptic visual odometry framework to achieve more comprehensive modeling of the scene motion, geometry, and panoptic segmentation information.

Optical Flow Estimation Pose Estimation +3

197

Paper
Code

VIP-SLAM: An Efficient Tightly-Coupled RGB-D Visual Inertial Planar SLAM

no code implementations • 4 Jul 2022 • Danpeng Chen, Shuai Wang, Weijian Xie, Shangjin Zhai, Nan Wang, Hujun Bao, Guofeng Zhang

Even if the plane parameters are involved in the optimization, we effectively simplify the back-end map by using planar structures.

Paper
Add Code

Neural 3D Scene Reconstruction with the Manhattan-world Assumption

1 code implementation • CVPR 2022 • Haoyu Guo, Sida Peng, Haotong Lin, Qianqian Wang, Guofeng Zhang, Hujun Bao, Xiaowei Zhou

Based on the Manhattan-world assumption, planar constraints are employed to regularize the geometry in floor and wall regions predicted by a 2D semantic segmentation network.

2D Semantic Segmentation 3D Reconstruction +2

483

Paper
Code

Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects

no code implementations • 5 May 2022 • Bangbang Yang, yinda zhang, Yijin Li, Zhaopeng Cui, Sean Fanello, Hujun Bao, Guofeng Zhang

We, as human beings, can understand and picture a familiar scene from arbitrary viewpoints given a single image, whereas this is still a grand challenge for computers.

Data Augmentation Neural Rendering +1

Paper
Add Code

CNN LEGO: Disassembling and Assembling Convolutional Neural Network

no code implementations • 25 Mar 2022 • Jiacong Hu, Jing Gao, Zunlei Feng, Lechao Cheng, Jie Lei, Hujun Bao, Mingli Song

the feature maps are adopted to locate the critical features in each layer.

Incremental Learning Knowledge Distillation +2

Paper
Add Code

Hybrid Mesh-neural Representation for 3D Transparent Object Reconstruction

no code implementations • 23 Mar 2022 • Jiamin Xu, Zihan Zhu, Hujun Bao, Weiwei Xu

We propose a novel method to reconstruct the 3D shapes of transparent objects using hand-held captured images under natural light conditions.

Image Matting Object Reconstruction +1

Paper
Add Code

Animatable Implicit Neural Representations for Creating Realistic Avatars from Videos

1 code implementation • 15 Mar 2022 • Sida Peng, Zhen Xu, Junting Dong, Qianqian Wang, Shangzhan Zhang, Qing Shuai, Hujun Bao, Xiaowei Zhou

Some recent works have proposed to decompose a non-rigidly deforming scene into a canonical neural radiance field and a set of deformation fields that map observation-space points to the canonical space, thereby enabling them to learn the dynamic scene from images.

489

Paper
Code

Normal and Visibility Estimation of Human Face from a Single Image

no code implementations • 9 Mar 2022 • Fuzhi Zhong, Rui Wang, Yuchi Huo, Hujun Bao

Recent work on the intrinsic image of humans starts to consider the visibility of incident illumination and encodes the light transfer function by spherical harmonics.

Paper
Add Code

Hybrid Tracker with Pixel and Instance for Video Panoptic Segmentation

no code implementations • 2 Mar 2022 • Weicai Ye, Xinyue Lan, Ge Su, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

HybridTracker performs pixel tracker and instance tracker in parallel to obtain the association matrices, which are fused into a matching matrix.

Optical Flow Estimation Segmentation +1

Paper
Add Code

SelfRecon: Self Reconstruction Your Digital Avatar from Monocular Video

1 code implementation • CVPR 2022 • Boyi Jiang, Yang Hong, Hujun Bao, Juyong Zhang

Meanwhile, the explicit mesh is updated periodically to adjust its topology changes, and a consistency loss is designed to match both representations.

Neural Rendering

399

Paper
Code

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

1 code implementation • CVPR 2022 • Zihan Zhu, Songyou Peng, Viktor Larsson, Weiwei Xu, Hujun Bao, Zhaopeng Cui, Martin R. Oswald, Marc Pollefeys

Neural implicit representations have recently shown encouraging results in various domains, including promising progress in simultaneous localization and mapping (SLAM).

Simultaneous Localization and Mapping

1,354

Paper
Code

Geometry-aware Two-scale PIFu Representation for Human Reconstruction

no code implementations • 3 Dec 2021 • Zheng Dong, Ke Xu, Ziheng Duan, Hujun Bao, Weiwei Xu, Rynson W. H. Lau

Our key idea is to exploit the complementary properties of depth denoising and 3D reconstruction, for learning a two-scale PIFu representation to reconstruct high-frequency facial details and consistent bodies separately.

3D Human Reconstruction 3D Reconstruction +3

Paper
Add Code

Efficient Neural Radiance Fields for Interactive Free-viewpoint Video

no code implementations • 2 Dec 2021 • Haotong Lin, Sida Peng, Zhen Xu, Yunzhi Yan, Qing Shuai, Hujun Bao, Xiaowei Zhou

We propose a novel scene representation, called ENeRF, for the fast creation of interactive free-viewpoint videos.

Depth Estimation Depth Prediction +1

Paper
Add Code

LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human Bodies

no code implementations • 30 Nov 2021 • Sandro Lombardi, Bangbang Yang, Tianxing Fan, Hujun Bao, Guofeng Zhang, Marc Pollefeys, Zhaopeng Cui

In this work, we propose a novel neural implicit representation for the human body, which is fully differentiable and optimizable with disentangled shape and pose latent spaces.

3D Reconstruction motion retargeting +1

Paper
Add Code

CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

1 code implementation • 13 Sep 2021 • Yunfan Shao, Zhichao Geng, Yitao Liu, Junqi Dai, Hang Yan, Fei Yang, Li Zhe, Hujun Bao, Xipeng Qiu

In this paper, we take the advantage of previous pre-trained models (PTMs) and propose a novel Chinese Pre-trained Unbalanced Transformer (CPT).

Denoising Language Modelling +3

465

Paper
Code

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

no code implementations • ICCV 2021 • Bangbang Yang, yinda zhang, Yinghao Xu, Yijin Li, Han Zhou, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

In this paper, we present a novel neural scene rendering system, which learns an object-compositional neural radiance field and produces realistic rendering with editing capability for a clustered and real-world scene.

Neural Rendering Novel View Synthesis +1

Paper
Add Code

DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization

1 code implementation • ICCV 2021 • Cheng Zhang, Zhaopeng Cui, Cai Chen, Shuaicheng Liu, Bing Zeng, Hujun Bao, yinda zhang

Panorama images have a much larger field-of-view thus naturally encode enriched scene context information compared to standard perspective images, which however is not well exploited in the previous scene understanding methods.

Object Relation +1

Paper
Code

MINERVAS: Massive INterior EnviRonments VirtuAl Synthesis

no code implementations • 13 Jul 2021 • Haocheng Ren, Hao Zhang, Jia Zheng, Jiaxiang Zheng, Rui Tang, Yuchi Huo, Hujun Bao, Rui Wang

With the rapid development of data-driven techniques, data has played an essential role in various computer vision tasks.

2D Semantic Segmentation Depth Estimation +1

Paper
Add Code

Attention-guided Temporally Coherent Video Object Matting

1 code implementation • 24 May 2021 • Yunke Zhang, Chi Wang, Miaomiao Cui, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Hujun Bao, QiXing Huang, Weiwei Xu

Experimental results show that our method can generate high-quality alpha mattes for various videos featuring appearance change, occlusion, and fast motion.

Image Matting Object +4

Paper
Code

VS-Net: Voting with Segmentation for Visual Localization

1 code implementation • CVPR 2021 • Zhaoyang Huang, Han Zhou, Yijin Li, Bangbang Yang, Yan Xu, Xiaowei Zhou, Hujun Bao, Guofeng Zhang, Hongsheng Li

To address this problem, we propose a novel visual localization framework that establishes 2D-to-3D correspondences between the query image and the 3D map with a series of learnable scene-specific landmarks.

Segmentation Semantic Segmentation +1

Paper
Code

Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies

1 code implementation • ICCV 2021 • Sida Peng, Junting Dong, Qianqian Wang, Shangzhan Zhang, Qing Shuai, Xiaowei Zhou, Hujun Bao

Moreover, the learned blend weight fields can be combined with input skeletal motions to generate new deformation fields to animate the human model.

489

Paper
Code

StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision

1 code implementation • CVPR 2021 • Yang Hong, Juyong Zhang, Boyi Jiang, Yudong Guo, Ligang Liu, Hujun Bao

In this paper, we propose StereoPIFu, which integrates the geometric constraints of stereo vision with implicit function representation of PIFu, to recover the 3D shape of the clothed human from a pair of low-cost rectified images.

101

Paper
Code

NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

3 code implementations • CVPR 2021 • Jiaming Sun, Yiming Xie, Linghao Chen, Xiaowei Zhou, Hujun Bao

We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video.

3D Reconstruction 3D Scene Reconstruction +1

1,927

Paper
Code

LoFTR: Detector-Free Local Feature Matching with Transformers

5 code implementations • CVPR 2021 • Jiaming Sun, Zehong Shen, Yuang Wang, Hujun Bao, Xiaowei Zhou

We present a novel method for local image feature matching.

Image Matching Visual Localization

9,370

Paper
Code

Reconstructing 3D Human Pose by Watching Humans in the Mirror

1 code implementation • CVPR 2021 • Qi Fang, Qing Shuai, Junting Dong, Hujun Bao, Xiaowei Zhou

In this paper, we introduce the new task of reconstructing 3D human pose from a single image in which we can see the person and the person's image through a mirror.

3D Pose Estimation

3,302

Paper
Code

AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis

1 code implementation • ICCV 2021 • Yudong Guo, Keyu Chen, Sen Liang, Yong-Jin Liu, Hujun Bao, Juyong Zhang

Generating high-fidelity talking head video by fitting with the input audio sequence is a challenging problem that receives considerable attentions recently.

Talking Face Generation

982

Paper
Code

Active Boundary Loss for Semantic Segmentation

1 code implementation • 4 Feb 2021 • Chi Wang, Yunke Zhang, Miaomiao Cui, Peiran Ren, Yin Yang, Xuansong Xie, Xiansheng Hua, Hujun Bao, Weiwei Xu

This paper proposes a novel active boundary loss for semantic segmentation.

Segmentation Semantic Segmentation +2

Paper
Code

You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking

no code implementations • ICCV 2021 • Jiaming Sun, Yiming Xie, Siyu Zhang, Linghao Chen, Guofeng Zhang, Hujun Bao, Xiaowei Zhou

In this work, we propose a novel system for integrated 3D object detection and tracking, which uses a dynamic object occupancy map and previous object states as spatial-temporal memory to assist object detection in future frames.

3D Object Detection Object +2

Paper
Add Code

Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans

3 code implementations • CVPR 2021 • Sida Peng, Yuanqing Zhang, Yinghao Xu, Qianqian Wang, Qing Shuai, Hujun Bao, Xiaowei Zhou

To this end, we propose Neural Body, a new human body representation which assumes that the learned neural representations at different frames share the same set of latent codes anchored to a deformable mesh, so that the observations across frames can be naturally integrated.

Novel View Synthesis Representation Learning

3,302

Paper
Code

Location-aware Single Image Reflection Removal

1 code implementation • ICCV 2021 • Zheng Dong, Ke Xu, Yin Yang, Hujun Bao, Weiwei Xu, Rynson W. H. Lau

It is beneficial to strong reflection detection and substantially improves the quality of reflection removal results.

Reflection Removal

Paper
Code

Recurrent Multi-view Alignment Network for Unsupervised Surface Registration

1 code implementation • CVPR 2021 • Wanquan Feng, Juyong Zhang, Hongrui Cai, Haofei Xu, Junhui Hou, Hujun Bao

Learning non-rigid registration in an end-to-end manner is challenging due to the inherent high degrees of freedom and the lack of labeled training data.

Deformable Object Manipulation Neural Rendering +1

206

Paper
Code

SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks

no code implementations • 19 Oct 2020 • Yan Xu, Zhaoyang Huang, Kwan-Yee Lin, Xinge Zhu, Jianping Shi, Hujun Bao, Guofeng Zhang, Hongsheng Li

To suit our network to self-supervised learning, we design several novel loss functions that utilize the inherent properties of LiDAR point clouds.

Self-Supervised Learning

Paper
Add Code

SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation

1 code implementation • ECCV 2020 • Jianan Zhen, Qi Fang, Jiaming Sun, Wentao Liu, Wei Jiang, Hujun Bao, Xiaowei Zhou

Recovering multi-person 3D poses with absolute scales from a single RGB image is a challenging problem due to the inherent depth and scale ambiguity from a single view.

Ranked #11 on 3D Multi-Person Pose Estimation (absolute) on MuPoTS-3D

2D Pose Estimation 3D Depth Estimation +3

238

Paper
Code

Motion Capture from Internet Videos

2 code implementations • ECCV 2020 • Junting Dong, Qing Shuai, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao

Therefore, we propose to capture human motion by jointly analyzing these Internet videos instead of using single videos separately.

Pose Estimation

3,302

Paper
Code

Mobile3DRecon: Real-time Monocular 3D Reconstruction on a Mobile Phone

no code implementations • ISMAR 2020 • Xingbin Yang, Liyang Zhou, Hanqing Jiang, Zhongliang Tang, Yuanbo Wang, Hujun Bao, Guofeng Zhang

The proposed mesh generation module incrementally fuses each estimated keyframe depth map to an online dense surface mesh, which is useful for achieving realistic AR effects such as occlusions and collisions.

3D Reconstruction Monocular Depth Estimation +1

Paper
Add Code

Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation

1 code implementation • CVPR 2020 • Jiaming Sun, Linghao Chen, Yiming Xie, Siyu Zhang, Qinhong Jiang, Xiaowei Zhou, Hujun Bao

In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images.

Ranked #3 on 3D Object Detection From Stereo Images on KITTI Cyclists Moderate

3D Object Detection From Stereo Images Disparity Estimation +2

210

Paper
Code

BCNet: Learning Body and Cloth Shape from A Single Image

1 code implementation • ECCV 2020 • Boyi Jiang, Juyong Zhang, Yang Hong, Jinhao Luo, Ligang Liu, Hujun Bao

In this paper, we consider the problem to automatically reconstruct garment and body shapes from a single near-front view RGB image.

Paper
Code

Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose

1 code implementation • 24 Feb 2020 • Ran Yi, Zipeng Ye, Juyong Zhang, Hujun Bao, Yong-Jin Liu

In this paper, we address this problem by proposing a deep neural network model that takes an audio signal A of a source person and a very short video V of a target person as input, and outputs a synthesized high-quality talking face video with personalized head pose (making use of the visual information in V), expression and lip synchronization (by considering both A and V).

3D Face Animation Video Generation

699

Paper
Code

Deep Snake for Real-Time Instance Segmentation

1 code implementation • CVPR 2020 • Sida Peng, Wen Jiang, Huaijin Pi, Xiuli Li, Hujun Bao, Xiaowei Zhou

Based on deep snake, we develop a two-stage pipeline for instance segmentation: initial contour proposal and contour deformation, which can handle errors in object localization.

Ranked #2 on Semantic Contour Prediction on Sbd val

Object Object Localization +3

1,145

Paper
Code

GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs

1 code implementation • NeurIPS 2019 • Yuan Liu, Zehong Shen, Zhixuan Lin, Sida Peng, Hujun Bao, Xiaowei Zhou

Instead of feature pooling, we use group convolutions to exploit underlying structures of the extracted features on the group, resulting in descriptors that are both discriminative and provably invariant to the group of transformations.

Pose Estimation

191

Paper
Code

Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints

no code implementations • ICCV 2019 • Yan Xu, Xinge Zhu, Jianping Shi, Guofeng Zhang, Hujun Bao, Hongsheng Li

Most of existing methods directly train a network to learn a mapping from sparse depth inputs to dense depth maps, which has difficulties in utilizing the 3D geometric constraints and handling the practical sensor noises.

Autonomous Driving Depth Completion

Paper
Add Code

Fast and Robust Multi-Person 3D Pose Estimation from Multiple Views

4 code implementations • CVPR 2019 • Junting Dong, Wen Jiang, Qi-Xing Huang, Hujun Bao, Xiaowei Zhou

This paper addresses the problem of 3D pose estimation for multiple people in a few calibrated camera views.

Ranked #12 on 3D Multi-Person Pose Estimation on Campus

3D Multi-Person Pose Estimation 3D Pose Estimation

3,302

Paper
Code

PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation

4 code implementations • CVPR 2019 • Sida Peng, Yu-An Liu, Qi-Xing Huang, Hujun Bao, Xiaowei Zhou

We further create a Truncation LINEMOD dataset to validate the robustness of our approach against truncation.

Ranked #2 on 6D Pose Estimation using RGB on YCB-Video (Mean AUC metric)

6D Pose Estimation using RGB

790

Paper
Code

ICE-BA: Incremental, Consistent and Efficient Bundle Adjustment for Visual-Inertial SLAM

1 code implementation • CVPR 2018 • Hao-Min Liu, Mingyu Chen, Guofeng Zhang, Hujun Bao, Yingze Bao

However, jointly using visual and inertial measurements to optimize SLAM objective functions is a problem of high computational complexity.

Computational Efficiency Pose Estimation

699

Paper
Code

Robust Keyframe-based Dense SLAM with an RGB-D Camera

8 code implementations • 14 Nov 2017 • Hao-Min Liu, Chen Li, Guojun Chen, Guofeng Zhang, Michael Kaess, Hujun Bao

In this paper, we present RKD-SLAM, a robust keyframe-based dense SLAM approach for an RGB-D camera that can robustly handle fast motion and dense loop closure, and run without time limitation in a moderate size scene.

699

Paper
Code

ENFT: Efficient Non-Consecutive Feature Tracking for Robust Structure-from-Motion

3 code implementations • 27 Oct 2015 • Guofeng Zhang, Hao-Min Liu, Zilong Dong, Jiaya Jia, Tien-Tsin Wong, Hujun Bao

Our framework consists of steps of solving the feature `dropout' problem when indistinctive structures, noise or large image distortion exists, and of rapidly recognizing and joining common features located in different subsequences.

250

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.