Search Results for author: Zhaopeng Cui

Found 55 papers, 18 papers with code

Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

no code implementations • 13 May 2024 • Wenqi Dong, Bangbang Yang, Lin Ma, Xiao Liu, Liyuan Cui, Hujun Bao, Yuewen Ma, Zhaopeng Cui

As humans, we aspire to create media content that is both freely willed and readily controlled.

3D Generation

GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image

no code implementations • 2 Apr 2024 • Chong Bao, Yinda Zhang, Yuan Li, Xiyu Zhang, Bangbang Yang, Hujun Bao, Marc Pollefeys, Guofeng Zhang, Zhaopeng Cui

Recently, we have witnessed the explosive growth of various volumetric representations in modeling animatable head avatars.

CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field

1 code implementation • 24 Mar 2024 • Jiarui Hu, Xianhao Chen, Boyin Feng, Guanglin Li, Liangjing Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

Recently, neural radiance fields (NeRF) have been widely exploited as 3D representations for dense simultaneous localization and mapping (SLAM).

Novel View Synthesis Simultaneous Localization and Mapping

Sat2Scene: 3D Urban Scene Generation from Satellite Images with Diffusion

no code implementations • 19 Jan 2024 • Zuoyue Li, Zhenqiang Li, Zhaopeng Cui, Marc Pollefeys, Martin R. Oswald

Directly generating scenes from satellite imagery offers exciting possibilities for integration into applications like games and map services.

3D Generation Neural Rendering +2

PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields

no code implementations • 17 Dec 2023 • Boming Zhao, Luwei Yang, Mao Mao, Hujun Bao, Zhaopeng Cui

In this paper, we propose a novel visual localization framework, i.e., PNeRFLoc, based on a unified point-based representation.

Data Augmentation Neural Rendering +3

DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation

no code implementations • 19 Oct 2023 • Bangbang Yang, Wenqi Dong, Lin Ma, WenBo Hu, Xiao Liu, Zhaopeng Cui, Yuewen Ma

To ensure that textures are meaningful and aligned to the scene, we develop a novel coarse-to-fine panoramic texture generation approach with dual texture alignment, which considers both the geometry and texture cues of the captured scenes.

Texture Synthesis

Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor

no code implementations • ICCV 2023 • Xinyang Liu, Yijin Li, Yanbin Teng, Hujun Bao, Guofeng Zhang, Yinda Zhang, Zhaopeng Cui

Specifically, we propose a multi-modal implicit scene representation that supports rendering the signals from both the RGB camera and the light-weight ToF sensor, which drives the optimization by comparing with the raw sensor inputs.

Pose Tracking

Graph-based Asynchronous Event Processing for Rapid Object Recognition

no code implementations • ICCV 2021 • Yijin Li, Han Zhou, Bangbang Yang, Ye Zhang, Zhaopeng Cui, Hujun Bao, Guofeng Zhang

Unlike traditional video cameras, event cameras capture an asynchronous event stream in which each event encodes the pixel location, trigger time, and polarity of the brightness change.

Graph Construction Object Recognition

Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views

no code implementations • ICCV 2023 • Wentian Qu, Zhaopeng Cui, Yinda Zhang, Chenyu Meng, Cuixia Ma, Xiaoming Deng, Hongan Wang

Hand-object interaction understanding and the barely addressed novel view synthesis are highly desired in immersive communication, but both are challenging due to the high deformation of the hand and heavy occlusions between hand and object.

Neural Rendering Novel View Synthesis +3

Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing

no code implementations • 7 Aug 2023 • Junyi Zeng, Chong Bao, Rui Chen, Zilong Dong, Guofeng Zhang, Hujun Bao, Zhaopeng Cui

Recently, Neural Radiance Fields (NeRF) has exhibited significant success in novel view synthesis, surface reconstruction, etc.

Neural Rendering Novel View Synthesis +1

SINE: Semantic-driven Image-based NeRF Editing with Prior-guided Editing Field

1 code implementation • CVPR 2023 • Chong Bao, Yinda Zhang, Bangbang Yang, Tianxing Fan, Zesong Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

Despite the great success in 2D editing using user-friendly tools, such as Photoshop, semantic strokes, or even text prompts, similar capabilities in 3D areas are still limited, either relying on 3D modeling skills or allowing editing within only a few categories.

124

PATS: Patch Area Transportation with Subdivision for Local Feature Matching

no code implementations • CVPR 2023 • Junjie Ni, Yijin Li, Zhaoyang Huang, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

However, estimating scale differences between these patches is non-trivial since the scale differences are determined by both relative camera poses and scene structures, and thus spatially varying over image pairs.

Graph Matching Optical Flow Estimation +2

BlinkFlow: A Dataset to Push the Limits of Event-based Optical Flow Estimation

no code implementations • 14 Mar 2023 • Yijin Li, Zhaoyang Huang, Shuo Chen, Xiaoyu Shi, Hongsheng Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

BlinkSim consists of a configurable rendering engine and a flexible engine for event data simulation.

Event-based Optical Flow Optical Flow Estimation

NICER-SLAM: Neural Implicit Scene Encoding for RGB SLAM

no code implementations • 7 Feb 2023 • Zihan Zhu, Songyou Peng, Viktor Larsson, Zhaopeng Cui, Martin R. Oswald, Andreas Geiger, Marc Pollefeys

Neural implicit representations have recently become popular in simultaneous localization and mapping (SLAM), especially in dense visual SLAM.

3D Scene Reconstruction Novel View Synthesis +2

DPS-Net: Deep Polarimetric Stereo Depth Estimation

no code implementations • ICCV 2023 • Chaoran Tian, Weihong Pan, Zimo Wang, Mao Mao, Guofeng Zhang, Hujun Bao, Ping Tan, Zhaopeng Cui

Stereo depth estimation usually struggles to deal with textureless scenes for both traditional and learning-based methods due to the inherent dependence on image correspondence matching.

Stereo Depth Estimation

RAGO: Recurrent Graph Optimizer For Multiple Rotation Averaging

1 code implementation • CVPR 2022 • Heng Li, Zhaopeng Cui, Shuaicheng Liu, Ping Tan

Our graph optimizer iteratively refines the global camera rotations by minimizing each node's single rotation objective function.

Generative Category-Level Shape and Pose Estimation with Semantic Primitives

1 code implementation • 3 Oct 2022 • Guanglin Li, Yifeng Li, Zhichao Ye, Qihang Zhang, Tao Kong, Zhaopeng Cui, Guofeng Zhang

Then, by using a SIM(3)-invariant shape descriptor, we gracefully decouple the shape and pose of an object, thus supporting latent shape optimization of target objects in arbitrary poses.

Ranked #2 on 6D Pose Estimation using RGBD on REAL275

6D Pose Estimation using RGBD Object

IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis

1 code implementation • ICCV 2023 • Weicai Ye, Shuo Chen, Chong Bao, Hujun Bao, Marc Pollefeys, Zhaopeng Cui, Guofeng Zhang

Existing inverse rendering combined with neural rendering methods can only perform editable novel view synthesis on object-specific scenes, while we present intrinsic neural radiance fields, dubbed IntrinsicNeRF, which introduce intrinsic decomposition into the NeRF-based neural rendering method and can extend its application to room-scale scenes.

Clustering Inverse Rendering +2

177

DELTAR: Depth Estimation from a Light-weight ToF Sensor and RGB Image

no code implementations • 27 Sep 2022 • Yijin Li, Xinyang Liu, Wenqi Dong, Han Zhou, Hujun Bao, Guofeng Zhang, Yinda Zhang, Zhaopeng Cui

Light-weight time-of-flight (ToF) depth sensors are small, cheap, and low-energy, and have been massively deployed on mobile devices for purposes such as autofocus and obstacle detection.

3D Reconstruction Depth Completion +2

NeuMesh: Learning Disentangled Neural Mesh-based Implicit Field for Geometry and Texture Editing

no code implementations • 25 Jul 2022 • Bangbang Yang, Chong Bao, Junyi Zeng, Hujun Bao, Yinda Zhang, Zhaopeng Cui, Guofeng Zhang

Very recently, neural implicit rendering techniques have evolved rapidly and shown great advantages in novel view synthesis and 3D scene reconstruction.

3D Scene Reconstruction Neural Rendering +1

CompNVS: Novel View Synthesis with Scene Completion

no code implementations • 23 Jul 2022 • Zuoyue Li, Tianxing Fan, Zhenqiang Li, Zhaopeng Cui, Yoichi Sato, Marc Pollefeys, Martin R. Oswald

We introduce a scalable framework for novel view synthesis from RGB-D images with largely incomplete scene coverage.

Novel View Synthesis Scene Understanding

DeFlowSLAM: Self-Supervised Scene Motion Decomposition for Dynamic Dense SLAM

1 code implementation • 18 Jul 2022 • Weicai Ye, Xingyuan Yu, Xinyue Lan, Yuhang Ming, Jinyu Li, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

We present a novel dual-flow representation of scene motion that decomposes the optical flow into a static flow field caused by the camera motion and another dynamic flow field caused by the objects' movements in the scene.

Pose Estimation Simultaneous Localization and Mapping

110

Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation

no code implementations • 14 Jul 2022 • Boming Zhao, Bangbang Yang, Zhenyang Li, Zuoyue Li, Guofeng Zhang, Jiashu Zhao, Dawei Yin, Zhaopeng Cui, Hujun Bao

Expanding an existing tourist photo from a partially captured scene to a full scene is one of the desired experiences for photography applications.

PVO: Panoptic Visual Odometry

1 code implementation • CVPR 2023 • Weicai Ye, Xinyue Lan, Shuo Chen, Yuhang Ming, Xingyuan Yu, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

We present PVO, a novel panoptic visual odometry framework to achieve more comprehensive modeling of the scene motion, geometry, and panoptic segmentation information.

Optical Flow Estimation Pose Estimation +3

198

TC-SfM: Robust Track-Community-Based Structure-from-Motion

no code implementations • 13 Jun 2022 • Lei Wang, Linlin Ge, Shan Luo, Zihan Yan, Zhaopeng Cui, Jieqing Feng

Specifically, a novel structure is proposed, namely, track-community, in which each community consists of a group of tracks and represents a local segment in the scene.

Community Detection

Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects

no code implementations • 5 May 2022 • Bangbang Yang, Yinda Zhang, Yijin Li, Zhaopeng Cui, Sean Fanello, Hujun Bao, Guofeng Zhang

We, as human beings, can understand and picture a familiar scene from arbitrary viewpoints given a single image, whereas this is still a grand challenge for computers.

Data Augmentation Neural Rendering +1

FD-SLAM: 3-D Reconstruction Using Features and Dense Matching

no code implementations • 25 Mar 2022 • Xingrui Yang, Yuhang Ming, Zhaopeng Cui, Andrew Calway

It is well known that visual SLAM systems based on dense matching are locally accurate but are also susceptible to long-term drift and map corruption.

Pose Estimation

Hybrid Tracker with Pixel and Instance for Video Panoptic Segmentation

no code implementations • 2 Mar 2022 • Weicai Ye, Xinyue Lan, Ge Su, Hujun Bao, Zhaopeng Cui, Guofeng Zhang

HybridTracker performs pixel tracker and instance tracker in parallel to obtain the association matrices, which are fused into a matching matrix.

Optical Flow Estimation Segmentation +1

SceneSqueezer: Learning To Compress Scene for Camera Relocalization

no code implementations • CVPR 2022 • Luwei Yang, Rakesh Shrestha, Wenbo Li, Shuaicheng Liu, Guofeng Zhang, Zhaopeng Cui, Ping Tan

Standard visual localization methods build an a priori 3D model of a scene, which is used to establish correspondences against the 2D keypoints in a query image.

Camera Relocalization Image Registration +3

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

1 code implementation • CVPR 2022 • Zihan Zhu, Songyou Peng, Viktor Larsson, Weiwei Xu, Hujun Bao, Zhaopeng Cui, Martin R. Oswald, Marc Pollefeys

Neural implicit representations have recently shown encouraging results in various domains, including promising progress in simultaneous localization and mapping (SLAM).

Simultaneous Localization and Mapping

1,368

LatentHuman: Shape-and-Pose Disentangled Latent Representation for Human Bodies

no code implementations • 30 Nov 2021 • Sandro Lombardi, Bangbang Yang, Tianxing Fan, Hujun Bao, Guofeng Zhang, Marc Pollefeys, Zhaopeng Cui

In this work, we propose a novel neural implicit representation for the human body, which is fully differentiable and optimizable with disentangled shape and pose latent spaces.

3D Reconstruction Motion Retargeting +1

Non-local Recurrent Regularization Networks for Multi-view Stereo

no code implementations • 13 Oct 2021 • Qingshan Xu, Martin R. Oswald, Wenbing Tao, Marc Pollefeys, Zhaopeng Cui

However, existing recurrent methods only model the local dependencies in the depth domain, which greatly limits the capability of capturing the global scene context along the depth dimension.

Depth Estimation

Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering

no code implementations • ICCV 2021 • Bangbang Yang, Yinda Zhang, Yinghao Xu, Yijin Li, Han Zhou, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

In this paper, we present a novel neural scene rendering system, which learns an object-compositional neural radiance field and produces realistic rendering with editing capability for a clustered and real-world scene.

Neural Rendering Novel View Synthesis +1

DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization

1 code implementation • ICCV 2021 • Cheng Zhang, Zhaopeng Cui, Cai Chen, Shuaicheng Liu, Bing Zeng, Hujun Bao, Yinda Zhang

Panorama images have a much larger field of view and thus naturally encode richer scene context information than standard perspective images, which, however, is not well exploited in previous scene understanding methods.

Object Relation +1

Vis2Mesh: Efficient Mesh Reconstruction from Unstructured Point Clouds of Large Scenes with Learned Virtual View Visibility

1 code implementation • ICCV 2021 • Shuang Song, Zhaopeng Cui, Rongjun Qin

Then the visibility information of multiple views is aggregated to generate a 3D mesh model by solving an optimization problem considering visibility in which a novel adaptive visibility weighting in surface determination is also introduced to suppress line of sight with a large incident angle.

Binary Classification Depth Completion +1

Deep Hybrid Self-Prior for Full 3D Mesh Generation

no code implementations • ICCV 2021 • Xingkui Wei, Zhengqing Chen, Yanwei Fu, Zhaopeng Cui, Yinda Zhang

We present a deep learning pipeline that leverages network self-prior to recover a full 3D model consisting of both a triangular mesh and a texture map from the colored 3D point cloud.

Surface Reconstruction

End-to-End Rotation Averaging With Multi-Source Propagation

1 code implementation • CVPR 2021 • Luwei Yang, Heng Li, Jamal Ahmed Rahim, Zhaopeng Cui, Ping Tan

These methods can suffer from bad initializations due to the noisy spanning tree or outliers in input relative rotations.

Towards Efficient Graph Convolutional Networks for Point Cloud Handling

no code implementations • ICCV 2021 • Yawei Li, He Chen, Zhaopeng Cui, Radu Timofte, Marc Pollefeys, Gregory Chirikjian, Luc van Gool

In this paper, we aim at improving the computational efficiency of graph convolutional networks (GCNs) for learning on point clouds.

Computational Efficiency

Riggable 3D Face Reconstruction via In-Network Optimization

1 code implementation • CVPR 2021 • Ziqian Bai, Zhaopeng Cui, Xiaoming Liu, Ping Tan

This paper presents a method for riggable 3D face reconstruction from monocular images, which jointly estimates a personalized face rig and per-image parameters including expressions, poses, and illuminations.

3D Face Reconstruction Decoder

140

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

1 code implementation • CVPR 2021 • Cheng Zhang, Zhaopeng Cui, Yinda Zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu

We not only propose an image-based local structured implicit network to improve the object shape estimation, but also refine the 3D object pose and scene layout via a novel implicit scene graph neural network that exploits the implicit local object features.

Ranked #1 on Monocular 3D Object Detection on SUN RGB-D (using extra training data)

3D Shape Reconstruction Monocular 3D Object Detection +4

195

P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching

1 code implementation • ICCV 2021 • Bing Wang, Changhao Chen, Zhaopeng Cui, Jie Qin, Chris Xiaoxuan Lu, Zhengdi Yu, Peijun Zhao, Zhen Dong, Fan Zhu, Niki Trigoni, Andrew Markham

Accurately describing and detecting 2D and 3D keypoints is crucial to establishing correspondences across images and point clouds.

Visual Localization

The Card Shuffling Hypotheses: Building a Time and Memory Efficient Graph Convolutional Network

no code implementations • 1 Jan 2021 • Yawei Li, He Chen, Zhaopeng Cui, Radu Timofte, Marc Pollefeys, Gregory Chirikjian, Luc van Gool

State-of-the-art GCNs adopt $K$-nearest neighbor (KNN) searches for local feature aggregation and feature extraction operations from layer to layer.

3D Classification Point Cloud Classification +2

Sat2Vid: Street-view Panoramic Video Synthesis from a Single Satellite Image

no code implementations • ICCV 2021 • Zuoyue Li, Zhenqiang Li, Zhaopeng Cui, Rongjun Qin, Marc Pollefeys, Martin R. Oswald

For geometrical and temporal consistency, our approach explicitly creates a 3D point cloud representation of the scene and maintains dense 3D-2D correspondences across frames that reflect the geometric scene configuration inferred from the satellite view.

Image Generation

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

no code implementations • 26 Nov 2020 • Miao Liu, Dexin Yang, Yan Zhang, Zhaopeng Cui, James M. Rehg, Siyu Tang

We introduce a novel task of reconstructing a time series of second-person 3D human body meshes from monocular egocentric videos.

Time Series Time Series Analysis

Self-Supervised Human Depth Estimation from Monocular Videos

1 code implementation • CVPR 2020 • Feitong Tan, Hao Zhu, Zhaopeng Cui, Siyu Zhu, Marc Pollefeys, Ping Tan

Previous methods for estimating detailed human depth often require supervised training with 'ground truth' depth data.

Depth Estimation Self-Supervised Learning

OmniSLAM: Omnidirectional Localization and Dense Mapping for Wide-baseline Multi-camera Systems

no code implementations • 18 Mar 2020 • Changhee Won, Hochang Seok, Zhaopeng Cui, Marc Pollefeys, Jongwoo Lim

In this paper, we present an omnidirectional localization and dense mapping system for a wide-baseline multiview stereo setup with ultra-wide field-of-view (FOV) fisheye cameras, which provides 360-degree coverage of stereo observations of the environment.

Depth Estimation Visual Odometry

Reflection Separation using a Pair of Unpolarized and Polarized Images

1 code implementation • NeurIPS 2019 • Youwei Lyu, Zhaopeng Cui, Si Li, Marc Pollefeys, Boxin Shi

When we take photos through glass windows or doors, the transmitted background scene is often blended with undesirable reflection.

DIST: Rendering Deep Implicit Signed Distance Function with Differentiable Sphere Tracing

1 code implementation • CVPR 2020 • Shaohui Liu, Yinda Zhang, Songyou Peng, Boxin Shi, Marc Pollefeys, Zhaopeng Cui

We propose a differentiable sphere tracing algorithm to bridge the gap between inverse graphics methods and the recently proposed deep learning based implicit signed distance function.

215

Polarimetric Relative Pose Estimation

no code implementations • ICCV 2019 • Zhaopeng Cui, Viktor Larsson, Marc Pollefeys

In this paper we consider the problem of relative pose estimation from two images with per-pixel polarimetric information.

Pose Estimation

DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene from Sparse LiDAR Data and Single Color Image

1 code implementation • CVPR 2019 • Jiaxiong Qiu, Zhaopeng Cui, Yinda Zhang, Xingdi Zhang, Shuaicheng Liu, Bing Zeng, Marc Pollefeys

In this paper, we propose a deep learning architecture that produces accurate dense depth for the outdoor scene from a single color image and a sparse depth.

Decoder Depth Completion +2

242

Efficient 2D-3D Matching for Multi-Camera Visual Localization

no code implementations • 17 Sep 2018 • Marcel Geppert, Peidong Liu, Zhaopeng Cui, Marc Pollefeys, Torsten Sattler

This results in a system that provides reliable and drift-less pose estimations for high speed autonomous driving.

Robotics

Polarimetric Dense Monocular SLAM

no code implementations • CVPR 2018 • Luwei Yang, Feitong Tan, Ao Li, Zhaopeng Cui, Yasutaka Furukawa, Ping Tan

This paper presents a novel polarimetric dense monocular SLAM (PDMS) algorithm based on a polarization camera.

Polarimetric Multi-View Stereo

no code implementations • CVPR 2017 • Zhaopeng Cui, Jinwei Gu, Boxin Shi, Ping Tan, Jan Kautz

Multi-view stereo relies on feature correspondences for 3D reconstruction, and thus is fundamentally flawed in dealing with featureless scenes.

3D Reconstruction

Global Structure-From-Motion by Similarity Averaging

no code implementations • ICCV 2015 • Zhaopeng Cui, Ping Tan

Depth images help to upgrade an essential matrix to a similarity transformation, which can determine the scale of relative translation.

Translation

Linear Global Translation Estimation with Feature Tracks

no code implementations • 6 Mar 2015 • Zhaopeng Cui, Nianjuan Jiang, Chengzhou Tang, Ping Tan

This paper derives a novel linear position constraint for cameras seeing a common scene point, which leads to a direct linear method for global camera translation estimation.

Position Translation
