Search Results for author: Xiaohan Fei

Found 11 papers, 3 papers with code

Fast Sparse View Guided NeRF Update for Object Reconfigurations

no code implementations • 16 Mar 2024 • Ziqi Lu, Jianbo Ye, Xiaohan Fei, Xiaolong Li, Jiawei Mo, Ashwin Swaminathan, Stefano Soatto

Neural Radiance Field (NeRF), as an implicit 3D scene representation, lacks inherent ability to accommodate changes made to the initial static scene.

Paper
Add Code

A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D

no code implementations • 29 Feb 2024 • Xiaohan Fei, Chethan Parameshwara, Jiawei Mo, Xiaolong Li, Ashwin Swaminathan, Cj Taylor, Paolo Favaro, Stefano Soatto

However, the SDS method is also the source of several artifacts, such as the Janus problem, the misalignment between the text prompt and the generated 3D model, and 3D model inaccuracies.

Image Generation Text to 3D

Paper
Add Code

Towards Visual Foundational Models of Physical Scenes

no code implementations • 6 Jun 2023 • Chethan Parameshwara, Alessandro Achille, Xiaolong Li, Jiawei Mo, Matthew Trager, Ashwin Swaminathan, Cj Taylor, Dheera Venkatraman, Xiaohan Fei, Stefano Soatto

We describe a first step towards learning general-purpose visual representations of physical scenes using only image prediction as a training criterion.

Paper
Add Code

Single View Physical Distance Estimation using Human Pose

no code implementations • ICCV 2021 • Xiaohan Fei, Henry Wang, Xiangyu Zeng, Lin Lee Cheong, Meng Wang, Joseph Tighe

We propose a fully automated system that simultaneously estimates the camera intrinsics, the ground plane, and physical distances between people from a single RGB image or video captured by a camera viewing a 3-D scene from a fixed vantage point.

Camera Calibration

Paper
Add Code

An Adaptive Framework for Learning Unsupervised Depth Completion

1 code implementation • 6 Jun 2021 • Alex Wong, Xiaohan Fei, Byung-Woo Hong, Stefano Soatto

We present a method to infer a dense depth map from a color image and associated sparse depth measurements.

Depth Completion

Paper
Code

Unsupervised Depth Completion from Visual Inertial Odometry

2 code implementations • 15 May 2019 • Alex Wong, Xiaohan Fei, Stephanie Tsuei, Stefano Soatto

Our method first constructs a piecewise planar scaffolding of the scene, and then uses it to infer dense depth using the image along with the sparse points.

Ranked #4 on Depth Completion on VOID

Depth Completion

182

Paper
Code

Geo-Supervised Visual Depth Prediction

2 code implementations • 30 Jul 2018 • Xiaohan Fei, Alex Wong, Stefano Soatto

We propose using global orientation from inertial measurements, and the bias it induces on the shape of objects populating the scene, to inform visual 3D reconstruction.

3D Reconstruction Depth Estimation +1

841

Paper
Code

Visual-Inertial Object Detection and Mapping

no code implementations • ECCV 2018 • Xiaohan Fei, Stefano Soatto

We present a method to populate an unknown environment with models of previously seen objects, placed in a Euclidean reference frame that is inferred causally and on-line using monocular video along with inertial sensors.

Object object-detection +1

Paper
Add Code

Visual-Inertial-Semantic Scene Representation for 3D Object Detection

no code implementations • CVPR 2017 • Jingming Dong, Xiaohan Fei, Stefano Soatto

We describe a system to detect objects in three-dimensional space using video and inertial sensors (accelerometer and gyrometer), ubiquitous in modern mobile platforms from phones to drones.

3D Object Detection object-detection

Paper
Add Code

Visual-Inertial-Semantic Scene Representation for 3-D Object Detection

no code implementations • 13 Jun 2016 • Jingming Dong, Xiaohan Fei, Stefano Soatto

We describe a system to detect objects in three-dimensional space using video and inertial sensors (accelerometer and gyrometer), ubiquitous in modern mobile platforms from phones to drones.

object-detection Object Detection

Paper
Add Code

A Simple Hierarchical Pooling Data Structure for Loop Closure

no code implementations • 20 Nov 2015 • Xiaohan Fei, Konstantine Tsotsos, Stefano Soatto

We propose a data structure obtained by hierarchically averaging bag-of-word descriptors during a sequence of views that achieves average speedups in large-scale loop closure applications ranging from 4 to 20 times on benchmark datasets.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.