no code implementations • 16 Mar 2024 • Ziqi Lu, Jianbo Ye, Xiaohan Fei, Xiaolong Li, Jiawei Mo, Ashwin Swaminathan, Stefano Soatto
Neural Radiance Field (NeRF), as an implicit 3D scene representation, lacks inherent ability to accommodate changes made to the initial static scene.
no code implementations • 29 Feb 2024 • Xiaohan Fei, Chethan Parameshwara, Jiawei Mo, Xiaolong Li, Ashwin Swaminathan, Cj Taylor, Paolo Favaro, Stefano Soatto
However, the SDS method is also the source of several artifacts, such as the Janus problem, the misalignment between the text prompt and the generated 3D model, and 3D model inaccuracies.
no code implementations • 6 Jun 2023 • Chethan Parameshwara, Alessandro Achille, Xiaolong Li, Jiawei Mo, Matthew Trager, Ashwin Swaminathan, Cj Taylor, Dheera Venkatraman, Xiaohan Fei, Stefano Soatto
We describe a first step towards learning general-purpose visual representations of physical scenes using only image prediction as a training criterion.
no code implementations • ICCV 2021 • Xiaohan Fei, Henry Wang, Xiangyu Zeng, Lin Lee Cheong, Meng Wang, Joseph Tighe
We propose a fully automated system that simultaneously estimates the camera intrinsics, the ground plane, and physical distances between people from a single RGB image or video captured by a camera viewing a 3-D scene from a fixed vantage point.
1 code implementation • 6 Jun 2021 • Alex Wong, Xiaohan Fei, Byung-Woo Hong, Stefano Soatto
We present a method to infer a dense depth map from a color image and associated sparse depth measurements.
2 code implementations • 15 May 2019 • Alex Wong, Xiaohan Fei, Stephanie Tsuei, Stefano Soatto
Our method first constructs a piecewise planar scaffolding of the scene, and then uses it to infer dense depth using the image along with the sparse points.
Ranked #4 on Depth Completion on VOID
2 code implementations • 30 Jul 2018 • Xiaohan Fei, Alex Wong, Stefano Soatto
We propose using global orientation from inertial measurements, and the bias it induces on the shape of objects populating the scene, to inform visual 3D reconstruction.
no code implementations • ECCV 2018 • Xiaohan Fei, Stefano Soatto
We present a method to populate an unknown environment with models of previously seen objects, placed in a Euclidean reference frame that is inferred causally and on-line using monocular video along with inertial sensors.
no code implementations • CVPR 2017 • Jingming Dong, Xiaohan Fei, Stefano Soatto
We describe a system to detect objects in three-dimensional space using video and inertial sensors (accelerometer and gyrometer), ubiquitous in modern mobile platforms from phones to drones.
no code implementations • 13 Jun 2016 • Jingming Dong, Xiaohan Fei, Stefano Soatto
We describe a system to detect objects in three-dimensional space using video and inertial sensors (accelerometer and gyrometer), ubiquitous in modern mobile platforms from phones to drones.
no code implementations • 20 Nov 2015 • Xiaohan Fei, Konstantine Tsotsos, Stefano Soatto
We propose a data structure obtained by hierarchically averaging bag-of-word descriptors during a sequence of views that achieves average speedups in large-scale loop closure applications ranging from 4 to 20 times on benchmark datasets.