no code implementations • 22 Feb 2024 • Jason Y. Zhang, Amy Lin, Moneish Kumar, Tzu-Hsuan Yang, Deva Ramanan, Shubham Tulsiani
Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparsely sampled views (<10).
1 code implementation • 8 May 2023 • Amy Lin, Jason Y. Zhang, Deva Ramanan, Shubham Tulsiani
We address the task of estimating 6D camera poses from sparse-view image sets (2-8 images).
no code implementations • CVPR 2023 • Haithem Turki, Jason Y. Zhang, Francesco Ferroni, Deva Ramanan
We extend neural radiance fields (NeRFs) to dynamic large-scale urban scenes.
no code implementations • CVPR 2023 • Samarth Sinha, Jason Y. Zhang, Andrea Tagliasacchi, Igor Gilitschenski, David B. Lindell
Camera pose estimation is a key step in standard 3D reconstruction pipelines that operate on a dense set of images of a single object or scene.
1 code implementation • 11 Aug 2022 • Jason Y. Zhang, Deva Ramanan, Shubham Tulsiani
We describe a data-driven method for inferring the camera viewpoints given multiple images of an arbitrary object.
1 code implementation • NeurIPS 2021 • Jason Y. Zhang, Gengshan Yang, Shubham Tulsiani, Deva Ramanan
NeRS learns a neural shape representation of a closed surface that is diffeomorphic to a sphere, guaranteeing water-tight reconstructions.
1 code implementation • ECCV 2020 • Jason Y. Zhang, Sam Pepose, Hanbyul Joo, Deva Ramanan, Jitendra Malik, Angjoo Kanazawa
We present a method that infers spatial arrangements and shapes of humans and objects in a globally consistent 3D scene, all from a single image in-the-wild captured in an uncontrolled environment.
Ranked #3 on 3D Object Reconstruction on BEHAVE
1 code implementation • ICCV 2019 • Jason Y. Zhang, Panna Felsen, Angjoo Kanazawa, Jitendra Malik
In this work, we present perhaps the first approach for predicting a future 3D mesh model sequence of a person from past video input.
1 code implementation • CVPR 2019 • Angjoo Kanazawa, Jason Y. Zhang, Panna Felsen, Jitendra Malik
We present a framework that can similarly learn a representation of 3D dynamics of humans from video via a simple but effective temporal encoding of image features.
Ranked #15 on 3D Human Pose Estimation on 3DPW (Acceleration Error metric)