1 code implementation • 13 Jan 2025 • Xingchen Liu, Piyush Tayal, Jianyuan Wang, Jesus Zarzar, Tom Monnier, Konstantinos Tertikas, Jiali Duan, Antoine Toisoul, Jason Y. Zhang, Natalia Neverova, Andrea Vedaldi, Roman Shapovalov, David Novotny
We introduce Uncommon Objects in 3D (uCO3D), a new object-centric dataset for 3D deep learning and 3D generative AI.
no code implementations • 24 Dec 2024 • Minghao Chen, Roman Shapovalov, Iro Laina, Tom Monnier, Jianyuan Wang, David Novotny, Andrea Vedaldi
Text- or image-to-3D generators and 3D scanners can now produce 3D assets with high-quality shapes and textures.
no code implementations • 2 Jul 2024 • Yawar Siddiqui, Tom Monnier, Filippos Kokkinos, Mahendra Kariya, Yanir Kleiman, Emilien Garreau, Oran Gafni, Natalia Neverova, Andrea Vedaldi, Roman Shapovalov, David Novotny
We present Meta 3D AssetGen (AssetGen), a significant advancement in text-to-3D generation which produces faithful, high-quality meshes with texture and material control.
no code implementations • 2 Jul 2024 • Raphael Bensadoun, Tom Monnier, Yanir Kleiman, Filippos Kokkinos, Yawar Siddiqui, Mahendra Kariya, Omri Harosh, Roman Shapovalov, Benjamin Graham, Emilien Garreau, Animesh Karnewar, Ang Cao, Idan Azuri, Iurii Makarov, Eric-Tuan Le, Antoine Toisoul, David Novotny, Oran Gafni, Natalia Neverova, Andrea Vedaldi
We introduce Meta 3D Gen (3DGen), a new state-of-the-art, fast pipeline for text-to-3D asset generation.
1 code implementation • 30 Apr 2024 • Ang Cao, Justin Johnson, Andrea Vedaldi, David Novotny
Contemporary 3D research, particularly in reconstruction and generation, heavily relies on 2D images for inputs or supervision.
1 code implementation • 25 Mar 2024 • Remy Sabathier, Niloy J. Mitra, David Novotny
We present a method to build animatable dog avatars from monocular videos.
1 code implementation • CVPR 2024 • Lukas Höllein, Aljaž Božič, Norman Müller, David Novotny, Hung-Yu Tseng, Christian Richardt, Michael Zollhöfer, Matthias Nießner
In this paper, we present a method that leverages pretrained text-to-image models as a prior, and learn to generate multi-view images in a single denoising process from real-world data.
1 code implementation • CVPR 2024 • Jianyuan Wang, Nikita Karaev, Christian Rupprecht, David Novotny
Finally we optimise the cameras and triangulate 3D points via a differentiable bundle adjustment layer.
no code implementations • 14 Dec 2023 • Animesh Karnewar, Roman Shapovalov, Tom Monnier, Andrea Vedaldi, Niloy J. Mitra, David Novotny
Encoding information from 2D views of an object into a 3D representation is crucial for generalized 3D feature extraction.
1 code implementation • 7 Dec 2023 • Jianyuan Wang, Nikita Karaev, Christian Rupprecht, David Novotny
Finally, we optimise the cameras and triangulate 3D points via a differentiable bundle adjustment layer.
no code implementations • ICCV 2023 • Animesh Karnewar, Niloy J. Mitra, Andrea Vedaldi, David Novotny
Diffusion-based image generators can now produce high-quality and diverse samples, but their success has yet to fully translate to 3D generation: existing diffusion methods can either generate low-resolution but 3D consistent outputs, or detailed 2D views of 3D objects but with potential structural defects and lacking view consistency or realism.
1 code implementation • ICCV 2023 • Roman Shapovalov, Yanir Kleiman, Ignacio Rocco, David Novotny, Andrea Vedaldi, Changan Chen, Filippos Kokkinos, Ben Graham, Natalia Neverova
We introduce Replay, a collection of multi-view, multi-modal videos of humans interacting socially.
1 code implementation • ICCV 2023 • Jianyuan Wang, Christian Rupprecht, David Novotny
Camera pose estimation is a long-standing computer vision problem that to date often relies on classical methods, such as handcrafted keypoint matching, RANSAC and bundle adjustment.
no code implementations • CVPR 2023 • Animesh Karnewar, Andrea Vedaldi, David Novotny, Niloy Mitra
We show that our diffusion models are scalable, train robustly, and are competitive in terms of sample quality and fidelity to existing approaches for 3D generative modeling.
1 code implementation • 21 Mar 2023 • Ignacio Rocco, Iurii Makarov, Filippos Kokkinos, David Novotny, Benjamin Graham, Natalia Neverova, Andrea Vedaldi
We present a method for fast 3D reconstruction and real-time rendering of dynamic humans from monocular videos with accompanying parametric body fits.
1 code implementation • 6 Dec 2022 • Mohamed El Banani, Ignacio Rocco, David Novotny, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Benjamin Graham
To address this, we propose a self-supervised approach for correspondence estimation that learns from multiview consistency in short RGB-D video sequences.
no code implementations • CVPR 2023 • Samarth Sinha, Roman Shapovalov, Jeremy Reizenstein, Ignacio Rocco, Natalia Neverova, Andrea Vedaldi, David Novotny
Obtaining photorealistic reconstructions of objects from sparse views is inherently ambiguous and can only be achieved by learning suitable reconstruction priors.
no code implementations • 4 Jun 2022 • Gil Avraham, Julian Straub, Tianwei Shen, Tsun-Yi Yang, Hugo Germain, Chris Sweeney, Vasileios Balntas, David Novotny, Daniel DeTone, Richard Newcombe
This paper presents a framework that combines traditional keypoint-based camera pose optimization with an invertible neural rendering mechanism.
1 code implementation • 5 Apr 2022 • Joseph Ortiz, Alexander Clegg, Jing Dong, Edgar Sucar, David Novotny, Michael Zollhoefer, Mustafa Mukadam
We present iSDF, a continual learning system for real-time signed distance field (SDF) reconstruction.
no code implementations • CVPR 2022 • David Novotny, Ignacio Rocco, Samarth Sinha, Alexandre Carlier, Gael Kerchenbaum, Roman Shapovalov, Nikita Smetanin, Natalia Neverova, Benjamin Graham, Andrea Vedaldi
Compared to weaker deformation models, this significantly reduces the reconstruction ambiguity and, for dynamic objects, allows Keypoint Transporter to obtain reconstructions of the quality superior or at least comparable to prior approaches while being much faster and reliant on a pre-trained monocular depth estimator network.
1 code implementation • ICCV 2021 • Jeremy Reizenstein, Roman Shapovalov, Philipp Henzler, Luca Sbordone, Patrick Labatut, David Novotny
Traditional approaches for learning 3D object categories have been predominantly trained and evaluated on synthetic datasets due to the unavailability of real 3D-annotated category-centric data.
no code implementations • ICCV 2021 • Roman Shapovalov, David Novotny, Benjamin Graham, Patrick Labatut, Andrea Vedaldi
The method learns, in an end-to-end fashion, a soft partition of a given category-specific 3D template mesh into rigid parts together with a monocular reconstruction network that predicts the part motions such that they reproject correctly onto 2D DensePose-like surface annotations of the object.
no code implementations • 19 Aug 2021 • Matan Atzmon, David Novotny, Andrea Vedaldi, Yaron Lipman
Implicit neural representation is a recent approach to learn shape collections as zero level-sets of neural networks, where each shape is represented by a latent code.
no code implementations • CVPR 2021 • Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi
We present NeuroMorph, a new neural network architecture that takes as input two 3D shapes and produces in one go, i. e. in a single feed forward pass, a smooth interpolation and point-to-point correspondences between them.
no code implementations • CVPR 2021 • Natalia Neverova, Artsiom Sanakoyeu, Patrick Labatut, David Novotny, Andrea Vedaldi
Recent work has shown that it is possible to learn a unified dense pose predictor for several categories of related objects.
no code implementations • CVPR 2021 • Philipp Henzler, Jeremy Reizenstein, Patrick Labatut, Roman Shapovalov, Tobias Ritschel, Andrea Vedaldi, David Novotny
Our goal is to learn a deep network that, given a small number of images of an object of a given category, reconstructs it in 3D.
2 code implementations • NeurIPS 2020 • Natalia Neverova, David Novotny, Vasil Khalidov, Marc Szafraniec, Patrick Labatut, Andrea Vedaldi
In this work, we focus on the task of learning and representing dense correspondences in deformable object categories.
1 code implementation • 20 Nov 2020 • Benjamin Graham, David Novotny
Using a set of high-quality sparse keypoint matches, we optimize over the per-frame linear combinations of depth planes and camera poses to form a geometrically consistent cloud of keypoints.
no code implementations • NeurIPS 2020 • Benjamin Biggs, Sébastien Ehrhadt, Hanbyul Joo, Benjamin Graham, Andrea Vedaldi, David Novotny
We consider the problem of obtaining dense 3D reconstructions of humans from single and partially occluded views.
Ranked #3 on
Multi-Hypotheses 3D Human Pose Estimation
on AH36M
1 code implementation • NeurIPS 2020 • David Novotny, Roman Shapovalov, Andrea Vedaldi
We propose the Canonical 3D Deformer Map, a new representation of the 3D shape of common object categories that can be learned from a collection of 2D images of independent objects.
3 code implementations • 16 Jul 2020 • Nikhila Ravi, Jeremy Reizenstein, David Novotny, Taylor Gordon, Wan-Yen Lo, Justin Johnson, Georgia Gkioxari
We address these challenges by introducing PyTorch3D, a library of modular, efficient, and differentiable operators for 3D deep learning.
no code implementations • NeurIPS 2019 • Ben Graham, David Novotny, Jeremy Reizenstein
Given a set of a reference RGBD views of an indoor environment, and a new viewpoint, our goal is to predict the view from that location.
no code implementations • NeurIPS 2019 • Natalia Neverova, David Novotny, Andrea Vedaldi
We show that these models, by understanding uncertainty better, can solve the original DensePose task more accurately, thus setting the new state-of-the-art accuracy in this benchmark.
2 code implementations • ICCV 2019 • David Novotny, Nikhila Ravi, Benjamin Graham, Natalia Neverova, Andrea Vedaldi
We propose C3DPO, a method for extracting 3D models of deformable objects from 2D keypoint annotations in unconstrained images.
no code implementations • ECCV 2018 • David Novotny, Samuel Albanie, Diane Larlus, Andrea Vedaldi
Object detection and instance segmentation are dominated by region-based methods such as Mask RCNN.
no code implementations • CVPR 2018 • David Novotny, Samuel Albanie, Diane Larlus, Andrea Vedaldi
Self-supervision can dramatically cut back the amount of manually-labelled data required to train deep neural networks.
no code implementations • ICCV 2017 • David Novotny, Diane Larlus, Andrea Vedaldi
Traditional approaches for learning 3D object categories use either synthetic data or manual supervision.
no code implementations • CVPR 2017 • David Novotny, Diane Larlus, Andrea Vedaldi
Despite significant progress of deep learning in recent years, state-of-the-art semantic matching methods still rely on legacy features such as SIFT or HoG.
no code implementations • 5 Jul 2016 • David Novotny, Diane Larlus, Andrea Vedaldi
While recent research in image understanding has often focused on recognizing more types of objects, understanding more about the objects is just as important.
no code implementations • ICCV 2015 • David Novotny, Jiri Matas
The efficiency is achieved by the use of spatial bins in a novel combination with sparsity-inducing group normalized SVM.