Search Results for author: David Novotny

Found 34 papers, 11 papers with code

Accelerating 3D Deep Learning with PyTorch3D

3 code implementations • 16 Jul 2020 • Nikhila Ravi, Jeremy Reizenstein, David Novotny, Taylor Gordon, Wan-Yen Lo, Justin Johnson, Georgia Gkioxari

We address these challenges by introducing PyTorch3D, a library of modular, efficient, and differentiable operators for 3D deep learning.

Autonomous Vehicles

8,276

Paper
Code

Real-time volumetric rendering of dynamic humans

1 code implementation • 21 Mar 2023 • Ignacio Rocco, Iurii Makarov, Filippos Kokkinos, David Novotny, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

We present a method for fast 3D reconstruction and real-time rendering of dynamic humans from monocular videos with accompanying parametric body fits.

3D Reconstruction

2,402

Paper
Code

Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction

1 code implementation • ICCV 2021 • Jeremy Reizenstein, Roman Shapovalov, Philipp Henzler, Luca Sbordone, Patrick Labatut, David Novotny

Traditional approaches for learning 3D object categories have been predominantly trained and evaluated on synthetic datasets due to the unavailability of real 3D-annotated category-centric data.

3D Reconstruction Neural Rendering +1

899

Paper
Code

iSDF: Real-Time Neural Signed Distance Fields for Robot Perception

1 code implementation • 5 Apr 2022 • Joseph Ortiz, Alexander Clegg, Jing Dong, Edgar Sucar, David Novotny, Michael Zollhoefer, Mustafa Mukadam

We present iSDF, a continual learning system for real-time signed distance field (SDF) reconstruction.

Continual Learning Denoising

410

Paper
Code

C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion

2 code implementations • ICCV 2019 • David Novotny, Nikhila Ravi, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

We propose C3DPO, a method for extracting 3D models of deformable objects from 2D keypoint annotations in unconstrained images.

314

Paper
Code

ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models

1 code implementation • 4 Mar 2024 • Lukas Höllein, Aljaž Božič, Norman Müller, David Novotny, Hung-Yu Tseng, Christian Richardt, Michael Zollhöfer, Matthias Nießner

In this paper, we present a method that leverages pretrained text-to-image models as a prior, and learn to generate multi-view images in a single denoising process from real-world data.

Denoising Image Generation +1

223

Paper
Code

Continuous Surface Embeddings

1 code implementation • NeurIPS 2020 • Natalia Neverova, David Novotny, Vasil Khalidov, Marc Szafraniec, Patrick Labatut, Andrea Vedaldi

In this work, we focus on the task of learning and representing dense correspondences in deformable object categories.

Object Pose Estimation

125

Paper
Code

RidgeSfM: Structure from Motion via Robust Pairwise Matching Under Depth Uncertainty

1 code implementation • 20 Nov 2020 • Benjamin Graham, David Novotny

Using a set of high-quality sparse keypoint matches, we optimize over the per-frame linear combinations of depth planes and camera poses to form a geometrically consistent cloud of keypoints.

112

Paper
Code

Self-Supervised Correspondence Estimation via Multiview Registration

1 code implementation • 6 Dec 2022 • Mohamed El Banani, Ignacio Rocco, David Novotny, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Benjamin Graham

To address this, we propose a self-supervised approach for correspondence estimation that learns from multiview consistency in short RGB-D video sequences.

Paper
Code

Replay: Multi-modal Multi-view Acted Videos for Casual Holography

1 code implementation • ICCV 2023 • Roman Shapovalov, Yanir Kleiman, Ignacio Rocco, David Novotny, Andrea Vedaldi, Changan Chen, Filippos Kokkinos, Ben Graham, Natalia Neverova

We introduce Replay, a collection of multi-view, multi-modal videos of humans interacting socially.

3D Reconstruction Novel View Synthesis

Paper
Code

Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction

1 code implementation • NeurIPS 2020 • David Novotny, Roman Shapovalov, Andrea Vedaldi

We propose the Canonical 3D Deformer Map, a new representation of the 3D shape of common object categories that can be learned from a collection of 2D images of independent objects.

3D Reconstruction Object

Paper
Code

Self-supervised Learning of Geometrically Stable Features Through Probabilistic Introspection

no code implementations • CVPR 2018 • David Novotny, Samuel Albanie, Diane Larlus, Andrea Vedaldi

Self-supervision can dramatically cut back the amount of manually-labelled data required to train deep neural networks.

Image Classification Self-Supervised Learning +1

Paper
Add Code

Learning 3D Object Categories by Looking Around Them

no code implementations • ICCV 2017 • David Novotny, Diane Larlus, Andrea Vedaldi

Traditional approaches for learning 3D object categories use either synthetic data or manual supervision.

Data Augmentation Object

Paper
Add Code

AnchorNet: A Weakly Supervised Network to Learn Geometry-sensitive Features For Semantic Matching

no code implementations • CVPR 2017 • David Novotny, Diane Larlus, Andrea Vedaldi

Despite significant progress of deep learning in recent years, state-of-the-art semantic matching methods still rely on legacy features such as SIFT or HoG.

Object

Paper
Add Code

Learning the semantic structure of objects from Web supervision

no code implementations • 5 Jul 2016 • David Novotny, Diane Larlus, Andrea Vedaldi

While recent research in image understanding has often focused on recognizing more types of objects, understanding more about the objects is just as important.

Navigate

Paper
Add Code

Cascaded Sparse Spatial Bins for Efficient and Effective Generic Object Detection

no code implementations • ICCV 2015 • David Novotny, Jiri Matas

The efficiency is achieved by the use of spatial bins in a novel combination with sparsity-inducing group normalized SVM.

object-detection Object Detection

Paper
Add Code

Semi-convolutional Operators for Instance Segmentation

no code implementations • ECCV 2018 • David Novotny, Samuel Albanie, Diane Larlus, Andrea Vedaldi

Object detection and instance segmentation are dominated by region-based methods such as Mask RCNN.

Instance Segmentation object-detection +3

Paper
Add Code

Correlated Uncertainty for Learning Dense Correspondences from Noisy Labels

no code implementations • NeurIPS 2019 • Natalia Neverova, David Novotny, Andrea Vedaldi

We show that these models, by understanding uncertainty better, can solve the original DensePose task more accurately, thus setting the new state-of-the-art accuracy in this benchmark.

Paper
Add Code

PerspectiveNet: A Scene-consistent Image Generator for New View Synthesis in Real Indoor Environments

no code implementations • NeurIPS 2019 • Ben Graham, David Novotny, Jeremy Reizenstein

Given a set of a reference RGBD views of an indoor environment, and a new viewpoint, our goal is to predict the view from that location.

Paper
Add Code

3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data

no code implementations • NeurIPS 2020 • Benjamin Biggs, Sébastien Ehrhadt, Hanbyul Joo, Benjamin Graham, Andrea Vedaldi, David Novotny

We consider the problem of obtaining dense 3D reconstructions of humans from single and partially occluded views.

3D Reconstruction

Paper
Add Code

Unsupervised Learning of 3D Object Categories from Videos in the Wild

no code implementations • CVPR 2021 • Philipp Henzler, Jeremy Reizenstein, Patrick Labatut, Roman Shapovalov, Tobias Ritschel, Andrea Vedaldi, David Novotny

Our goal is to learn a deep network that, given a small number of images of an object of a given category, reconstructs it in 3D.

Benchmarking Monocular Reconstruction +1

Paper
Add Code

NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go

no code implementations • CVPR 2021 • Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi

We present NeuroMorph, a new neural network architecture that takes as input two 3D shapes and produces in one go, i. e. in a single feed forward pass, a smooth interpolation and point-to-point correspondences between them.

Paper
Add Code

Discovering Relationships between Object Categories via Universal Canonical Maps

no code implementations • CVPR 2021 • Natalia Neverova, Artsiom Sanakoyeu, Patrick Labatut, David Novotny, Andrea Vedaldi

Recent work has shown that it is possible to learn a unified dense pose predictor for several categories of related objects.

Object Pose Prediction

Paper
Add Code

Augmenting Implicit Neural Shape Representations with Explicit Deformation Fields

no code implementations • 19 Aug 2021 • Matan Atzmon, David Novotny, Andrea Vedaldi, Yaron Lipman

Implicit neural representation is a recent approach to learn shape collections as zero level-sets of neural networks, where each shape is represented by a latent code.

Paper
Add Code

DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension

no code implementations • ICCV 2021 • Roman Shapovalov, David Novotny, Benjamin Graham, Patrick Labatut, Andrea Vedaldi

The method learns, in an end-to-end fashion, a soft partition of a given category-specific 3D template mesh into rigid parts together with a monocular reconstruction network that predicts the part motions such that they reproject correctly onto 2D DensePose-like surface annotations of the object.

3D Reconstruction Monocular Reconstruction +1

Paper
Add Code

KeyTr: Keypoint Transporter for 3D Reconstruction of Deformable Objects in Videos

no code implementations • CVPR 2022 • David Novotny, Ignacio Rocco, Samarth Sinha, Alexandre Carlier, Gael Kerchenbaum, Roman Shapovalov, Nikita Smetanin, Natalia Neverova, Benjamin Graham, Andrea Vedaldi

Compared to weaker deformation models, this significantly reduces the reconstruction ambiguity and, for dynamic objects, allows Keypoint Transporter to obtain reconstructions of the quality superior or at least comparable to prior approaches while being much faster and reliant on a pre-trained monocular depth estimator network.

3D Reconstruction Depth Estimation +2

Paper
Add Code

Nerfels: Renderable Neural Codes for Improved Camera Pose Estimation

no code implementations • 4 Jun 2022 • Gil Avraham, Julian Straub, Tianwei Shen, Tsun-Yi Yang, Hugo Germain, Chris Sweeney, Vasileios Balntas, David Novotny, Daniel DeTone, Richard Newcombe

This paper presents a framework that combines traditional keypoint-based camera pose optimization with an invertible neural rendering mechanism.

Neural Rendering Pose Estimation

Paper
Add Code

Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories

no code implementations • CVPR 2023 • Samarth Sinha, Roman Shapovalov, Jeremy Reizenstein, Ignacio Rocco, Natalia Neverova, Andrea Vedaldi, David Novotny

Obtaining photorealistic reconstructions of objects from sparse views is inherently ambiguous and can only be achieved by learning suitable reconstruction priors.

3D Reconstruction 4D reconstruction +2

Paper
Add Code

HoloDiffusion: Training a 3D Diffusion Model using 2D Images

no code implementations • CVPR 2023 • Animesh Karnewar, Andrea Vedaldi, David Novotny, Niloy Mitra

We show that our diffusion models are scalable, train robustly, and are competitive in terms of sample quality and fidelity to existing approaches for 3D generative modeling.

Paper
Add Code

PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

no code implementations • ICCV 2023 • Jianyuan Wang, Christian Rupprecht, David Novotny

Camera pose estimation is a long-standing computer vision problem that to date often relies on classical methods, such as handcrafted keypoint matching, RANSAC and bundle adjustment.

Pose Estimation

Paper
Add Code

HoloFusion: Towards Photo-realistic 3D Generative Modeling

no code implementations • ICCV 2023 • Animesh Karnewar, Niloy J. Mitra, Andrea Vedaldi, David Novotny

Diffusion-based image generators can now produce high-quality and diverse samples, but their success has yet to fully translate to 3D generation: existing diffusion methods can either generate low-resolution but 3D consistent outputs, or detailed 2D views of 3D objects but with potential structural defects and lacking view consistency or realism.

3D Generation Super-Resolution