Search Results for author: David Novotny

Found 34 papers, 11 papers with code

Accelerating 3D Deep Learning with PyTorch3D

3 code implementations16 Jul 2020 Nikhila Ravi, Jeremy Reizenstein, David Novotny, Taylor Gordon, Wan-Yen Lo, Justin Johnson, Georgia Gkioxari

We address these challenges by introducing PyTorch3D, a library of modular, efficient, and differentiable operators for 3D deep learning.

Autonomous Vehicles

Real-time volumetric rendering of dynamic humans

1 code implementation21 Mar 2023 Ignacio Rocco, Iurii Makarov, Filippos Kokkinos, David Novotny, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

We present a method for fast 3D reconstruction and real-time rendering of dynamic humans from monocular videos with accompanying parametric body fits.

3D Reconstruction

Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category Reconstruction

1 code implementation ICCV 2021 Jeremy Reizenstein, Roman Shapovalov, Philipp Henzler, Luca Sbordone, Patrick Labatut, David Novotny

Traditional approaches for learning 3D object categories have been predominantly trained and evaluated on synthetic datasets due to the unavailability of real 3D-annotated category-centric data.

3D Reconstruction Neural Rendering +1

C3DPO: Canonical 3D Pose Networks for Non-Rigid Structure From Motion

2 code implementations ICCV 2019 David Novotny, Nikhila Ravi, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

We propose C3DPO, a method for extracting 3D models of deformable objects from 2D keypoint annotations in unconstrained images.

ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models

1 code implementation4 Mar 2024 Lukas Höllein, Aljaž Božič, Norman Müller, David Novotny, Hung-Yu Tseng, Christian Richardt, Michael Zollhöfer, Matthias Nießner

In this paper, we present a method that leverages pretrained text-to-image models as a prior, and learn to generate multi-view images in a single denoising process from real-world data.

Denoising Image Generation +1

Continuous Surface Embeddings

1 code implementation NeurIPS 2020 Natalia Neverova, David Novotny, Vasil Khalidov, Marc Szafraniec, Patrick Labatut, Andrea Vedaldi

In this work, we focus on the task of learning and representing dense correspondences in deformable object categories.

Object Pose Estimation

RidgeSfM: Structure from Motion via Robust Pairwise Matching Under Depth Uncertainty

1 code implementation20 Nov 2020 Benjamin Graham, David Novotny

Using a set of high-quality sparse keypoint matches, we optimize over the per-frame linear combinations of depth planes and camera poses to form a geometrically consistent cloud of keypoints.

Self-Supervised Correspondence Estimation via Multiview Registration

1 code implementation6 Dec 2022 Mohamed El Banani, Ignacio Rocco, David Novotny, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Benjamin Graham

To address this, we propose a self-supervised approach for correspondence estimation that learns from multiview consistency in short RGB-D video sequences.

Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction

1 code implementation NeurIPS 2020 David Novotny, Roman Shapovalov, Andrea Vedaldi

We propose the Canonical 3D Deformer Map, a new representation of the 3D shape of common object categories that can be learned from a collection of 2D images of independent objects.

3D Reconstruction Object

Learning 3D Object Categories by Looking Around Them

no code implementations ICCV 2017 David Novotny, Diane Larlus, Andrea Vedaldi

Traditional approaches for learning 3D object categories use either synthetic data or manual supervision.

Data Augmentation Object

AnchorNet: A Weakly Supervised Network to Learn Geometry-sensitive Features For Semantic Matching

no code implementations CVPR 2017 David Novotny, Diane Larlus, Andrea Vedaldi

Despite significant progress of deep learning in recent years, state-of-the-art semantic matching methods still rely on legacy features such as SIFT or HoG.

Object

Learning the semantic structure of objects from Web supervision

no code implementations5 Jul 2016 David Novotny, Diane Larlus, Andrea Vedaldi

While recent research in image understanding has often focused on recognizing more types of objects, understanding more about the objects is just as important.

Navigate

Cascaded Sparse Spatial Bins for Efficient and Effective Generic Object Detection

no code implementations ICCV 2015 David Novotny, Jiri Matas

The efficiency is achieved by the use of spatial bins in a novel combination with sparsity-inducing group normalized SVM.

object-detection Object Detection

Correlated Uncertainty for Learning Dense Correspondences from Noisy Labels

no code implementations NeurIPS 2019 Natalia Neverova, David Novotny, Andrea Vedaldi

We show that these models, by understanding uncertainty better, can solve the original DensePose task more accurately, thus setting the new state-of-the-art accuracy in this benchmark.

PerspectiveNet: A Scene-consistent Image Generator for New View Synthesis in Real Indoor Environments

no code implementations NeurIPS 2019 Ben Graham, David Novotny, Jeremy Reizenstein

Given a set of a reference RGBD views of an indoor environment, and a new viewpoint, our goal is to predict the view from that location.

NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go

no code implementations CVPR 2021 Marvin Eisenberger, David Novotny, Gael Kerchenbaum, Patrick Labatut, Natalia Neverova, Daniel Cremers, Andrea Vedaldi

We present NeuroMorph, a new neural network architecture that takes as input two 3D shapes and produces in one go, i. e. in a single feed forward pass, a smooth interpolation and point-to-point correspondences between them.

Augmenting Implicit Neural Shape Representations with Explicit Deformation Fields

no code implementations19 Aug 2021 Matan Atzmon, David Novotny, Andrea Vedaldi, Yaron Lipman

Implicit neural representation is a recent approach to learn shape collections as zero level-sets of neural networks, where each shape is represented by a latent code.

DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension

no code implementations ICCV 2021 Roman Shapovalov, David Novotny, Benjamin Graham, Patrick Labatut, Andrea Vedaldi

The method learns, in an end-to-end fashion, a soft partition of a given category-specific 3D template mesh into rigid parts together with a monocular reconstruction network that predicts the part motions such that they reproject correctly onto 2D DensePose-like surface annotations of the object.

3D Reconstruction Monocular Reconstruction +1

KeyTr: Keypoint Transporter for 3D Reconstruction of Deformable Objects in Videos

no code implementations CVPR 2022 David Novotny, Ignacio Rocco, Samarth Sinha, Alexandre Carlier, Gael Kerchenbaum, Roman Shapovalov, Nikita Smetanin, Natalia Neverova, Benjamin Graham, Andrea Vedaldi

Compared to weaker deformation models, this significantly reduces the reconstruction ambiguity and, for dynamic objects, allows Keypoint Transporter to obtain reconstructions of the quality superior or at least comparable to prior approaches while being much faster and reliant on a pre-trained monocular depth estimator network.

3D Reconstruction Depth Estimation +2

Nerfels: Renderable Neural Codes for Improved Camera Pose Estimation

no code implementations4 Jun 2022 Gil Avraham, Julian Straub, Tianwei Shen, Tsun-Yi Yang, Hugo Germain, Chris Sweeney, Vasileios Balntas, David Novotny, Daniel DeTone, Richard Newcombe

This paper presents a framework that combines traditional keypoint-based camera pose optimization with an invertible neural rendering mechanism.

Neural Rendering Pose Estimation

HoloDiffusion: Training a 3D Diffusion Model using 2D Images

no code implementations CVPR 2023 Animesh Karnewar, Andrea Vedaldi, David Novotny, Niloy Mitra

We show that our diffusion models are scalable, train robustly, and are competitive in terms of sample quality and fidelity to existing approaches for 3D generative modeling.

PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

no code implementations ICCV 2023 Jianyuan Wang, Christian Rupprecht, David Novotny

Camera pose estimation is a long-standing computer vision problem that to date often relies on classical methods, such as handcrafted keypoint matching, RANSAC and bundle adjustment.

Pose Estimation

HoloFusion: Towards Photo-realistic 3D Generative Modeling

no code implementations ICCV 2023 Animesh Karnewar, Niloy J. Mitra, Andrea Vedaldi, David Novotny

Diffusion-based image generators can now produce high-quality and diverse samples, but their success has yet to fully translate to 3D generation: existing diffusion methods can either generate low-resolution but 3D consistent outputs, or detailed 2D views of 3D objects but with potential structural defects and lacking view consistency or realism.

3D Generation Super-Resolution

Visual Geometry Grounded Deep Structure From Motion

no code implementations7 Dec 2023 Jianyuan Wang, Nikita Karaev, Christian Rupprecht, David Novotny

Finally, we optimise the cameras and triangulate 3D points via a differentiable bundle adjustment layer.

Point Tracking

GOEnFusion: Gradient Origin Encodings for 3D Forward Diffusion Models

no code implementations14 Dec 2023 Animesh Karnewar, Andrea Vedaldi, Niloy J. Mitra, David Novotny

The recently introduced Forward-Diffusion method allows to train a 3D diffusion model using only 2D images for supervision.

Cannot find the paper you are looking for? You can Submit a new open access paper.