Search Results for author: Vincent Leroy

Found 15 papers, 7 papers with code

DUSt3R: Geometric 3D Vision Made Easy

1 code implementation21 Dec 2023 Shuzhe Wang, Vincent Leroy, Yohann Cabon, Boris Chidlovskii, Jerome Revaud

Our formulation directly provides a 3D model of the scene as well as depth information, but interestingly, we can seamlessly recover from it, pixel matches, relative and absolute camera.

3D Reconstruction Camera Calibration +2

Cross-view and Cross-pose Completion for 3D Human Understanding

no code implementations15 Nov 2023 Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez

Human perception and understanding is a major domain of computer vision which, like many other vision subdomains recently, stands to gain from the use of large models pre-trained on large datasets.

Human Mesh Recovery Self-Supervised Learning

Win-Win: Training High-Resolution Vision Transformers from Two Windows

no code implementations1 Oct 2023 Vincent Leroy, Jerome Revaud, Thomas Lucas, Philippe Weinzaepfel

It is 4 times faster to train than a full-resolution network, and it is straightforward to use at test time compared to existing approaches.

Depth Estimation Depth Prediction +2

SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction

no code implementations19 Sep 2023 Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Fabien Baradel, Salma Galaaoui, Romain Bregier, Matthieu Armando, Jean-Sebastien Franco, Gregory Rogez

Recent hand-object interaction datasets show limited real object variability and rely on fitting the MANO parametric model to obtain groundtruth hand shapes.

3D Reconstruction Benchmarking +1

CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow

1 code implementation ICCV 2023 Philippe Weinzaepfel, Thomas Lucas, Vincent Leroy, Yohann Cabon, Vaibhav Arora, Romain Brégier, Gabriela Csurka, Leonid Antsfeld, Boris Chidlovskii, Jérôme Revaud

Despite impressive performance for high-level downstream tasks, self-supervised pre-training methods have not yet fully delivered on dense geometric vision tasks such as stereo matching or optical flow.

Optical Flow Estimation Position +2

CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion

1 code implementation19 Oct 2022 Philippe Weinzaepfel, Vincent Leroy, Thomas Lucas, Romain Brégier, Yohann Cabon, Vaibhav Arora, Leonid Antsfeld, Boris Chidlovskii, Gabriela Csurka, Jérôme Revaud

More precisely, we propose the pretext task of cross-view completion where the first input image is partially masked, and this masked content has to be reconstructed from the visible content and the second image.

Depth Estimation Depth Prediction +6

MonoNHR: Monocular Neural Human Renderer

1 code implementation2 Oct 2022 Hongsuk Choi, Gyeongsik Moon, Matthieu Armando, Vincent Leroy, Kyoung Mu Lee, Gregory Rogez

Existing neural human rendering methods struggle with a single image input due to the lack of information in invisible areas and the depth ambiguity of pixels in visible areas.

PUMP: Pyramidal and Uniqueness Matching Priors for Unsupervised Learning of Local Descriptors

1 code implementation CVPR 2022 Jérome Revaud, Vincent Leroy, Philippe Weinzaepfel, Boris Chidlovskii

In this paper, we propose to explicitly integrate two matching priors in a single loss in order to learn local descriptors without supervision.

Visual Localization

DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild

1 code implementation ECCV 2020 Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Vincent Leroy, Grégory Rogez

We introduce DOPE, the first method to detect and estimate whole-body 3D human poses, including bodies, hands and faces, in the wild.

3D Pose Estimation

Robust Image Retrieval-based Visual Localization using Kapture

2 code implementations27 Jul 2020 Martin Humenberger, Yohann Cabon, Nicolas Guerin, Julien Morat, Vincent Leroy, Jérôme Revaud, Philippe Rerole, Noé Pion, Cesar De Souza, Gabriela Csurka

To demonstrate this, we present a versatile pipeline for visual localization that facilitates the use of different local and global features, 3D data (e. g. depth maps), non-vision sensor data (e. g. IMU, GPS, WiFi), and various processing algorithms.

Image Retrieval Pose Estimation +2

Shape Reconstruction Using Volume Sweeping and Learned Photoconsistency

no code implementations ECCV 2018 Vincent Leroy, Jean-Sebastien Franco, Edmond Boyer

Our results demonstrate this ability, showing that a CNN, trained on a standard static dataset, can help recover surface details on dynamic scenes that are not perceived by traditional 2D feature based methods.

3D Shape Reconstruction

Multi-View Dynamic Shape Refinement Using Local Temporal Integration

no code implementations ICCV 2017 Vincent Leroy, Jean-Sebastien Franco, Edmond Boyer

We consider 4D shape reconstructions in multi-view environments and investigate how to exploit temporal redundancy for precision refinement.

Super-Resolution Temporal Sequences

Ternary Neural Networks for Resource-Efficient AI Applications

no code implementations1 Sep 2016 Hande Alemdar, Vincent Leroy, Adrien Prost-Boucle, Frédéric Pétrot

In this paper, we propose ternary neural networks (TNNs) in order to make deep learning more resource-efficient.

Cannot find the paper you are looking for? You can Submit a new open access paper.