1 code implementation • 21 Dec 2023 • Shuzhe Wang, Vincent Leroy, Yohann Cabon, Boris Chidlovskii, Jerome Revaud
Our formulation directly provides a 3D model of the scene as well as depth information, but interestingly, we can seamlessly recover from it, pixel matches, relative and absolute camera.
no code implementations • 15 Nov 2023 • Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez
Human perception and understanding is a major domain of computer vision which, like many other vision subdomains recently, stands to gain from the use of large models pre-trained on large datasets.
no code implementations • 1 Oct 2023 • Vincent Leroy, Jerome Revaud, Thomas Lucas, Philippe Weinzaepfel
It is 4 times faster to train than a full-resolution network, and it is straightforward to use at test time compared to existing approaches.
no code implementations • 19 Sep 2023 • Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel, Fabien Baradel, Salma Galaaoui, Romain Bregier, Matthieu Armando, Jean-Sebastien Franco, Gregory Rogez
Recent hand-object interaction datasets show limited real object variability and rely on fitting the MANO parametric model to obtain groundtruth hand shapes.
no code implementations • 12 Jun 2023 • Matthieu Armando, Laurence Boissieux, Edmond Boyer, Jean-Sebastien Franco, Martin Humenberger, Christophe Legras, Vincent Leroy, Mathieu Marsot, Julien Pansiot, Sergi Pujades, Rim Rekik, Gregory Rogez, Anilkumar Swamy, Stefanie Wuhrer
This work presents 4DHumanOutfit, a new dataset of densely sampled spatio-temporal 4D human motion data of different actors, outfits and motions.
1 code implementation • ICCV 2023 • Philippe Weinzaepfel, Thomas Lucas, Vincent Leroy, Yohann Cabon, Vaibhav Arora, Romain Brégier, Gabriela Csurka, Leonid Antsfeld, Boris Chidlovskii, Jérôme Revaud
Despite impressive performance for high-level downstream tasks, self-supervised pre-training methods have not yet fully delivered on dense geometric vision tasks such as stereo matching or optical flow.
Ranked #1 on Optical Flow Estimation on KITTI 2012
1 code implementation • 19 Oct 2022 • Philippe Weinzaepfel, Vincent Leroy, Thomas Lucas, Romain Brégier, Yohann Cabon, Vaibhav Arora, Leonid Antsfeld, Boris Chidlovskii, Gabriela Csurka, Jérôme Revaud
More precisely, we propose the pretext task of cross-view completion where the first input image is partially masked, and this masked content has to be reconstructed from the visible content and the second image.
1 code implementation • 2 Oct 2022 • Hongsuk Choi, Gyeongsik Moon, Matthieu Armando, Vincent Leroy, Kyoung Mu Lee, Gregory Rogez
Existing neural human rendering methods struggle with a single image input due to the lack of information in invisible areas and the depth ambiguity of pixels in visible areas.
1 code implementation • CVPR 2022 • Jérome Revaud, Vincent Leroy, Philippe Weinzaepfel, Boris Chidlovskii
In this paper, we propose to explicitly integrate two matching priors in a single loss in order to learn local descriptors without supervision.
no code implementations • 4 Dec 2020 • Vincent Leroy, Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Grégory Rogez
Predicting 3D human pose from images has seen great recent improvements.
1 code implementation • ECCV 2020 • Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Vincent Leroy, Grégory Rogez
We introduce DOPE, the first method to detect and estimate whole-body 3D human poses, including bodies, hands and faces, in the wild.
2 code implementations • 27 Jul 2020 • Martin Humenberger, Yohann Cabon, Nicolas Guerin, Julien Morat, Vincent Leroy, Jérôme Revaud, Philippe Rerole, Noé Pion, Cesar De Souza, Gabriela Csurka
To demonstrate this, we present a versatile pipeline for visual localization that facilitates the use of different local and global features, 3D data (e. g. depth maps), non-vision sensor data (e. g. IMU, GPS, WiFi), and various processing algorithms.
no code implementations • ECCV 2018 • Vincent Leroy, Jean-Sebastien Franco, Edmond Boyer
Our results demonstrate this ability, showing that a CNN, trained on a standard static dataset, can help recover surface details on dynamic scenes that are not perceived by traditional 2D feature based methods.
no code implementations • ICCV 2017 • Vincent Leroy, Jean-Sebastien Franco, Edmond Boyer
We consider 4D shape reconstructions in multi-view environments and investigate how to exploit temporal redundancy for precision refinement.
no code implementations • 1 Sep 2016 • Hande Alemdar, Vincent Leroy, Adrien Prost-Boucle, Frédéric Pétrot
In this paper, we propose ternary neural networks (TNNs) in order to make deep learning more resource-efficient.