Search Results for author: Romain Brégier

Found 15 papers, 7 papers with code

Deep Regression on Manifolds: A 3D Rotation Case Study

1 code implementation30 Mar 2021 Romain Brégier

Many machine learning problems involve regressing variables on a non-Euclidean manifold -- e. g. a discrete probability distribution, or the 6D pose of an object.

regression

CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion

1 code implementation19 Oct 2022 Philippe Weinzaepfel, Vincent Leroy, Thomas Lucas, Romain Brégier, Yohann Cabon, Vaibhav Arora, Leonid Antsfeld, Boris Chidlovskii, Gabriela Csurka, Jérôme Revaud

More precisely, we propose the pretext task of cross-view completion where the first input image is partially masked, and this masked content has to be reconstructed from the visible content and the second image.

Depth Estimation Depth Prediction +6

CroCo v2: Improved Cross-view Completion Pre-training for Stereo Matching and Optical Flow

1 code implementation ICCV 2023 Philippe Weinzaepfel, Thomas Lucas, Vincent Leroy, Yohann Cabon, Vaibhav Arora, Romain Brégier, Gabriela Csurka, Leonid Antsfeld, Boris Chidlovskii, Jérôme Revaud

Despite impressive performance for high-level downstream tasks, self-supervised pre-training methods have not yet fully delivered on dense geometric vision tasks such as stereo matching or optical flow.

Optical Flow Estimation Position +2

DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild

1 code implementation ECCV 2020 Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Vincent Leroy, Grégory Rogez

We introduce DOPE, the first method to detect and estimate whole-body 3D human poses, including bodies, hands and faces, in the wild.

3D Pose Estimation

PoseBERT: A Generic Transformer Module for Temporal 3D Human Modeling

1 code implementation22 Aug 2022 Fabien Baradel, Romain Brégier, Thibault Groueix, Philippe Weinzaepfel, Yannis Kalantidis, Grégory Rogez

It is simple, generic and versatile, as it can be plugged on top of any image-based model to transform it in a video-based model leveraging temporal information.

Pose Estimation Pose Prediction

Object segmentation in depth maps with one user click and a synthetically trained fully convolutional network

no code implementations4 Jan 2018 Matthieu Grard, Romain Brégier, Florian Sella, Emmanuel Dellandréa, Liming Chen

We thus propose a step towards a practical interactive application for generating an object-oriented robotic grasp, requiring as inputs only one depth map of the scene and one user click on the next object to extract.

Instance Segmentation Object +4

Defining the Pose of any 3D Rigid Object and an Associated Distance

no code implementations14 Dec 2016 Romain Brégier, Frédéric Devernay, Laetitia Leyrit, James Crowley

The pose of a rigid object is usually regarded as a rigid transformation, described by a translation and a rotation.

Object Pose Estimation +1

Symmetry Aware Evaluation of 3D Object Detection and Pose Estimation in Scenes of Many Parts in Bulk

no code implementations21 Jun 2018 Romain Brégier, Frédéric Devernay, Laetitia Leyrit, James Crowley

While 3D object detection and pose estimation has been studied for a long time, its evaluation is not yet completely satisfactory.

3D Object Detection Object +2

SACReg: Scene-Agnostic Coordinate Regression for Visual Localization

no code implementations21 Jul 2023 Jerome Revaud, Yohann Cabon, Romain Brégier, Jongmin Lee, Philippe Weinzaepfel

Instead of encoding the scene coordinates into the network weights, our model takes as input a database image with some sparse 2D pixel to 3D coordinate annotations, extracted from e. g. off-the-shelf Structure-from-Motion or RGB-D data, and a query image for which are predicted a dense 3D coordinate map and its confidence, based on cross-attention.

Image Retrieval regression +2

MFOS: Model-Free & One-Shot Object Pose Estimation

no code implementations3 Oct 2023 Jongmin Lee, Yohann Cabon, Romain Brégier, Sungjoo Yoo, Jerome Revaud

Existing learning-based methods for object pose estimation in RGB images are mostly model-specific or category based.

Object Pose Estimation

Cross-view and Cross-pose Completion for 3D Human Understanding

no code implementations15 Nov 2023 Matthieu Armando, Salma Galaaoui, Fabien Baradel, Thomas Lucas, Vincent Leroy, Romain Brégier, Philippe Weinzaepfel, Grégory Rogez

Human perception and understanding is a major domain of computer vision which, like many other vision subdomains recently, stands to gain from the use of large models pre-trained on large datasets.

Human Mesh Recovery Self-Supervised Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.