1 code implementation • 7 Feb 2023 • Laura Calem, Hedi Ben-Younes, Patrick Pérez, Nicolas Thome
Predicting multiple trajectories for road users is important for automated driving systems: ego-vehicle motion planning indeed requires a clear view of the possible motions of the surrounding agents.
1 code implementation • 15 Dec 2022 • Oriane Siméoni, Chloé Sekkat, Gilles Puy, Antonin Vobecky, Éloi Zablocki, Patrick Pérez
This way, the salient objects emerge as a by-product without any strong assumption on what an object should be.
1 code implementation • 6 Dec 2022 • Mohammad Fahes, Tuan-Hung Vu, Andrei Bursuc, Patrick Pérez, Raoul de Charette
Domain adaptation has been vastly investigated in computer vision but still requires access to target images at train time, which might be intractable in some uncommon conditions.
Prompt-driven Zero-shot Domain Adaptation
Semantic Segmentation
+2
1 code implementation • 22 Nov 2022 • Mehdi Zemni, Mickaël Chen, Éloi Zablocki, Hédi Ben-Younes, Patrick Pérez, Matthieu Cord
We conduct a set of experiments on counterfactual explanation benchmarks for driving scenes, and we show that our method can be adapted beyond classification, e. g., to explain semantic segmentation models.
no code implementations • 26 Aug 2022 • Simon Roburin, Charles Corbière, Gilles Puy, Nicolas Thome, Matthieu Aubry, Renaud Marlet, Patrick Pérez
Predictive performance of machine learning models trained with empirical risk minimization (ERM) can degrade considerably under distribution shifts.
1 code implementation • 28 Jul 2022 • Léon Zheng, Gilles Puy, Elisa Riccietti, Patrick Pérez, Rémi Gribonval
We introduce a regularization loss based on kernel mean embeddings with rotation-invariant kernels on the hypersphere (also known as dot-product kernels) for self-supervised learning of image representations.
1 code implementation • 25 Jul 2022 • Huy V. Vo, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Jean Ponce
On COCO, using on average 10 fully-annotated images per class, or equivalently 1% of the training set, BiB also reduces the performance gap (in AP) between the weakly-supervised detector and the fully-supervised Fast RCNN by over 70%, showing a good trade-off between performance and data efficiency.
1 code implementation • 27 Jun 2022 • Florent Bartoccioni, Éloi Zablocki, Andrei Bursuc, Patrick Pérez, Matthieu Cord, Karteek Alahari
Recent works in autonomous driving have widely adopted the bird's-eye-view (BEV) semantic map as an intermediate representation of the world.
no code implementations • 11 May 2022 • Soshi Shimada, Vladislav Golyanik, Zhi Li, Patrick Pérez, Weipeng Xu, Christian Theobalt
Marker-less monocular 3D human motion capture (MoCap) with scene interactions is a challenging research topic relevant for extended reality, robotics and virtual avatar generation.
1 code implementation • 25 Apr 2022 • Antoine Saporta, Arthur Douillard, Tuan-Hung Vu, Patrick Pérez, Matthieu Cord
Unsupervised Domain Adaptation (UDA) is a transfer learning task which aims at training on an unlabeled target domain by leveraging a labeled source domain.
1 code implementation • 21 Mar 2022 • Antonin Vobecky, David Hurych, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic
This work investigates learning pixel-wise semantic image segmentation in urban scenes without any manual annotation, just from the raw non-curated data collected by cars which, equipped with cameras and LiDAR sensors, drive around a city.
Ranked #1 on
Unsupervised Semantic Segmentation
on Cityscapes val
1 code implementation • CVPR 2022 • Julien Rebut, Arthur Ouaknine, Waqas Malik, Patrick Pérez
With their robustness to adverse weather conditions and ability to measure speeds, radar sensors have been part of the automotive landscape for more than two decades.
no code implementations • 6 Dec 2021 • Himalaya Jain, Tuan-Hung Vu, Patrick Pérez, Matthieu Cord
With the rapid advances in generative adversarial networks (GANs), the visual quality of synthesised scenes keeps improving, including for complex urban scenes with applications to automated driving.
1 code implementation • 17 Nov 2021 • Paul Jacob, Éloi Zablocki, Hédi Ben-Younes, Mickaël Chen, Patrick Pérez, Matthieu Cord
In this work, we address the problem of producing counterfactual explanations for high-quality images and complex scenes.
2 code implementations • 29 Sep 2021 • Oriane Siméoni, Gilles Puy, Huy V. Vo, Simon Roburin, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Renaud Marlet, Jean Ponce
We also show that training a class-agnostic detector on the discovered objects boosts results by another 7 points.
Ranked #2 on
Weakly-Supervised Object Localization
on CUB-200-2011
(Top-1 Localization Accuracy metric)
1 code implementation • 16 Sep 2021 • Hédi Ben-Younes, Éloi Zablocki, Mickaël Chen, Patrick Pérez, Matthieu Cord
Learning-based trajectory prediction models have encountered great success, with the promise of leveraging contextual information in addition to motion history.
1 code implementation • 8 Sep 2021 • Florent Bartoccioni, Éloi Zablocki, Patrick Pérez, Matthieu Cord, Karteek Alahari
In such a monocular setup, dense depth is obtained with either additional input from one or several expensive LiDARs, e. g., with 64 beams, or camera-only methods, which suffer from scale-ambiguity and infinite-depth problems.
1 code implementation • ICCV 2021 • Antoine Saporta, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez
In this work, we address the task of unsupervised domain adaptation (UDA) for semantic segmentation in presence of multiple target domains: The objective is to train a single model that can handle all these domains at test time.
1 code implementation • NeurIPS 2021 • Huy V. Vo, Elena Sizikova, Cordelia Schmid, Patrick Pérez, Jean Ponce
Extensive experiments on COCO and OpenImages show that, in the single-object discovery setting where a single prominent object is sought in each image, the proposed LOD (Large-scale Object Discovery) approach is on par with, or better than the state of the art for medium-scale datasets (up to 120K images), and over 37% better than the only other algorithms capable of scaling up to 1. 7M images.
1 code implementation • CVPR 2021 • Guillaume Le Moing, Tuan-Hung Vu, Himalaya Jain, Patrick Pérez, Matthieu Cord
Despite the recent progress of generative adversarial networks (GANs) at synthesizing photo-realistic images, producing complex urban scenes remains a challenging problem.
no code implementations • 3 May 2021 • Soshi Shimada, Vladislav Golyanik, Weipeng Xu, Patrick Pérez, Christian Theobalt
We present a new trainable system for physically plausible markerless 3D human motion capture, which achieves state-of-the-art results in a broad range of challenging scenarios.
2 code implementations • ICCV 2021 • Arthur Ouaknine, Alasdair Newson, Patrick Pérez, Florence Tupin, Julien Rebut
Understanding the scene around the ego-vehicle is key to assisted and autonomous driving.
no code implementations • 25 Mar 2021 • Julien Rebut, Andrei Bursuc, Patrick Pérez
Robustness to various image corruptions, caused by changing weather conditions or sensor degradation and aging, is crucial for safety when such vehicles are deployed in the real world.
2 code implementations • 18 Jan 2021 • Maximilian Jaritz, Tuan-Hung Vu, Raoul de Charette, Émilie Wirbel, Patrick Pérez
Domain adaptation is an important task to enable learning when labels are scarce.
no code implementations • 13 Jan 2021 • Éloi Zablocki, Hédi Ben-Younes, Patrick Pérez, Matthieu Cord
The concept of explainability has several facets and the need for explainability is strong in driving, a safety-critical application.
2 code implementations • CVPR 2021 • Spyros Gidaris, Andrei Bursuc, Gilles Puy, Nikos Komodakis, Matthieu Cord, Patrick Pérez
With this in mind, we propose a teacher-student scheme to learn representations by training a convolutional net to reconstruct a bag-of-visual-words (BoW) representation of an image, given as input a perturbed version of that same image.
Ranked #16 on
Semi-Supervised Image Classification
on ImageNet - 1% labeled data
(Top 5 Accuracy metric)
1 code implementation • 15 Dec 2020 • Antonín Vobecký, David Hurych, Michal Uřičář, Patrick Pérez, Josef Šivic
This is achieved with a data generator (called DummyNet) with disentangled control of the pose, the appearance, and the target background scene.
no code implementations • 11 Dec 2020 • Charles Corbière, Nicolas Thome, Antoine Saporta, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez
In this paper, we introduce a novel target criterion for model confidence, namely the true class probability (TCP).
1 code implementation • 9 Dec 2020 • Hédi Ben-Younes, Éloi Zablocki, Patrick Pérez, Matthieu Cord
In this era of active development of autonomous vehicles, it becomes crucial to provide driving systems with the capacity to explain their decisions.
1 code implementation • 4 Dec 2020 • Taylor Mordan, Matthieu Cord, Patrick Pérez, Alexandre Alahi
By increasing the number of attributes jointly learned, we highlight an issue related to the scales of gradients, which arises in MTL with numerous tasks.
no code implementations • 20 Sep 2020 • Ayush Tewari, Mohamed Elgharib, Mallikarjun B R., Florian Bernard, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt
We present the first approach for embedding real portrait images in the latent space of StyleGAN, which allows for intuitive editing of the head pose, facial expression, and scene illumination in the image.
no code implementations • 10 Jul 2020 • Adithya Ranga, Filippo Giruzzi, Jagdish Bhanushali, Emilie Wirbel, Patrick Pérez, Tuan-Hung Vu, Xavier Perrotton
In this paper we propose a multi-task learning model to predict pedestrian actions, crossing intent and forecast their future path from video sequences.
1 code implementation • ECCV 2020 • Huy V. Vo, Patrick Pérez, Jean Ponce
This paper addresses the problem of discovering the objects present in a collection of images without any supervision.
Ranked #1 on
Single-object colocalization
on VOC_6x2
1 code implementation • 23 Jun 2020 • Simon Roburin, Yann de Mont-Marin, Andrei Bursuc, Renaud Marlet, Patrick Pérez, Mathieu Aubry
Normalization Layers (NLs) are widely used in modern deep-learning architectures.
1 code implementation • 15 Jun 2020 • Antoine Saporta, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez
While fully-supervised deep learning yields good models for urban scene semantic segmentation, these models struggle to generalize to new environments with different lighting or weather conditions for instance.
no code implementations • 9 May 2020 • Xu Yao, Gilles Puy, Patrick Pérez
We address the problem of style transfer between two photos and propose a new way to preserve photorealism.
no code implementations • 2 Apr 2020 • Maxime Bucher, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez
In this work, we define and address a novel domain adaptation (DA) problem in semantic scene segmentation, where the target domain not only exhibits a data distribution shift w. r. t.
no code implementations • CVPR 2020 • Ayush Tewari, Mohamed Elgharib, Gaurav Bharaj, Florian Bernard, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt
StyleGAN generates photorealistic portrait images of faces with eyes, teeth, hair and context (neck, shoulders, background), but lacks a rig-like control over semantic face parameters that are interpretable in 3D, such as face pose, expressions, and scene illumination.
1 code implementation • CVPR 2020 • Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, Matthieu Cord
Inspired by the success of NLP methods in this area, in this work we propose a self-supervised approach based on spatially dense image descriptions that encode discrete visual concepts, here called visual words.
no code implementations • 2 Feb 2020 • B Ravi Kiran, Ibrahim Sobh, Victor Talpaert, Patrick Mannion, Ahmad A. Al Sallab, Senthil Yogamani, Patrick Pérez
With the development of deep representation learning, the domain of reinforcement learning (RL) has become a powerful learning framework now capable of learning complex policies in high dimensional environments.
no code implementations • 23 Jan 2020 • Srđan Kitić, Gilles Puy, Patrick Pérez, Philippe Gilberton
We consider the problem of identifying people on the basis of their walk (gait) pattern.
1 code implementation • ECCV 2020 • Himalaya Jain, Spyros Gidaris, Nikos Komodakis, Patrick Pérez, Matthieu Cord
Knowledge distillation refers to the process of training a compact student network to achieve better accuracy by learning from a high capacity teacher network.
1 code implementation • NeurIPS 2019 • Charles Corbière, Nicolas Thome, Avner Bar-Hen, Matthieu Cord, Patrick Pérez
In this paper, we propose a new target criterion for model confidence, corresponding to the True Class Probability (TCP).
1 code implementation • CVPR 2020 • Maximilian Jaritz, Tuan-Hung Vu, Raoul de Charette, Émilie Wirbel, Patrick Pérez
In this work, we explore how to learn from multi-modality and propose cross-modal UDA (xMUDA) where we assume the presence of 2D images and 3D point clouds for 3D semantic segmentation.
no code implementations • 7 Nov 2019 • Victor Besnier, Himalaya Jain, Andrei Bursuc, Matthieu Cord, Patrick Pérez
This naturally brings the question: Can we train a classifier only on the generated data?
1 code implementation • NeurIPS 2019 • Charles Corbière, Nicolas Thome, Avner Bar-Hen, Matthieu Cord, Patrick Pérez
In this paper, we propose a new target criterion for model confidence, corresponding to the True Class Probability (TCP).
1 code implementation • ICCV 2019 • Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, Matthieu Cord
Few-shot learning and self-supervised learning address different facets of the same problem: how to train a model with little or no labeled data.
2 code implementations • NeurIPS 2019 • Maxime Bucher, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez
Semantic segmentation models are limited in their ability to scale to large numbers of object classes.
Ranked #1 on
Zero-Shot Learning
on PASCAL Context
1 code implementation • CVPR 2019 • Martin Engilberge, Louis Chevallier, Patrick Pérez, Matthieu Cord
Our approach is based on a deep architecture that approximates the sorting of arbitrary sets of scores.
2 code implementations • ICCV 2019 • Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, Patrick Pérez
As a result, the performance of the trained semantic segmentation model on the target domain is boosted.
Ranked #17 on
Image-to-Image Translation
on SYNTHIA-to-Cityscapes
no code implementations • CVPR 2019 • Ayush Tewari, Florian Bernard, Pablo Garrido, Gaurav Bharaj, Mohamed Elgharib, Hans-Peter Seidel, Patrick Pérez, Michael Zollhöfer, Christian Theobalt
In contrast, we propose multi-frame video-based self-supervised training of a deep network that (i) learns a face identity model both in shape and appearance while (ii) jointly learning to reconstruct 3D faces.
3 code implementations • CVPR 2019 • Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, Patrick Pérez
Semantic segmentation is a key problem for many computer vision tasks.
Ranked #19 on
Image-to-Image Translation
on GTAV-to-Cityscapes Labels
no code implementations • 9 Nov 2018 • Sanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Duong, Patrick Pérez, Gaël Richard
We tackle the problem of audiovisual scene analysis for weakly-labeled data.
no code implementations • 13 Jun 2018 • Gilles Puy, Patrick Pérez
In contrast to existing convnets that address the same task, our architecture derives directly from the structure of the gradient descent originally used to solve the style transfer problem [Gatys et al., 2016].
no code implementations • 29 May 2018 • Hyeongwoo Kim, Pablo Garrido, Ayush Tewari, Weipeng Xu, Justus Thies, Matthias Nießner, Patrick Pérez, Christian Richardt, Michael Zollhöfer, Christian Theobalt
In order to enable source-to-target video re-animation, we render a synthetic target video with the reconstructed head animation parameters from a source video, and feed it into the trained network -- thus taking full control of the target.
no code implementations • 19 Apr 2018 • Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard
Audio-visual representation learning is an important task from the perspective of designing machines with the ability to understand complex events.
1 code implementation • CVPR 2018 • Martin Engilberge, Louis Chevallier, Patrick Pérez, Matthieu Cord
Several works have proposed to learn a two-path neural network that maps images and texts, respectively, to a same shared Euclidean space where geometry captures useful semantic relationships.
no code implementations • CVPR 2018 • Himalaya Jain, Joaquin Zepeda, Patrick Pérez, Rémi Gribonval
To work at scale, a complete image indexing system comprises two components: An inverted file index to restrict the actual search to only a subset that should contain most of the items relevant to the query; An approximate distance computation mechanism to rapidly scan these lists.
no code implementations • CVPR 2018 • Ayush Tewari, Michael Zollhöfer, Pablo Garrido, Florian Bernard, Hyeongwoo Kim, Patrick Pérez, Christian Theobalt
To alleviate this problem, we present the first approach that jointly learns 1) a regressor for face shape, expression, reflectance and illumination on the basis of 2) a concurrently learned parametric face model.
1 code implementation • 31 Oct 2017 • Eric Grinstein, Ngoc Duong, Alexey Ozerov, Patrick Pérez
"Style transfer" among images has recently emerged as a very active research topic, fuelled by the power of convolution neural networks (CNNs), and has become fast a very popular technology in social media.
Sound Audio and Speech Processing Classical Physics
no code implementations • ICCV 2017 • Himalaya Jain, Joaquin Zepeda, Patrick Pérez, Rémi Gribonval
For large-scale visual search, highly compressed yet meaningful representations of images are essential.
no code implementations • ICCV 2017 • Ayush Tewari, Michael Zollhöfer, Hyeongwoo Kim, Pablo Garrido, Florian Bernard, Patrick Pérez, Christian Theobalt
In this work we propose a novel model-based deep convolutional autoencoder that addresses the highly challenging problem of reconstructing a 3D human face from a single in-the-wild color image.
no code implementations • 24 Feb 2017 • Gilles Puy, Srdan Kitic, Patrick Pérez
This paper deals with the unification of local and non-local signal processing on graphs within a single convolutional neural network (CNN) framework.
no code implementations • CVPR 2017 • Ondrej Miksik, Juan-Manuel Pérez-Rúa, Philip H. S. Torr, Patrick Pérez
Rotoscoping, the detailed delineation of scene elements through a video shot, is a painstaking task of tremendous importance in professional post-production pipelines.
no code implementations • 10 Aug 2016 • Himalaya Jain, Patrick Pérez, Rémi Gribonval, Joaquin Zepeda, Hervé Jégou
This paper tackles the task of storing a large collection of vectors, such as visual descriptors, and of searching in it.
no code implementations • 9 Jun 2016 • Nicolas Keriven, Anthony Bourrier, Rémi Gribonval, Patrick Pérez
We propose a "compressive learning" framework where we estimate model parameters from a sketch of the training data.
no code implementations • 18 Mar 2015 • Alasdair Newson, Andrés Almansa, Matthieu Fradet, Yann Gousseau, Patrick Pérez
Our algorithm is able to deal with a variety of challenging situations which naturally arise in video inpainting, such as the correct reconstruction of dynamic textures, multiple moving objects and moving background.