no code implementations • 21 Feb 2024 • Gianluca Monaci, Leonid Antsfeld, Boris Chidlovskii, Christian Wolf
Bird's-eye view (BEV) maps are an important geometrically structured representation widely used in robotics, in particular self-driving vehicles and terrestrial robots.
no code implementations • CVPR 2024 • Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf
We evaluate the method on a wide variety of tasks from the CortexBench benchmark and show that, compared to existing work, it can be addressed with a single policy.
no code implementations • CVPR 2024 • Guillaume Bono, Hervé Poirier, Leonid Antsfeld, Gianluca Monaci, Boris Chidlovskii, Christian Wolf
In the context of autonomous navigation of terrestrial robots, the creation of realistic models for agent dynamics and sensing is a widespread habit in the robotics literature and in commercial applications, where they are used for model based control and/or for localization and mapping.
no code implementations • 24 Jan 2024 • Assem Sadek, Guillaume Bono, Boris Chidlovskii, Atilla Baskurt, Christian Wolf
More recently, beyond waypoint planning, problems involving significant components of (visual) high-level reasoning have been explored in simulated environments, mostly addressed with large-scale machine learning, in particular RL, offline-RL or imitation learning.
no code implementations • 28 Sep 2023 • Guillaume Bono, Leonid Antsfeld, Boris Chidlovskii, Philippe Weinzaepfel, Christian Wolf
The main challenge lies in learning compact representations generalizable to unseen environments and in learning high-capacity perception modules capable of reasoning on high-dimensional input.
no code implementations • 6 Jun 2023 • Guillaume Bono, Leonid Antsfeld, Assem Sadek, Gianluca Monaci, Christian Wolf
Agents navigating in 3D environments require some form of memory, which should hold a compact and actionable representation of the history of observations useful for decision taking and planning.
1 code implementation • 21 Apr 2023 • Pierre Marza, Laetitia Matignon, Olivier Simonin, Dhruv Batra, Christian Wolf, Devendra Singh Chaplot
Empirical results show that NeRFs can be trained on actively collected data using just a single episode of experience in an unseen environment, and can be used for several downstream robotic tasks, and that modular trained exploration models outperform other classical and end-to-end baselines.
1 code implementation • 16 Feb 2023 • Steeven Janny, Aurélien Béneteau, Madiha Nadri, Julie Digne, Nicolas Thome, Christian Wolf
To perform future forecasting of pressure and velocity on the challenging EAGLE dataset, we introduce a new mesh transformer.
1 code implementation • ICCV 2023 • Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf
Understanding and mapping a new environment are core abilities of any autonomously navigating agent.
no code implementations • 19 Apr 2022 • Steeven Janny, Quentin Possamai, Laurent Bako, Madiha Nadri, Christian Wolf
The identification of a nonlinear dynamic model is an open topic in control theory, especially from sparse input-output measurements.
no code implementations • 14 Feb 2022 • Pierre Marza, Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf
We also study the impact of two methods to incorporate the information about objects necessary for answering a question, in the reasoning module directly, and earlier in the object selection stage.
no code implementations • ICLR 2022 • Steeven Janny, Fabien Baradel, Natalia Neverova, Madiha Nadri, Greg Mori, Christian Wolf
Learning causal relationships in high-dimensional data (images, videos) is a hard task, as they are often defined on low dimensional manifolds and must be extracted from complex signals dominated by appearance, lighting, textures and also spurious correlations in the data.
no code implementations • 22 Dec 2021 • Edward Beeching, Maxim Peter, Philippe Marcotte, Jilles Debangoye, Olivier Simonin, Joshua Romoff, Christian Wolf
We address planning and navigation in challenging 3D video games featuring maps with disconnected regions reachable by agents using special actions.
1 code implementation • 7 Dec 2021 • Edward Beeching, Jilles Debangoye, Olivier Simonin, Christian Wolf
We present Godot Reinforcement Learning (RL) Agents, an open-source interface for developing environments and agents in the Godot Game Engine.
no code implementations • 29 Nov 2021 • Assem Sadek, Guillaume Bono, Boris Chidlovskii, Christian Wolf
In this work we present an in-depth study of the performance and reasoning capacities of real physical agents, trained in simulation and deployed to two different physical environments.
1 code implementation • 12 Oct 2021 • Eric Guérin, Killian Oechslin, Christian Wolf, Benoît Martinez
In this paper, we propose a method for the automatic semantic segmentation of satellite images into six classes (sparse forest, dense forest, moor, herbaceous formation, building, and road).
1 code implementation • 24 Sep 2021 • Theo Jaunet, Guillaume Bono, Romain Vuillemot, Christian Wolf
The Robotics community has started to heavily rely on increasingly realistic 3D simulators for large-scale training of robots on massive amounts of data.
no code implementations • 29 Jul 2021 • Jörg Stork, Philip Wenzel, Severin Landwein, Maria-Elena Algorri, Martin Zaefferer, Wolfgang Kusch, Martin Staubach, Thomas Bartz-Beielstein, Hartmut Köhn, Hermann Dejager, Christian Wolf
We have built a novel system for the surveillance of drinking water reservoirs using underwater sensor networks.
2 code implementations • 13 Jul 2021 • Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf
In the context of visual navigation, the capacity to map a novel environment is necessary for an agent to exploit its observation history in the considered place and efficiently reach known goals.
no code implementations • 22 Jun 2021 • Boris Chidlovskii, Assem Sadek, Christian Wolf
We address the problem of universal domain adaptation (UDA) in ordinal regression (OR), which attempts to solve classification problems in which labels are not independent, but follow a natural order.
no code implementations • NeurIPS 2021 • Corentin Kervadec, Christian Wolf, Grigory Antipov, Moez Baccouche, Madiha Nadri
Methods for Visual Question Anwering (VQA) are notorious for leveraging dataset biases rather than performing reasoning, hindering generalization.
no code implementations • CVPR 2021 • Corentin Kervadec, Theo Jaunet, Grigory Antipov, Moez Baccouche, Romain Vuillemot, Christian Wolf
Since its inception, Visual Question Answering (VQA) is notoriously known as a task, where models are prone to exploit biases in datasets to find shortcuts instead of performing high-level reasoning.
1 code implementation • 2 Apr 2021 • Theo Jaunet, Corentin Kervadec, Romain Vuillemot, Grigory Antipov, Moez Baccouche, Christian Wolf
First, as a result of a collaboration of three fields, machine learning, vision and language reasoning, and data analytics, the work lead to a better understanding of bias exploitation of neural models for VQA, which eventually resulted in an impact on its design and training through the proposition of a method for the transfer of reasoning patterns from an oracle model.
1 code implementation • 23 Mar 2021 • Steeven Janny, Vincent Andrieu, Madiha Nadri, Christian Wolf
Building on this formulation and problem definition, we propose a predictor structure based on the Kazantzis-Kravaris/Luenberger (KKL) observer and we show that KKL fits well into our general framework.
1 code implementation • CVPR 2021 • Brendan Duke, Abdalla Ahmed, Christian Wolf, Parham Aarabi, Graham W. Taylor
SST extracts per-pixel representations for each object in a video using sparse attention over spatiotemporal features.
1 code implementation • ECCV 2020 • Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf
We train an agent to navigate in 3D environments using a hierarchical strategy including a high-level graph based planner and a local policy.
no code implementations • 10 Jun 2020 • Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf
Since its appearance, Visual Question Answering (VQA, i. e. answering a question posed over an image), has always been treated as a classification problem over a set of predefined answers.
1 code implementation • CVPR 2021 • Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf
Models for Visual Question Answering (VQA) are notorious for their tendency to rely on dataset biases, as the large and unbalanced diversity of questions and concepts involved and tends to prevent models from learning to reason, leading them to perform educated guesses instead.
no code implementations • 24 Jan 2020 • Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin
The EgoMap architecture incorporates several inductive biases including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map.
no code implementations • 6 Dec 2019 • Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf
The large adoption of the self-attention (i. e. transformer model) and BERT-like training principles has recently resulted in a number of high performing models on a large panoply of vision-and-language problems (such as Visual Question Answering (VQA), image retrieval, etc.).
1 code implementation • ICLR 2020 • Fabien Baradel, Natalia Neverova, Julien Mille, Greg Mori, Christian Wolf
Understanding causes and effects in mechanical systems is an essential component of reasoning in the physical world.
1 code implementation • 6 Sep 2019 • Theo Jaunet, Romain Vuillemot, Christian Wolf
We also report on experts evaluation using DRLViz, and applicability of DRLViz to other scenarios and navigation problems beyond simulation games, as well as its contribution to black box models interpretability and explainability in the field of visual analytics.
1 code implementation • 14 Jun 2019 • Anshul Paigwar, Özgür Erkent, Christian Wolf, Christian Laugier
In this study, we propose Attentional Point- Net, which is a novel end-to-end trainable deep architecture for object detection in point clouds.
no code implementations • 16 Apr 2019 • Quentin Debard, Jilles Steeve Dibangoye, Stéphane Canu, Christian Wolf
The second agent acts as the interaction protocol, interpreting and translating to 3D operations the 2D finger trajectories from the first agent.
1 code implementation • 3 Apr 2019 • Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin
In this paper we argue that research on training agents capable of complex reasoning can be simplified by decoupling from the requirement of high fidelity photographic observations.
1 code implementation • ECCV 2018 • Fabien Baradel, Natalia Neverova, Christian Wolf, Julien Mille, Greg Mori
Human activity recognition is typically addressed by detecting key concepts like global and local motion, features related to object classes present in the scene, as well as features related to the global context.
Ranked #1 on Semantic Object Interaction Classification on VLOG
1 code implementation • CVPR 2018 • Fabien Baradel, Christian Wolf, Julien Mille, Graham W. Taylor
No spatial coherence is forced on the glimpse locations, which gives the module liberty to explore different points at each frame and better optimize the process of scrutinizing visual information.
Ranked #21 on Skeleton Based Action Recognition on N-UCLA
1 code implementation • 19 Feb 2018 • Quentin Debard, Christian Wolf, Stéphane Canu, Julien Arné
We propose a fully automatic method for learning gestures on big touch devices in a potentially multi-user context.
no code implementations • 20 Dec 2017 • Fabien Baradel, Christian Wolf, Julien Mille
We propose a new spatio-temporal attention based mechanism for human action recognition able to automatically attend to the hands most involved into the studied action and detect the most discriminative moments in an action.
no code implementations • 25 Sep 2017 • Emre Dogan, Gonen Eren, Christian Wolf, Eric Lombardi, Atilla Baskurt
We propose a new method for human pose estimation which leverages information from multiple views to impose a strong prior on articulated pose.
1 code implementation • 25 Jul 2017 • Damien Fourure, Rémi Emonet, Elisa Fromont, Damien Muselet, Alain Tremeau, Christian Wolf
However, for semantic image segmentation, where the task consists in providing a semantic class to each pixel of an image, feature maps reduction is harmful because it leads to a resolution loss in the output prediction.
1 code implementation • 20 Jul 2017 • Shahab Joudaki, Chris Blake, Andrew Johnson, Alexandra Amon, Marika Asgari, Ami Choi, Thomas Erben, Karl Glazebrook, Joachim Harnois-Deraps, Catherine Heymans, Hendrik Hildebrandt, Henk Hoekstra, Dominik Klaes, Konrad Kuijken, Chris Lidman, Alexander Mead, Lance Miller, David Parkinson, Gregory B. Poole, Peter Schneider, Massimo Viola, Christian Wolf
The complementarity of our observables allows for constraints on modified gravity degrees of freedom that are not simultaneously bounded with either probe alone, and up to a factor of three improvement in the $S_8$ constraint in the extended cosmology compared to KiDS alone.
Cosmology and Nongalactic Astrophysics
no code implementations • 27 Apr 2017 • Bastien Moysset, Christopher Kermorvant, Christian Wolf
Text line detection and localization is a crucial step for full page document analysis, but still suffers from heterogeneity of real life documents.
no code implementations • 29 Mar 2017 • Fabien Baradel, Christian Wolf, Julien Mille
We show that it is of high interest to shift the attention to different hands at different time steps depending on the activity itself.
no code implementations • 17 Nov 2016 • Bastien Moysset, Christoper Kermorvant, Christian Wolf
The current trend in object detection and localization is to learn predictions with high capacity deep neural networks trained on a very large amount of annotated data and using a high amount of processing power.
no code implementations • 20 Nov 2015 • Natalia Neverova, Christian Wolf, Florian Nebout, Graham Taylor
We propose a method for hand pose estimation based on a deep regressor trained on two different kinds of input.
no code implementations • 12 Nov 2015 • Natalia Neverova, Christian Wolf, Griffin Lacey, Lex Fridman, Deepak Chandra, Brandon Barbello, Graham Taylor
We present a large-scale study exploring the capability of temporal deep neural networks to interpret natural human kinematics and introduce the first method for active biometric authentication with mobile inertial sensors.
no code implementations • 4 May 2015 • Eric Lombardi, Christian Wolf, Oya Celiktutan, Bülent Sankur
In this paper, we propose a method for activity recognition from videos based on sparse local features and hypergraph matching.
no code implementations • 31 Dec 2014 • Natalia Neverova, Christian Wolf, Graham W. Taylor, Florian Nebout
We present a method for gesture detection and localisation based on multi-scale and multi-modal deep learning.