Search Results for author: Christian Wolf

Found 49 papers, 21 papers with code

Zero-BEV: Zero-shot Projection of Any First-Person Modality to BEV Maps

no code implementations21 Feb 2024 Gianluca Monaci, Leonid Antsfeld, Boris Chidlovskii, Christian Wolf

Bird's-eye view (BEV) maps are an important geometrically structured representation widely used in robotics, in particular self-driving vehicles and terrestrial robots.

Monocular Depth Estimation Semantic Segmentation

Task-conditioned adaptation of visual features in multi-task policy learning

no code implementations CVPR 2024 Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

We evaluate the method on a wide variety of tasks from the CortexBench benchmark and show that, compared to existing work, it can be addressed with a single policy.

Decision Making

Learning to navigate efficiently and precisely in real environments

no code implementations CVPR 2024 Guillaume Bono, Hervé Poirier, Leonid Antsfeld, Gianluca Monaci, Boris Chidlovskii, Christian Wolf

In the context of autonomous navigation of terrestrial robots, the creation of realistic models for agent dynamics and sensing is a widespread habit in the robotics literature and in commercial applications, where they are used for model based control and/or for localization and mapping.

Autonomous Navigation Navigate

Multi-Object Navigation in real environments using hybrid policies

no code implementations24 Jan 2024 Assem Sadek, Guillaume Bono, Boris Chidlovskii, Atilla Baskurt, Christian Wolf

More recently, beyond waypoint planning, problems involving significant components of (visual) high-level reasoning have been explored in simulated environments, mostly addressed with large-scale machine learning, in particular RL, offline-RL or imitation learning.

Imitation Learning Object +1

End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon

no code implementations28 Sep 2023 Guillaume Bono, Leonid Antsfeld, Boris Chidlovskii, Philippe Weinzaepfel, Christian Wolf

The main challenge lies in learning compact representations generalizable to unseen environments and in learning high-capacity perception modules capable of reasoning on high-dimensional input.

Pose Estimation Visual Navigation

Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction

no code implementations6 Jun 2023 Guillaume Bono, Leonid Antsfeld, Assem Sadek, Gianluca Monaci, Christian Wolf

Agents navigating in 3D environments require some form of memory, which should hold a compact and actionable representation of the history of observations useful for decision taking and planning.

Navigate

AutoNeRF: Training Implicit Scene Representations with Autonomous Agents

1 code implementation21 Apr 2023 Pierre Marza, Laetitia Matignon, Olivier Simonin, Dhruv Batra, Christian Wolf, Devendra Singh Chaplot

Empirical results show that NeRFs can be trained on actively collected data using just a single episode of experience in an unseen environment, and can be used for several downstream robotic tasks, and that modular trained exploration models outperform other classical and end-to-end baselines.

Novel View Synthesis

Eagle: Large-Scale Learning of Turbulent Fluid Dynamics with Mesh Transformers

1 code implementation16 Feb 2023 Steeven Janny, Aurélien Béneteau, Madiha Nadri, Julie Digne, Nicolas Thome, Christian Wolf

To perform future forecasting of pressure and velocity on the challenging EAGLE dataset, we introduce a new mesh transformer.

Node Clustering

Learning Reduced Nonlinear State-Space Models: an Output-Error Based Canonical Approach

no code implementations19 Apr 2022 Steeven Janny, Quentin Possamai, Laurent Bako, Madiha Nadri, Christian Wolf

The identification of a nonlinear dynamic model is an open topic in control theory, especially from sparse input-output measurements.

State Space Models

An experimental study of the vision-bottleneck in VQA

no code implementations14 Feb 2022 Pierre Marza, Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

We also study the impact of two methods to incorporate the information about objects necessary for answering a question, in the reasoning module directly, and earlier in the object selection stage.

Object Question Answering +2

Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel Space

no code implementations ICLR 2022 Steeven Janny, Fabien Baradel, Natalia Neverova, Madiha Nadri, Greg Mori, Christian Wolf

Learning causal relationships in high-dimensional data (images, videos) is a hard task, as they are often defined on low dimensional manifolds and must be extracted from complex signals dominated by appearance, lighting, textures and also spurious correlations in the data.

counterfactual Counterfactual Reasoning +1

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

no code implementations22 Dec 2021 Edward Beeching, Maxim Peter, Philippe Marcotte, Jilles Debangoye, Olivier Simonin, Joshua Romoff, Christian Wolf

We address planning and navigation in challenging 3D video games featuring maps with disconnected regions reachable by agents using special actions.

Deep Reinforcement Learning reinforcement-learning +1

Godot Reinforcement Learning Agents

1 code implementation7 Dec 2021 Edward Beeching, Jilles Debangoye, Olivier Simonin, Christian Wolf

We present Godot Reinforcement Learning (RL) Agents, an open-source interface for developing environments and agents in the Godot Game Engine.

reinforcement-learning Reinforcement Learning +2

An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments

no code implementations29 Nov 2021 Assem Sadek, Guillaume Bono, Boris Chidlovskii, Christian Wolf

In this work we present an in-depth study of the performance and reasoning capacities of real physical agents, trained in simulation and deployed to two different physical environments.

Benchmarking Visual Navigation +1

Satellite Image Semantic Segmentation

1 code implementation12 Oct 2021 Eric Guérin, Killian Oechslin, Christian Wolf, Benoît Martinez

In this paper, we propose a method for the automatic semantic segmentation of satellite images into six classes (sparse forest, dense forest, moor, herbaceous formation, building, and road).

2D Semantic Segmentation Segmentation +1

SIM2REALVIZ: Visualizing the Sim2Real Gap in Robot Ego-Pose Estimation

1 code implementation24 Sep 2021 Theo Jaunet, Guillaume Bono, Romain Vuillemot, Christian Wolf

The Robotics community has started to heavily rely on increasingly realistic 3D simulators for large-scale training of robots on massive amounts of data.

Pose Estimation

Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation

2 code implementations13 Jul 2021 Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

In the context of visual navigation, the capacity to map a novel environment is necessary for an agent to exploit its observation history in the considered place and efficiently reach known goals.

Reinforcement Learning (RL) Spatial Reasoning +1

Universal Domain Adaptation in Ordinal Regression

no code implementations22 Jun 2021 Boris Chidlovskii, Assem Sadek, Christian Wolf

We address the problem of universal domain adaptation (UDA) in ordinal regression (OR), which attempts to solve classification problems in which labels are not independent, but follow a natural order.

Age Estimation Clustering +2

Supervising the Transfer of Reasoning Patterns in VQA

no code implementations NeurIPS 2021 Corentin Kervadec, Christian Wolf, Grigory Antipov, Moez Baccouche, Madiha Nadri

Methods for Visual Question Anwering (VQA) are notorious for leveraging dataset biases rather than performing reasoning, hindering generalization.

PAC learning Transfer Learning +1

How Transferable are Reasoning Patterns in VQA?

no code implementations CVPR 2021 Corentin Kervadec, Theo Jaunet, Grigory Antipov, Moez Baccouche, Romain Vuillemot, Christian Wolf

Since its inception, Visual Question Answering (VQA) is notoriously known as a task, where models are prone to exploit biases in datasets to find shortcuts instead of performing high-level reasoning.

Question Answering Visual Question Answering

VisQA: X-raying Vision and Language Reasoning in Transformers

1 code implementation2 Apr 2021 Theo Jaunet, Corentin Kervadec, Romain Vuillemot, Grigory Antipov, Moez Baccouche, Christian Wolf

First, as a result of a collaboration of three fields, machine learning, vision and language reasoning, and data analytics, the work lead to a better understanding of bias exploitation of neural models for VQA, which eventually resulted in an impact on its design and training through the proposition of a method for the transfer of reasoning patterns from an oracle model.

Question Answering Visual Question Answering

Deep KKL: Data-driven Output Prediction for Non-Linear Systems

1 code implementation23 Mar 2021 Steeven Janny, Vincent Andrieu, Madiha Nadri, Christian Wolf

Building on this formulation and problem definition, we propose a predictor structure based on the Kazantzis-Kravaris/Luenberger (KKL) observer and we show that KKL fits well into our general framework.

Learning to plan with uncertain topological maps

1 code implementation ECCV 2020 Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf

We train an agent to navigate in 3D environments using a hierarchical strategy including a high-level graph based planner and a local policy.

Inductive Bias Navigate

Estimating semantic structure for the VQA answer space

no code implementations10 Jun 2020 Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

Since its appearance, Visual Question Answering (VQA, i. e. answering a question posed over an image), has always been treated as a classification problem over a set of predefined answers.

General Classification Question Answering +1

Roses Are Red, Violets Are Blue... but Should Vqa Expect Them To?

1 code implementation CVPR 2021 Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

Models for Visual Question Answering (VQA) are notorious for their tendency to rely on dataset biases, as the large and unbalanced diversity of questions and concepts involved and tends to prevent models from learning to reason, leading them to perform educated guesses instead.

Question Answering Visual Question Answering

EgoMap: Projective mapping and structured egocentric memory for Deep RL

no code implementations24 Jan 2020 Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

The EgoMap architecture incorporates several inductive biases including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map.

Deep Reinforcement Learning Memorization +2

Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks

no code implementations6 Dec 2019 Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

The large adoption of the self-attention (i. e. transformer model) and BERT-like training principles has recently resulted in a number of high performing models on a large panoply of vision-and-language problems (such as Visual Question Answering (VQA), image retrieval, etc.).

Image Retrieval Inductive Bias +4

CoPhy: Counterfactual Learning of Physical Dynamics

1 code implementation ICLR 2020 Fabien Baradel, Natalia Neverova, Julien Mille, Greg Mori, Christian Wolf

Understanding causes and effects in mechanical systems is an essential component of reasoning in the physical world.

counterfactual Video Prediction

DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning

1 code implementation6 Sep 2019 Theo Jaunet, Romain Vuillemot, Christian Wolf

We also report on experts evaluation using DRLViz, and applicability of DRLViz to other scenarios and navigation problems beyond simulation games, as well as its contribution to black box models interpretability and explainability in the field of visual analytics.

Deep Reinforcement Learning reinforcement-learning +1

Attentional PointNet for 3D-Object Detection in Point Clouds

1 code implementation14 Jun 2019 Anshul Paigwar, Özgür Erkent, Christian Wolf, Christian Laugier

In this study, we propose Attentional Point- Net, which is a novel end-to-end trainable deep architecture for object detection in point clouds.

3D Object Detection Autonomous Navigation +2

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer

1 code implementation3 Apr 2019 Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

In this paper we argue that research on training agents capable of complex reasoning can be simplified by decoupling from the requirement of high fidelity photographic observations.

Deep Reinforcement Learning Reinforcement Learning (RL) +1

Object Level Visual Reasoning in Videos

1 code implementation ECCV 2018 Fabien Baradel, Natalia Neverova, Christian Wolf, Julien Mille, Greg Mori

Human activity recognition is typically addressed by detecting key concepts like global and local motion, features related to object classes present in the scene, as well as features related to the global context.

Human Activity Recognition Object +3

Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points

1 code implementation CVPR 2018 Fabien Baradel, Christian Wolf, Julien Mille, Graham W. Taylor

No spatial coherence is forced on the glimpse locations, which gives the module liberty to explore different points at each frame and better optimize the process of scrutinizing visual information.

Action Recognition Activity Prediction +3

Learning to recognize touch gestures: recurrent vs. convolutional features and dynamic sampling

1 code implementation19 Feb 2018 Quentin Debard, Christian Wolf, Stéphane Canu, Julien Arné

We propose a fully automatic method for learning gestures on big touch devices in a potentially multi-user context.

Gesture Recognition

Human Action Recognition: Pose-based Attention draws focus to Hands

no code implementations20 Dec 2017 Fabien Baradel, Christian Wolf, Julien Mille

We propose a new spatio-temporal attention based mechanism for human action recognition able to automatically attend to the hands most involved into the studied action and detect the most discriminative moments in an action.

Action Recognition Temporal Action Localization

Multi-view pose estimation with mixtures-of-parts and adaptive viewpoint selection

no code implementations25 Sep 2017 Emre Dogan, Gonen Eren, Christian Wolf, Eric Lombardi, Atilla Baskurt

We propose a new method for human pose estimation which leverages information from multiple views to impose a strong prior on articulated pose.

Pose Estimation

Residual Conv-Deconv Grid Network for Semantic Segmentation

1 code implementation25 Jul 2017 Damien Fourure, Rémi Emonet, Elisa Fromont, Damien Muselet, Alain Tremeau, Christian Wolf

However, for semantic image segmentation, where the task consists in providing a semantic class to each pixel of an image, feature maps reduction is harmful because it leads to a resolution loss in the output prediction.

Image Segmentation Segmentation +1

KiDS-450 + 2dFLenS: Cosmological parameter constraints from weak gravitational lensing tomography and overlapping redshift-space galaxy clustering

1 code implementation20 Jul 2017 Shahab Joudaki, Chris Blake, Andrew Johnson, Alexandra Amon, Marika Asgari, Ami Choi, Thomas Erben, Karl Glazebrook, Joachim Harnois-Deraps, Catherine Heymans, Hendrik Hildebrandt, Henk Hoekstra, Dominik Klaes, Konrad Kuijken, Chris Lidman, Alexander Mead, Lance Miller, David Parkinson, Gregory B. Poole, Peter Schneider, Massimo Viola, Christian Wolf

The complementarity of our observables allows for constraints on modified gravity degrees of freedom that are not simultaneously bounded with either probe alone, and up to a factor of three improvement in the $S_8$ constraint in the extended cosmology compared to KiDS alone.

Cosmology and Nongalactic Astrophysics

Full-Page Text Recognition: Learning Where to Start and When to Stop

no code implementations27 Apr 2017 Bastien Moysset, Christopher Kermorvant, Christian Wolf

Text line detection and localization is a crucial step for full page document analysis, but still suffers from heterogeneity of real life documents.

Line Detection Position

Pose-conditioned Spatio-Temporal Attention for Human Action Recognition

no code implementations29 Mar 2017 Fabien Baradel, Christian Wolf, Julien Mille

We show that it is of high interest to shift the attention to different hands at different time steps depending on the activity itself.

Action Recognition Human Activity Recognition +1

Learning to detect and localize many objects from few examples

no code implementations17 Nov 2016 Bastien Moysset, Christoper Kermorvant, Christian Wolf

The current trend in object detection and localization is to learn predictions with high capacity deep neural networks trained on a very large amount of annotated data and using a high amount of processing power.

object-detection Object Detection

Learning Human Identity from Motion Patterns

no code implementations12 Nov 2015 Natalia Neverova, Christian Wolf, Griffin Lacey, Lex Fridman, Deepak Chandra, Brandon Barbello, Graham Taylor

We present a large-scale study exploring the capability of temporal deep neural networks to interpret natural human kinematics and introduce the first method for active biometric authentication with mobile inertial sensors.

Activity recognition from videos with parallel hypergraph matching on GPUs

no code implementations4 May 2015 Eric Lombardi, Christian Wolf, Oya Celiktutan, Bülent Sankur

In this paper, we propose a method for activity recognition from videos based on sparse local features and hypergraph matching.

Activity Recognition Graph Matching +2

ModDrop: adaptive multi-modal gesture recognition

no code implementations31 Dec 2014 Natalia Neverova, Christian Wolf, Graham W. Taylor, Florian Nebout

We present a method for gesture detection and localisation based on multi-scale and multi-modal deep learning.

Gesture Recognition

Cannot find the paper you are looking for? You can Submit a new open access paper.