Search Results for author: Christian Wolf

Found 49 papers, 20 papers with code

Zero-BEV: Zero-shot Projection of Any First-Person Modality to BEV Maps

no code implementations • 21 Feb 2024 • Gianluca Monaci, Leonid Antsfeld, Boris Chidlovskii, Christian Wolf

Bird's-eye view (BEV) maps are an important geometrically structured representation widely used in robotics, in particular self-driving vehicles and terrestrial robots.

Monocular Depth Estimation Semantic Segmentation

Paper
Add Code

Task-conditioned adaptation of visual features in multi-task policy learning

no code implementations • 12 Feb 2024 • Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

We evaluate the method on a wide variety of tasks from the CortexBench benchmark and show that, compared to existing work, it can be addressed with a single policy.

Decision Making

Paper
Add Code

Learning to navigate efficiently and precisely in real environments

no code implementations • 25 Jan 2024 • Guillaume Bono, Hervé Poirier, Leonid Antsfeld, Gianluca Monaci, Boris Chidlovskii, Christian Wolf

In the context of autonomous navigation of terrestrial robots, the creation of realistic models for agent dynamics and sensing is a widespread habit in the robotics literature and in commercial applications, where they are used for model based control and/or for localization and mapping.

Autonomous Navigation Navigate

Paper
Add Code

Multi-Object Navigation in real environments using hybrid policies

no code implementations • 24 Jan 2024 • Assem Sadek, Guillaume Bono, Boris Chidlovskii, Atilla Baskurt, Christian Wolf

More recently, beyond waypoint planning, problems involving significant components of (visual) high-level reasoning have been explored in simulated environments, mostly addressed with large-scale machine learning, in particular RL, offline-RL or imitation learning.

Imitation Learning Object +1

Paper
Add Code

End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon

no code implementations • 28 Sep 2023 • Guillaume Bono, Leonid Antsfeld, Boris Chidlovskii, Philippe Weinzaepfel, Christian Wolf

The main challenge lies in learning compact representations generalizable to unseen environments and in learning high-capacity perception modules capable of reasoning on high-dimensional input.

Pose Estimation Visual Navigation

Paper
Add Code

Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction

no code implementations • 6 Jun 2023 • Guillaume Bono, Leonid Antsfeld, Assem Sadek, Gianluca Monaci, Christian Wolf

Agents navigating in 3D environments require some form of memory, which should hold a compact and actionable representation of the history of observations useful for decision taking and planning.

Navigate

Paper
Add Code

AutoNeRF: Training Implicit Scene Representations with Autonomous Agents

1 code implementation • 21 Apr 2023 • Pierre Marza, Laetitia Matignon, Olivier Simonin, Dhruv Batra, Christian Wolf, Devendra Singh Chaplot

Empirical results show that NeRFs can be trained on actively collected data using just a single episode of experience in an unseen environment, and can be used for several downstream robotic tasks, and that modular trained exploration models outperform other classical and end-to-end baselines.

Novel View Synthesis

Paper
Code

Eagle: Large-Scale Learning of Turbulent Fluid Dynamics with Mesh Transformers

no code implementations • 16 Feb 2023 • Steeven Janny, Aurélien Béneteau, Madiha Nadri, Julie Digne, Nicolas Thome, Christian Wolf

To perform future forecasting of pressure and velocity on the challenging EAGLE dataset, we introduce a new mesh transformer.

Node Clustering

Paper
Add Code

Multi-Object Navigation with dynamically learned neural implicit representations

1 code implementation • ICCV 2023 • Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

Understanding and mapping a new environment are core abilities of any autonomously navigating agent.

Object Reinforcement Learning (RL)

Paper
Code

Learning Reduced Nonlinear State-Space Models: an Output-Error Based Canonical Approach

no code implementations • 19 Apr 2022 • Steeven Janny, Quentin Possamai, Laurent Bako, Madiha Nadri, Christian Wolf

The identification of a nonlinear dynamic model is an open topic in control theory, especially from sparse input-output measurements.

Paper
Add Code

An experimental study of the vision-bottleneck in VQA

no code implementations • 14 Feb 2022 • Pierre Marza, Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

We also study the impact of two methods to incorporate the information about objects necessary for answering a question, in the reasoning module directly, and earlier in the object selection stage.

Object Question Answering +2

Paper
Add Code

Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel Space

no code implementations • ICLR 2022 • Steeven Janny, Fabien Baradel, Natalia Neverova, Madiha Nadri, Greg Mori, Christian Wolf

Learning causal relationships in high-dimensional data (images, videos) is a hard task, as they are often defined on low dimensional manifolds and must be extracted from complex signals dominated by appearance, lighting, textures and also spurious correlations in the data.

counterfactual Counterfactual Reasoning +1

Paper
Add Code

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

no code implementations • 22 Dec 2021 • Edward Beeching, Maxim Peter, Philippe Marcotte, Jilles Debangoye, Olivier Simonin, Joshua Romoff, Christian Wolf

We address planning and navigation in challenging 3D video games featuring maps with disconnected regions reachable by agents using special actions.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Godot Reinforcement Learning Agents

1 code implementation • 7 Dec 2021 • Edward Beeching, Jilles Debangoye, Olivier Simonin, Christian Wolf

We present Godot Reinforcement Learning (RL) Agents, an open-source interface for developing environments and agents in the Godot Game Engine.

reinforcement-learning Reinforcement Learning (RL)

747

Paper
Code

An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments

no code implementations • 29 Nov 2021 • Assem Sadek, Guillaume Bono, Boris Chidlovskii, Christian Wolf

In this work we present an in-depth study of the performance and reasoning capacities of real physical agents, trained in simulation and deployed to two different physical environments.

Benchmarking Visual Navigation +1

Paper
Add Code

Satellite Image Semantic Segmentation

1 code implementation • 12 Oct 2021 • Eric Guérin, Killian Oechslin, Christian Wolf, Benoît Martinez

In this paper, we propose a method for the automatic semantic segmentation of satellite images into six classes (sparse forest, dense forest, moor, herbaceous formation, building, and road).

2D Semantic Segmentation Segmentation +1

Paper
Code

SIM2REALVIZ: Visualizing the Sim2Real Gap in Robot Ego-Pose Estimation

1 code implementation • 24 Sep 2021 • Theo Jaunet, Guillaume Bono, Romain Vuillemot, Christian Wolf

The Robotics community has started to heavily rely on increasingly realistic 3D simulators for large-scale training of robots on massive amounts of data.

Pose Estimation

Paper
Code

Underwater Acoustic Networks for Security Risk Assessment in Public Drinking Water Reservoirs

no code implementations • 29 Jul 2021 • Jörg Stork, Philip Wenzel, Severin Landwein, Maria-Elena Algorri, Martin Zaefferer, Wolfgang Kusch, Martin Staubach, Thomas Bartz-Beielstein, Hartmut Köhn, Hermann Dejager, Christian Wolf

We have built a novel system for the surveillance of drinking water reservoirs using underwater sensor networks.

Anomaly Detection

Paper
Add Code

Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation

2 code implementations • 13 Jul 2021 • Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

In the context of visual navigation, the capacity to map a novel environment is necessary for an agent to exploit its observation history in the considered place and efficiently reach known goals.

Reinforcement Learning (RL) Visual Navigation

Paper
Code

Universal Domain Adaptation in Ordinal Regression

no code implementations • 22 Jun 2021 • Boris Chidlovskii, Assem Sadek, Christian Wolf

We address the problem of universal domain adaptation (UDA) in ordinal regression (OR), which attempts to solve classification problems in which labels are not independent, but follow a natural order.

Age Estimation Clustering +2

Paper
Add Code

Supervising the Transfer of Reasoning Patterns in VQA

no code implementations • NeurIPS 2021 • Corentin Kervadec, Christian Wolf, Grigory Antipov, Moez Baccouche, Madiha Nadri

Methods for Visual Question Anwering (VQA) are notorious for leveraging dataset biases rather than performing reasoning, hindering generalization.

PAC learning Transfer Learning +1

Paper
Add Code

How Transferable are Reasoning Patterns in VQA?

no code implementations • CVPR 2021 • Corentin Kervadec, Theo Jaunet, Grigory Antipov, Moez Baccouche, Romain Vuillemot, Christian Wolf

Since its inception, Visual Question Answering (VQA) is notoriously known as a task, where models are prone to exploit biases in datasets to find shortcuts instead of performing high-level reasoning.

Question Answering Visual Question Answering

Paper
Add Code

VisQA: X-raying Vision and Language Reasoning in Transformers

1 code implementation • 2 Apr 2021 • Theo Jaunet, Corentin Kervadec, Romain Vuillemot, Grigory Antipov, Moez Baccouche, Christian Wolf

First, as a result of a collaboration of three fields, machine learning, vision and language reasoning, and data analytics, the work lead to a better understanding of bias exploitation of neural models for VQA, which eventually resulted in an impact on its design and training through the proposition of a method for the transfer of reasoning patterns from an oracle model.

Question Answering Visual Question Answering

Paper
Code

Deep KKL: Data-driven Output Prediction for Non-Linear Systems

1 code implementation • 23 Mar 2021 • Steeven Janny, Vincent Andrieu, Madiha Nadri, Christian Wolf

Building on this formulation and problem definition, we propose a predictor structure based on the Kazantzis-Kravaris/Luenberger (KKL) observer and we show that KKL fits well into our general framework.

Paper
Code

SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation

1 code implementation • CVPR 2021 • Brendan Duke, Abdalla Ahmed, Christian Wolf, Parham Aarabi, Graham W. Taylor

SST extracts per-pixel representations for each object in a video using sparse attention over spatiotemporal features.

Ranked #5 on Semi-Supervised Video Object Segmentation on DAVIS (no YouTube-VOS training)

Inductive Bias Motion Segmentation +6

Paper
Code

Learning to plan with uncertain topological maps

1 code implementation • ECCV 2020 • Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf

We train an agent to navigate in 3D environments using a hierarchical strategy including a high-level graph based planner and a local policy.

Inductive Bias Navigate

Paper
Code

Estimating semantic structure for the VQA answer space

no code implementations • 10 Jun 2020 • Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

Since its appearance, Visual Question Answering (VQA, i. e. answering a question posed over an image), has always been treated as a classification problem over a set of predefined answers.

General Classification Question Answering +1

Paper
Add Code

Roses Are Red, Violets Are Blue... but Should Vqa Expect Them To?

1 code implementation • CVPR 2021 • Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

Models for Visual Question Answering (VQA) are notorious for their tendency to rely on dataset biases, as the large and unbalanced diversity of questions and concepts involved and tends to prevent models from learning to reason, leading them to perform educated guesses instead.

Question Answering Visual Question Answering

Paper
Code

EgoMap: Projective mapping and structured egocentric memory for Deep RL

no code implementations • 24 Jan 2020 • Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

The EgoMap architecture incorporates several inductive biases including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map.

Memorization reinforcement-learning +1

Paper
Add Code

Weak Supervision helps Emergence of Word-Object Alignment and improves Vision-Language Tasks

no code implementations • 6 Dec 2019 • Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

The large adoption of the self-attention (i. e. transformer model) and BERT-like training principles has recently resulted in a number of high performing models on a large panoply of vision-and-language problems (such as Visual Question Answering (VQA), image retrieval, etc.).

Image Retrieval Inductive Bias +4

Paper
Add Code

CoPhy: Counterfactual Learning of Physical Dynamics

1 code implementation • ICLR 2020 • Fabien Baradel, Natalia Neverova, Julien Mille, Greg Mori, Christian Wolf

Understanding causes and effects in mechanical systems is an essential component of reasoning in the physical world.

counterfactual Video Prediction

Paper
Code

DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning

1 code implementation • 6 Sep 2019 • Theo Jaunet, Romain Vuillemot, Christian Wolf

We also report on experts evaluation using DRLViz, and applicability of DRLViz to other scenarios and navigation problems beyond simulation games, as well as its contribution to black box models interpretability and explainability in the field of visual analytics.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Attentional PointNet for 3D-Object Detection in Point Clouds

1 code implementation • 14 Jun 2019 • Anshul Paigwar, Özgür Erkent, Christian Wolf, Christian Laugier

In this study, we propose Attentional Point- Net, which is a novel end-to-end trainable deep architecture for object detection in point clouds.

3D Object Detection Autonomous Navigation +2

Paper
Code

Learning 3D Navigation Protocols on Touch Interfaces with Cooperative Multi-Agent Reinforcement Learning

no code implementations • 16 Apr 2019 • Quentin Debard, Jilles Steeve Dibangoye, Stéphane Canu, Christian Wolf

The second agent acts as the interaction protocol, interpreting and translating to 3D operations the 2D finger trajectories from the first agent.

Multi-agent Reinforcement Learning Navigate +3

Paper
Add Code

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer

1 code implementation • 3 Apr 2019 • Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

In this paper we argue that research on training agents capable of complex reasoning can be simplified by decoupling from the requirement of high fidelity photographic observations.

Reinforcement Learning (RL) Scene Understanding

Paper
Code

Object Level Visual Reasoning in Videos

1 code implementation • ECCV 2018 • Fabien Baradel, Natalia Neverova, Christian Wolf, Julien Mille, Greg Mori

Human activity recognition is typically addressed by detecting key concepts like global and local motion, features related to object classes present in the scene, as well as features related to the global context.

Ranked #1 on Semantic Object Interaction Classification on VLOG

Human Activity Recognition Object +3

172

Paper
Code

Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points

1 code implementation • CVPR 2018 • Fabien Baradel, Christian Wolf, Julien Mille, Graham W. Taylor

No spatial coherence is forced on the glimpse locations, which gives the module liberty to explore different points at each frame and better optimize the process of scrutinizing visual information.

Ranked #19 on Skeleton Based Action Recognition on N-UCLA

Action Recognition Activity Prediction +3

Paper
Code

Learning to recognize touch gestures: recurrent vs. convolutional features and dynamic sampling

1 code implementation • 19 Feb 2018 • Quentin Debard, Christian Wolf, Stéphane Canu, Julien Arné

We propose a fully automatic method for learning gestures on big touch devices in a potentially multi-user context.

Gesture Recognition

Paper
Code

Human Action Recognition: Pose-based Attention draws focus to Hands

no code implementations • 20 Dec 2017 • Fabien Baradel, Christian Wolf, Julien Mille

We propose a new spatio-temporal attention based mechanism for human action recognition able to automatically attend to the hands most involved into the studied action and detect the most discriminative moments in an action.

Action Recognition Temporal Action Localization

Paper
Add Code

Multi-view pose estimation with mixtures-of-parts and adaptive viewpoint selection

no code implementations • 25 Sep 2017 • Emre Dogan, Gonen Eren, Christian Wolf, Eric Lombardi, Atilla Baskurt

We propose a new method for human pose estimation which leverages information from multiple views to impose a strong prior on articulated pose.

Pose Estimation

Paper
Add Code

Residual Conv-Deconv Grid Network for Semantic Segmentation

1 code implementation • 25 Jul 2017 • Damien Fourure, Rémi Emonet, Elisa Fromont, Damien Muselet, Alain Tremeau, Christian Wolf

However, for semantic image segmentation, where the task consists in providing a semantic class to each pixel of an image, feature maps reduction is harmful because it leads to a resolution loss in the output prediction.

Image Segmentation Segmentation +1

Paper
Code

KiDS-450 + 2dFLenS: Cosmological parameter constraints from weak gravitational lensing tomography and overlapping redshift-space galaxy clustering

1 code implementation • 20 Jul 2017 • Shahab Joudaki, Chris Blake, Andrew Johnson, Alexandra Amon, Marika Asgari, Ami Choi, Thomas Erben, Karl Glazebrook, Joachim Harnois-Deraps, Catherine Heymans, Hendrik Hildebrandt, Henk Hoekstra, Dominik Klaes, Konrad Kuijken, Chris Lidman, Alexander Mead, Lance Miller, David Parkinson, Gregory B. Poole, Peter Schneider, Massimo Viola, Christian Wolf

The complementarity of our observables allows for constraints on modified gravity degrees of freedom that are not simultaneously bounded with either probe alone, and up to a factor of three improvement in the $S_8$ constraint in the extended cosmology compared to KiDS alone.

Cosmology and Nongalactic Astrophysics

Paper
Code

Full-Page Text Recognition: Learning Where to Start and When to Stop

no code implementations • 27 Apr 2017 • Bastien Moysset, Christopher Kermorvant, Christian Wolf

Text line detection and localization is a crucial step for full page document analysis, but still suffers from heterogeneity of real life documents.

Line Detection Position

Paper
Add Code

Pose-conditioned Spatio-Temporal Attention for Human Action Recognition

no code implementations • 29 Mar 2017 • Fabien Baradel, Christian Wolf, Julien Mille

We show that it is of high interest to shift the attention to different hands at different time steps depending on the activity itself.

Action Recognition Human Activity Recognition +1

Paper
Add Code

Learning to detect and localize many objects from few examples

no code implementations • 17 Nov 2016 • Bastien Moysset, Christoper Kermorvant, Christian Wolf

The current trend in object detection and localization is to learn predictions with high capacity deep neural networks trained on a very large amount of annotated data and using a high amount of processing power.

object-detection Object Detection

Paper
Add Code

Hand Pose Estimation through Semi-Supervised and Weakly-Supervised Learning

no code implementations • 20 Nov 2015 • Natalia Neverova, Christian Wolf, Florian Nebout, Graham Taylor

We propose a method for hand pose estimation based on a deep regressor trained on two different kinds of input.

Hand Pose Estimation Segmentation +1

Paper
Add Code

Learning Human Identity from Motion Patterns

no code implementations • 12 Nov 2015 • Natalia Neverova, Christian Wolf, Griffin Lacey, Lex Fridman, Deepak Chandra, Brandon Barbello, Graham Taylor

We present a large-scale study exploring the capability of temporal deep neural networks to interpret natural human kinematics and introduce the first method for active biometric authentication with mobile inertial sensors.

Paper
Add Code

Activity recognition from videos with parallel hypergraph matching on GPUs

no code implementations • 4 May 2015 • Eric Lombardi, Christian Wolf, Oya Celiktutan, Bülent Sankur

In this paper, we propose a method for activity recognition from videos based on sparse local features and hypergraph matching.

Activity Recognition Graph Matching +2

Paper
Add Code

ModDrop: adaptive multi-modal gesture recognition

no code implementations • 31 Dec 2014 • Natalia Neverova, Christian Wolf, Graham W. Taylor, Florian Nebout

We present a method for gesture detection and localisation based on multi-scale and multi-modal deep learning.

Gesture Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.