Search Results for author: Jilles Dibangoye

Found 11 papers, 2 papers with code

Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing

no code implementations ICML 2020 Yuxuan Xie, Jilles Dibangoye, Olivier Buffet

Optimally solving decentralized partially observable Markov decision processes under either full or no information sharing received significant attention in recent years.

Vocal Bursts Valence Prediction

Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing

no code implementations ICML 2020 Yuxuan Xie, Jilles Dibangoye, Olivier Buffet

Optimally solving decentralized partially observable Markov decision processes under either full or no information sharing received significant attention in recent years.

Vocal Bursts Valence Prediction

HSVI for zs-POSGs using Concavity, Convexity and Lipschitz Properties

no code implementations25 Oct 2021 Aurélien Delage, Olivier Buffet, Jilles Dibangoye

Dynamic programming and heuristic search are at the core of state-of-the-art solvers for sequential decision-making problems.

Decision Making

Learning to plan with uncertain topological maps

1 code implementation ECCV 2020 Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf

We train an agent to navigate in 3D environments using a hierarchical strategy including a high-level graph based planner and a local policy.

Inductive Bias Navigate

On Bellman's Optimality Principle for zs-POSGs

no code implementations29 Jun 2020 Olivier Buffet, Jilles Dibangoye, Aurélien Delage, Abdallah Saffidine, Vincent Thomas

Many non-trivial sequential decision-making problems are efficiently solved by relying on Bellman's optimality principle, i. e., exploiting the fact that sub-problems are nested recursively within the original problem.

Decision Making

EgoMap: Projective mapping and structured egocentric memory for Deep RL

no code implementations24 Jan 2020 Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

The EgoMap architecture incorporates several inductive biases including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map.

Memorization reinforcement-learning +1

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer

1 code implementation3 Apr 2019 Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

In this paper we argue that research on training agents capable of complex reasoning can be simplified by decoupling from the requirement of high fidelity photographic observations.

Reinforcement Learning (RL) Scene Understanding

rho-POMDPs have Lipschitz-Continuous epsilon-Optimal Value Functions

no code implementations NeurIPS 2018 Mathieu Fehr, Olivier Buffet, Vincent Thomas, Jilles Dibangoye

In this paper, we focus on POMDPs and ρ-POMDPs with λ ρ -Lipschitz reward function, and demonstrate that, for finite horizons, the optimal value function is Lipschitz-continuous.

Learning to Act in Decentralized Partially Observable MDPs

no code implementations ICML 2018 Jilles Dibangoye, Olivier Buffet

We address a long-standing open problem of reinforcement learning in decentralized partially observable Markov decision processes.

Multi-agent Reinforcement Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.