Search Results for author: Daniel Hernandez

Found 7 papers, 2 papers with code

ETHER: Aligning Emergent Communication for Hindsight Experience Replay

no code implementations • 28 Jul 2023 • Kevin Denamganaï, Daniel Hernandez, Ozan Vardal, Sondess Missaoui, James Alfred Walker

We show that the agents of the referential game develop an emergent artificial language that is aligned with the natural-like language used to describe goals in the BabyAI benchmark, and that this language is expressive enough to also describe unsuccessful RL trajectories, thereby providing feedback that lets the RL agent leverage the linguistic, structured information contained in all trajectories.

Inductive Bias, Instruction Following, +1

Composing Efficient, Robust Tests for Policy Selection

no code implementations • 12 Jun 2023 • Dustin Morrill, Thomas J. Walsh, Daniel Hernandez, Peter R. Wurman, Peter Stone

Empirical results demonstrate that RPOSST finds a small set of test cases that identify high quality policies in a toy one-shot game, poker datasets, and a high-fidelity racing simulator.

BRExIt: On Opponent Modelling in Expert Iteration

no code implementations • 31 May 2022 • Daniel Hernandez, Hendrik Baier, Michael Kaisers

Finding a best response policy is a central objective in game theory and multi-agent learning, with modern population-based training approaches employing reinforcement learning algorithms as best-response oracles to improve play against candidate opponents (typically previously learnt policies).

A Comparison of Self-Play Algorithms Under a Generalized Framework

no code implementations • 8 Jun 2020 • Daniel Hernandez, Kevin Denamganai, Sam Devlin, Spyridon Samothrakis, James Alfred Walker

They make it possible to verify and replicate existing findings and to link connected results.

Reinforcement Learning (RL)

Metagame Autobalancing for Competitive Multiplayer Games

1 code implementation • 8 Jun 2020 • Daniel Hernandez, Charles Takashi Toyin Gbadomosi, James Goodman, James Alfred Walker

Automated game balancing has often focused on single-agent scenarios.

Nonlinear Evolution via Spatially-Dependent Linear Dynamics for Electrophysiology and Calcium Data

no code implementations • 6 Nov 2018 • Daniel Hernandez, Antonio Khalil Moretti, Ziqiang Wei, Shreya Saxena, John Cunningham, Liam Paninski

We present Variational Inference for Nonlinear Dynamics (VIND), a variational inference framework that is able to uncover nonlinear, smooth latent dynamics from sequential data.

Time Series, Time Series Analysis, +1

Learning Socially Appropriate Robot Approaching Behavior Toward Groups using Deep Reinforcement Learning

1 code implementation • 16 Oct 2018 • Yuan Gao, Fangkai Yang, Martin Frisk, Daniel Hernandez, Christopher Peters, Ginevra Castellano

Deep reinforcement learning has recently been widely applied in robotics to study tasks such as locomotion and grasping, but its application to social human-robot interaction (HRI) remains a challenge.

reinforcement-learning, Reinforcement Learning (RL)
