Search Results for author: Arash Tavakoli

Found 9 papers, 7 papers with code

On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural Networks

2 code implementations • ICLR 2022 • Maximilian Seitzer, Arash Tavakoli, Dimitrije Antic, Georg Martius

In this work, we examine this approach and identify potential hazards associated with the use of log-likelihood in conjunction with gradient-based optimizers.

182

Paper
Code

Orchestrated Value Mapping for Reinforcement Learning

1 code implementation • ICLR 2022 • Mehdi Fatemi, Arash Tavakoli

We present a general convergent class of reinforcement learning algorithms that is founded on two distinct principles: (1) mapping value estimates to a different space using arbitrary functions from a broad class, and (2) linearly decomposing the reward signal into multiple channels.

Ensemble Learning Q-Learning +2

Paper
Code

Learning to Represent Action Values as a Hypergraph on the Action Vertices

1 code implementation • ICLR 2021 • Arash Tavakoli, Mehdi Fatemi, Petar Kormushev

To test this, we set forth the action hypergraph networks framework -- a class of functions for learning action representations in multi-dimensional discrete action spaces with a structural inductive bias.

Atari Games Continuous Control +4

Paper
Code

A neural network oracle for quantum nonlocality problems in networks

1 code implementation • 24 Jul 2019 • Tamás Kriváchy, Yu Cai, Daniel Cavalcanti, Arash Tavakoli, Nicolas Gisin, Nicolas Brunner

As such, the neural network acts as an oracle, demonstrating that a behavior is classical if it can be learned.

Causal Inference

Paper
Code

Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning

2 code implementations • NeurIPS 2019 • Harm van Seijen, Mehdi Fatemi, Arash Tavakoli

In an effort to better understand the different ways in which the discount factor affects the optimization process in reinforcement learning, we designed a set of experiments to study each effect in isolation.

General Reinforcement Learning reinforcement-learning +1

Paper
Code

Exploring Restart Distributions

no code implementations • 27 Nov 2018 • Arash Tavakoli, Vitaly Levdik, Riashat Islam, Christopher M. Smith, Petar Kormushev

We consider the generic approach of using an experience memory to help exploration by adapting a restart distribution.

Paper
Add Code

Time Limits in Reinforcement Learning

1 code implementation • ICML 2018 • Fabio Pardo, Arash Tavakoli, Vitaly Levdik, Petar Kormushev

In case (ii), the time limits are not part of the environment and are only used to facilitate learning.

General Reinforcement Learning reinforcement-learning +1

392

Paper
Code

Action Branching Architectures for Deep Reinforcement Learning

5 code implementations • 24 Nov 2017 • Arash Tavakoli, Fabio Pardo, Petar Kormushev

This approach achieves a linear increase of the number of network outputs with the number of degrees of freedom by allowing a level of independence for each individual action dimension.

Continuous Control General Reinforcement Learning +2

105

Paper
Code

Multiplayer Games for Learning Multirobot Coordination Algorithms

no code implementations • 20 Apr 2016 • Arash Tavakoli, Haig Nalbandian, Nora Ayanian

This ability, if learned as a set of distributed multirobot coordination strategies, can enable programming large groups of robots to collaborate towards complex coordination objectives in a way similar to humans.

Decision Making

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.