Search Results for author: Yori Zwols

Found 5 papers, 2 papers with code

Solving Mixed Integer Programs Using Neural Networks

1 code implementation • 23 Dec 2020 • Vinod Nair, Sergey Bartunov, Felix Gimeno, Ingrid von Glehn, Pawel Lichocki, Ivan Lobov, Brendan O'Donoghue, Nicolas Sonnerat, Christian Tjandraatmadja, Pengming Wang, Ravichandra Addanki, Tharindi Hapuarachchi, Thomas Keck, James Keeling, Pushmeet Kohli, Ira Ktena, Yujia Li, Oriol Vinyals, Yori Zwols

Our approach constructs two corresponding neural network-based components, Neural Diving and Neural Branching, to use in a base MIP solver such as SCIP.

Variable Selection

12,794

Paper
Code

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

no code implementations • ICLR 2019 • Lars Buesing, Theophane Weber, Yori Zwols, Sebastien Racaniere, Arthur Guez, Jean-Baptiste Lespiau, Nicolas Heess

In contrast to off-policy algorithms based on Importance Sampling which re-weight data, CF-GPS leverages a model to explicitly consider alternative outcomes, allowing the algorithm to make better use of experience data.

counterfactual

Paper
Add Code

Generative Temporal Models with Spatial Memory for Partially Observed Environments

no code implementations • ICML 2018 • Marco Fraccaro, Danilo Jimenez Rezende, Yori Zwols, Alexander Pritzel, S. M. Ali Eslami, Fabio Viola

In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or via use as part of an explicit planning mechanism.

Model-based Reinforcement Learning

Paper
Add Code

PathNet: Evolution Channels Gradient Descent in Super Neural Networks

1 code implementation • 30 Jan 2017 • Chrisantha Fernando, Dylan Banarse, Charles Blundell, Yori Zwols, David Ha, Andrei A. Rusu, Alexander Pritzel, Daan Wierstra

It is a neural network algorithm that uses agents embedded in the neural network whose task is to discover which parts of the network to re-use for new tasks.

Ranked #5 on Continual Learning on F-CelebA (10 tasks)

Continual Learning reinforcement-learning +2

Paper
Code

Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions

no code implementations • 3 Dec 2015 • Peter Sunehag, Richard Evans, Gabriel Dulac-Arnold, Yori Zwols, Daniel Visentin, Ben Coppin

Further, we use deep deterministic policy gradients to learn a policy that for each position of the slate, guides attention towards the part of the action space in which the value is the highest and we only evaluate actions in this area.

Q-Learning Recommendation Systems

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.