Search Results for author: Avraham Ruderman

Found 6 papers, 1 papers with code

PSI Draft Specification

no code implementations • 2 May 2022 • Mark Reid, James Montgomery, Barry Drake, Avraham Ruderman

This document presents the draft specification for delivering machine learning services over HTTP, developed as part of the Protocols and Structures for Inference project, which concluded in 2013.

BIG-bench Machine Learning

Paper
Add Code

Uncovering Surprising Behaviors in Reinforcement Learning via Worst-case Analysis

no code implementations • ICLR 2019 • Avraham Ruderman, Richard Everett, Bristy Sikder, Hubert Soyer, Jonathan Uesato, Ananya Kumar, Charlie Beattie, Pushmeet Kohli

Reinforcement learning agents are typically trained and evaluated according to their performance averaged over some distribution of environment settings.

Navigate reinforcement-learning +1

Paper
Add Code

Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures

no code implementations • ICLR 2019 • Jonathan Uesato, Ananya Kumar, Csaba Szepesvari, Tom Erez, Avraham Ruderman, Keith Anderson, Krishmamurthy, Dvijotham, Nicolas Heess, Pushmeet Kohli

We demonstrate this is an issue for current agents, where even matching the compute used for training is sometimes insufficient for evaluation.

Autonomous Driving Humanoid Control

Paper
Add Code

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

no code implementations • 3 Jul 2018 • Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, Antonio Garcia Castaneda, Charles Beattie, Neil C. Rabinowitz, Ari S. Morcos, Avraham Ruderman, Nicolas Sonnerat, Tim Green, Louise Deason, Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, Thore Graepel

Recent progress in artificial intelligence through reinforcement learning (RL) has shown great success on increasingly complex single-agent environments and two-player turn-based games.

Reinforcement Learning (RL)

Paper
Add Code

Pooling is neither necessary nor sufficient for appropriate deformation stability in CNNs

no code implementations • ICLR 2019 • Avraham Ruderman, Neil C. Rabinowitz, Ari S. Morcos, Daniel Zoran

In this work, we rigorously test these questions, and find that deformation stability in convolutional networks is more nuanced than it first appears: (1) Deformation invariance is not a binary property, but rather that different tasks require different degrees of deformation stability at different layers.

General Classification Image Classification +1

Paper
Add Code

Model-Free Episodic Control

3 code implementations • 14 Jun 2016 • Charles Blundell, Benigno Uria, Alexander Pritzel, Yazhe Li, Avraham Ruderman, Joel Z. Leibo, Jack Rae, Daan Wierstra, Demis Hassabis

State of the art deep reinforcement learning algorithms take many millions of interactions to attain human-level performance.

Decision Making Hippocampus +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.