Search Results for author: Jonas Schneider

Found 14 papers, 8 papers with code

Parametric and Multivariate Uncertainty Calibration for Regression and Object Detection

1 code implementation • 4 Jul 2022 • Fabian Küppers, Jonas Schneider, Anselm Haselhoff

Our experiments show that common detection models overestimate the spatial uncertainty in comparison to the observed error.

object-detection Object Detection +3

314

Confidence Calibration for Object Detection and Segmentation

no code implementations • 25 Feb 2022 • Fabian Küppers, Anselm Haselhoff, Jan Kronenberger, Jonas Schneider

Calibrated confidence estimates obtained from neural networks are crucial, particularly for safety-critical applications such as autonomous driving or medical image diagnosis.

Autonomous Driving Instance Segmentation +5

Bayesian Confidence Calibration for Epistemic Uncertainty Modelling

1 code implementation • 21 Sep 2021 • Fabian Küppers, Jan Kronenberger, Jonas Schneider, Anselm Haselhoff

We introduce Bayesian confidence calibration - a framework to obtain calibrated confidence estimates in conjunction with an uncertainty of the calibration method.

object-detection Object Detection +1

314

On Feature Relevance Uncertainty: A Monte Carlo Dropout Sampling Approach

no code implementations • 4 Aug 2020 • Kai Fischer, Jonas Schneider

Understanding decisions made by neural networks is key for the deployment of intelligent systems in real world applications.

Decision Making

Dota 2 with Large Scale Deep Reinforcement Learning

1 code implementation • 13 Dec 2019 • Christopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Rafal Józefowicz, Scott Gray, Catherine Olsson, Jakub Pachocki, Michael Petrov, Henrique Pondé de Oliveira Pinto, Jonathan Raiman, Tim Salimans, Jeremy Schlatter, Jonas Schneider, Szymon Sidor, Ilya Sutskever, Jie Tang, Filip Wolski, Susan Zhang

On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game.

Dota 2 reinforcement-learning +1

399

Solving Rubik's Cube with a Robot Hand

2 code implementations • 16 Oct 2019 • OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang

We demonstrate that models trained only in simulation can be used to solve a manipulation problem of unprecedented complexity on a real robot.

Meta-Learning Rubik's Cube

Learning Dexterous In-Hand Manipulation

no code implementations • 1 Aug 2018 • OpenAI, Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba

We use reinforcement learning (RL) to learn dexterous in-hand manipulation policies which can perform vision-based object reorientation on a physical Shadow Dexterous Hand.

Friction reinforcement-learning +1

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

30 code implementations • 26 Feb 2018 • Matthias Plappert, Marcin Andrychowicz, Alex Ray, Bob McGrew, Bowen Baker, Glenn Powell, Jonas Schneider, Josh Tobin, Maciek Chociej, Peter Welinder, Vikash Kumar, Wojciech Zaremba

The purpose of this technical report is two-fold.

Continuous Control Multi-Goal Reinforcement Learning +3

141

Domain Randomization and Generative Models for Robotic Grasping

no code implementations • 17 Oct 2017 • Joshua Tobin, Lukas Biewald, Rocky Duan, Marcin Andrychowicz, Ankur Handa, Vikash Kumar, Bob McGrew, Jonas Schneider, Peter Welinder, Wojciech Zaremba, Pieter Abbeel

In this work, we explore a novel data generation pipeline for training a deep neural network to perform grasp planning that applies the idea of domain randomization to object synthesis.

Object Robotic Grasping

Hindsight Experience Replay

26 code implementations • NeurIPS 2017 • Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin, Pieter Abbeel, Wojciech Zaremba

Dealing with sparse rewards is one of the biggest challenges in Reinforcement Learning (RL).

Reinforcement Learning (RL)

7,975

One-Shot Imitation Learning

no code implementations • NeurIPS 2017 • Yan Duan, Marcin Andrychowicz, Bradly C. Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech Zaremba

A neural net is trained that takes as input one demonstration and the current state (which initially is the initial state of the other demonstration of the pair), and outputs an action with the goal that the resulting sequence of states and actions matches as closely as possible with the second demonstration.

Feature Engineering Imitation Learning +1

Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World

6 code implementations • 20 Mar 2017 • Josh Tobin, Rachel Fong, Alex Ray, Jonas Schneider, Wojciech Zaremba, Pieter Abbeel

Bridging the 'reality gap' that separates simulated robotics from experiments on hardware could accelerate robotic research through improved data availability.

Object Localization

458

Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model

no code implementations • 11 Oct 2016 • Paul Christiano, Zain Shah, Igor Mordatch, Jonas Schneider, Trevor Blackwell, Joshua Tobin, Pieter Abbeel, Wojciech Zaremba

Nevertheless, often the overall gist of what the policy does in simulation remains valid in the real world.

Friction

OpenAI Gym

45 code implementations • 5 Jun 2016 • Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, Wojciech Zaremba

OpenAI Gym is a toolkit for reinforcement learning research.

reinforcement-learning Reinforcement Learning (RL)

33,907