Search Results for author: Davis Foote

Found 3 papers, 1 papers with code

When Your AIs Deceive You: Challenges with Partial Observability of Human Evaluators in Reward Learning

no code implementations • 27 Feb 2024 • Leon Lang, Davis Foote, Stuart Russell, Anca Dragan, Erik Jenner, Scott Emmons

Past analyses of reinforcement learning from human feedback (RLHF) assume that the human fully observes the environment.

Paper
Add Code

Development and Validation of a Deep Learning Algorithm for Improving Gleason Scoring of Prostate Cancer

no code implementations • 15 Nov 2018 • Kunal Nagpal, Davis Foote, Yun Liu, Po-Hsuan, Chen, Ellery Wulczyn, Fraser Tan, Niels Olson, Jenny L. Smith, Arash Mohtashamian, James H. Wren, Greg S. Corrado, Robert MacDonald, Lily H. Peng, Mahul B. Amin, Andrew J. Evans, Ankur R. Sangoi, Craig H. Mermel, Jason D. Hipp, Martin C. Stumpe

For prostate cancer patients, the Gleason score is one of the most important prognostic factors, potentially determining treatment independent of the stage.

whole slide images

Paper
Add Code

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

3 code implementations • NeurIPS 2017 • Haoran Tang, Rein Houthooft, Davis Foote, Adam Stooke, Xi Chen, Yan Duan, John Schulman, Filip De Turck, Pieter Abbeel

In this work, we describe a surprising finding: a simple generalization of the classic count-based approach can reach near state-of-the-art performance on various high-dimensional and/or continuous deep RL benchmarks.

Ranked #1 on Atari Games on Atari 2600 Freeway

Atari Games Continuous Control +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.