Search Results for author: Omer Gottesman

Found 17 papers, 5 papers with code

Decision-Focused Model-based Reinforcement Learning for Reward Transfer

no code implementations • 6 Apr 2023 • Abhishek Sharma, Sonali Parbhoo, Omer Gottesman, Finale Doshi-Velez

Decision-focused (DF) model-based reinforcement learning has recently been introduced as a powerful algorithm that can focus on learning the MDP dynamics that are most relevant for obtaining high returns.

Model-based Reinforcement Learning reinforcement-learning

Paper
Add Code

On the Geometry of Reinforcement Learning in Continuous State and Action Spaces

no code implementations • 29 Dec 2022 • Saket Tiwari, Omer Gottesman, George Konidaris

Central to our work is the idea that the transition dynamics induce a low dimensional manifold of reachable states embedded in the high-dimensional nominal state space.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes

no code implementations • 30 Jul 2022 • Kelly W. Zhang, Omer Gottesman, Finale Doshi-Velez

In the reinforcement learning literature, there are many algorithms developed for either Contextual Bandit (CB) or Markov Decision Processes (MDP) environments.

Decision Making reinforcement-learning +1

Paper
Add Code

Faster Deep Reinforcement Learning with Slower Online Network

1 code implementation • 10 Dec 2021 • Kavosh Asadi, Rasool Fakoor, Omer Gottesman, Taesup Kim, Michael L. Littman, Alexander J. Smola

In this paper we endow two popular deep reinforcement learning algorithms, namely DQN and Rainbow, with updates that incentivize the online network to remain in the proximity of the target network.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation

no code implementations • 28 Nov 2021 • Ramtin Keramati, Omer Gottesman, Leo Anthony Celi, Finale Doshi-Velez, Emma Brunskill

Off-policy policy evaluation methods for sequential decision making can be used to help identify if a proposed decision policy is better than a current baseline policy.

Decision Making

Paper
Add Code

Coarse-Grained Smoothness for RL in Metric Spaces

no code implementations • 23 Oct 2021 • Omer Gottesman, Kavosh Asadi, Cameron Allen, Sam Lobel, George Konidaris, Michael Littman

We propose a new coarse-grained smoothness definition that generalizes the notion of Lipschitz continuity, is more widely applicable, and allows us to compute significantly tighter bounds on Q-functions, leading to improved learning.

Decision Making

Paper
Add Code

State Relevance for Off-Policy Evaluation

1 code implementation • 13 Sep 2021 • Simon P. Shen, Yecheng Jason Ma, Omer Gottesman, Finale Doshi-Velez

Importance sampling-based estimators for off-policy evaluation (OPE) are valued for their simplicity, unbiasedness, and reliance on relatively few assumptions.

Off-policy evaluation

Paper
Code

Learning Markov State Abstractions for Deep Reinforcement Learning

1 code implementation • NeurIPS 2021 • Cameron Allen, Neev Parikh, Omer Gottesman, George Konidaris

A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov.

Continuous Control Contrastive Learning +2

Paper
Code

Learning to search efficiently for causally near-optimal treatments

1 code implementation • NeurIPS 2020 • Samuel Håkansson, Viktor Lindblom, Omer Gottesman, Fredrik D. Johansson

Finding an effective medical treatment often requires a search by trial and error.

Causal Inference Reinforcement Learning (RL)

Paper
Code

Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions

no code implementations • ICML 2020 • Omer Gottesman, Joseph Futoma, Yao Liu, Sonali Parbhoo, Leo Anthony Celi, Emma Brunskill, Finale Doshi-Velez

Off-policy evaluation in reinforcement learning offers the chance of using observational data to improve future outcomes in domains such as healthcare and education, but safe deployment in high stakes settings requires ways of assessing its validity.

Off-policy evaluation reinforcement-learning

Paper
Add Code

A general method for regularizing tensor decomposition methods via pseudo-data

no code implementations • 24 May 2019 • Omer Gottesman, Weiwei Pan, Finale Doshi-Velez

Tensor decomposition methods allow us to learn the parameters of latent variable models through decomposition of low-order moments of data.

Tensor Decomposition Transfer Learning

Paper
Add Code

Combining Parametric and Nonparametric Models for Off-Policy Evaluation

no code implementations • 14 May 2019 • Omer Gottesman, Yao Liu, Scott Sussex, Emma Brunskill, Finale Doshi-Velez

We consider a model-based approach to perform batch off-policy evaluation in reinforcement learning.

Off-policy evaluation reinforcement-learning

Paper
Add Code

Improving Sepsis Treatment Strategies by Combining Deep and Kernel-Based Reinforcement Learning

no code implementations • 15 Jan 2019 • Xuefeng Peng, Yi Ding, David Wihl, Omer Gottesman, Matthieu Komorowski, Li-wei H. Lehman, Andrew Ross, Aldo Faisal, Finale Doshi-Velez

On a large retrospective cohort, this mixture-based approach outperforms physician, kernel only, and DRL-only experts.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters

no code implementations • 3 Jul 2018 • Aniruddh Raghu, Omer Gottesman, Yao Liu, Matthieu Komorowski, Aldo Faisal, Finale Doshi-Velez, Emma Brunskill

In this work, we consider the problem of estimating a behaviour policy for use in Off-Policy Policy Evaluation (OPE) when the true behaviour policy is unknown.

Paper
Add Code

Evaluating Reinforcement Learning Algorithms in Observational Health Settings

no code implementations • 31 May 2018 • Omer Gottesman, Fredrik Johansson, Joshua Meier, Jack Dent, Dong-hun Lee, Srivatsan Srinivasan, Linying Zhang, Yi Ding, David Wihl, Xuefeng Peng, Jiayu Yao, Isaac Lage, Christopher Mosch, Li-wei H. Lehman, Matthieu Komorowski, Aldo Faisal, Leo Anthony Celi, David Sontag, Finale Doshi-Velez

Much attention has been devoted recently to the development of machine learning algorithms with the goal of improving treatment policies in healthcare.

BIG-bench Machine Learning Decision Making +3

Paper
Add Code

Representation Balancing MDPs for Off-Policy Policy Evaluation

1 code implementation • NeurIPS 2018 • Yao Liu, Omer Gottesman, Aniruddh Raghu, Matthieu Komorowski, Aldo Faisal, Finale Doshi-Velez, Emma Brunskill

We study the problem of off-policy policy evaluation (OPPE) in RL.

Paper
Code

Weighted Tensor Decomposition for Learning Latent Variables with Partial Data

no code implementations • 18 Oct 2017 • Omer Gottesman, Weiwei Pan, Finale Doshi-Velez

Tensor decomposition methods are popular tools for learning latent variables given only lower-order moments of the data.

Tensor Decomposition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.