Search Results for author: Sam Lobel

Found 3 papers, 2 papers with code

Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning

1 code implementation • 5 Jun 2023 • Sam Lobel, Akhil Bagaria, George Konidaris

We propose a new method for count-based exploration in high-dimensional state spaces.

Montezuma's Revenge • reinforcement-learning

Coarse-Grained Smoothness for RL in Metric Spaces

no code implementations • 23 Oct 2021 • Omer Gottesman, Kavosh Asadi, Cameron Allen, Sam Lobel, George Konidaris, Michael Littman

We propose a new coarse-grained smoothness definition that generalizes the notion of Lipschitz continuity, is more widely applicable, and allows us to compute significantly tighter bounds on Q-functions, leading to improved learning.

Decision Making

Towards Amortized Ranking-Critical Training for Collaborative Filtering

1 code implementation • 10 Jun 2019 • Sam Lobel, Chunyuan Li, Jianfeng Gao, Lawrence Carin

In this paper, we investigate new methods for training collaborative filtering models based on actor-critic reinforcement learning, in order to directly optimize the non-differentiable quality metrics of interest.

Ranked #4 on Recommendation Systems on Million Song Dataset

Collaborative Filtering • Learning-To-Rank
