Search Results for author: Gaurav Manek

Found 7 papers, 3 papers with code

Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning

no code implementations25 Nov 2023 Melrose Roderick, Gaurav Manek, Felix Berkenkamp, J. Zico Kolter

A key problem in off-policy Reinforcement Learning (RL) is the mismatch, or distribution shift, between the dataset and the distribution over states and actions visited by the learned policy.

Q-Learning Reinforcement Learning (RL)

Model-based Reinforcement Learning with Ensembled Model-value Expansion

no code implementations29 Sep 2021 Gaurav Manek, J Zico Kolter

Model-based reinforcement learning (MBRL) methods are often more data-efficient and quicker to converge than their model-free counterparts, but typically rely crucially on accurate modeling of the environment dynamics and associated uncertainty in order to perform well.

Model-based Reinforcement Learning reinforcement-learning +1

Learning Stable Deep Dynamics Models

1 code implementation NeurIPS 2019 Gaurav Manek, J. Zico Kolter

Deep networks are commonly used to model dynamical systems, predicting how the state of a system will evolve over time (either autonomously or in response to control inputs).

Pruning Convolutional Neural Networks for Image Instance Retrieval

no code implementations18 Jul 2017 Gaurav Manek, Jie Lin, Vijay Chandrasekhar, Ling-Yu Duan, Sateesh Giduthuri, Xiao-Li Li, Tomaso Poggio

In this work, we focus on the problem of image instance retrieval with deep descriptors extracted from pruned Convolutional Neural Networks (CNN).

Image Instance Retrieval Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.