Search Results for author: Shubham Kumar Bharti

Found 4 papers, 1 papers with code

Optimally Teaching a Linear Behavior Cloning Agent

no code implementations26 Nov 2023 Shubham Kumar Bharti, Stephen Wright, Adish Singla, Xiaojin Zhu

The goal of the teacher is to teach a realizable target policy to the learner using minimum number of state demonstrations.

Provable Defense against Backdoor Policies in Reinforcement Learning

1 code implementation18 Nov 2022 Shubham Kumar Bharti, Xuezhou Zhang, Adish Singla, Xiaojin Zhu

Instead, our defense mechanism sanitizes the backdoor policy by projecting observed states to a 'safe subspace', estimated from a small number of interactions with a clean (non-triggered) environment.

reinforcement-learning Reinforcement Learning (RL)

The Sample Complexity of Teaching-by-Reinforcement on Q-Learning

no code implementations16 Jun 2020 Xuezhou Zhang, Shubham Kumar Bharti, Yuzhe ma, Adish Singla, Xiaojin Zhu

Our TDim results provide the minimum number of samples needed for reinforcement learning, and we discuss their connections to standard PAC-style RL sample complexity and teaching-by-demonstration sample complexity results.

Q-Learning reinforcement-learning +1

On the relationship between multitask neural networks and multitask Gaussian Processes

no code implementations12 Dec 2019 Karthikeyan K, Shubham Kumar Bharti, Piyush Rai

Despite the effectiveness of multitask deep neural network (MTDNN), there is a limited theoretical understanding on how the information is shared across different tasks in MTDNN.

Bayesian Inference Gaussian Processes

Cannot find the paper you are looking for? You can Submit a new open access paper.