Search Results for author: Skanda Vaidyanath

Found 4 papers, 3 papers with code

Differentiable Weight Masks for Domain Transfer

no code implementations26 Aug 2023 Samar Khanna, Skanda Vaidyanath, Akash Velu

For instance, given a network that has been trained on a source task, we would like to re-train this network on a similar, yet different, target task while maintaining its performance on the source task.

Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning

1 code implementation21 Jul 2023 Akash Velu, Skanda Vaidyanath, Dilip Arumugam

Oftentimes, environments for sequential decision-making problems can be quite sparse in the provision of evaluative feedback to guide reinforcement-learning agents.

Decision Making Off-policy evaluation +2

LISA: Learning Interpretable Skill Abstractions from Language

1 code implementation28 Feb 2022 Divyansh Garg, Skanda Vaidyanath, Kuno Kim, Jiaming Song, Stefano Ermon

Learning policies that effectively utilize language instructions in complex, multi-task environments is an important problem in sequential decision-making.

Imitation Learning Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.