Search Results for author: Skanda Vaidyanath

Found 4 papers, 3 papers with code

Differentiable Weight Masks for Domain Transfer

no code implementations • 26 Aug 2023 • Samar Khanna, Skanda Vaidyanath, Akash Velu

For instance, given a network that has been trained on a source task, we would like to re-train this network on a similar, yet different, target task while maintaining its performance on the source task.

Paper
Add Code

Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning

1 code implementation • 21 Jul 2023 • Akash Velu, Skanda Vaidyanath, Dilip Arumugam

Oftentimes, environments for sequential decision-making problems can be quite sparse in the provision of evaluative feedback to guide reinforcement-learning agents.

Decision Making Off-policy evaluation +2

Paper
Code

PushWorld: A benchmark for manipulation planning with tools and movable obstacles

1 code implementation • 24 Jan 2023 • Ken Kansky, Skanda Vaidyanath, Scott Swingle, Xinghua Lou, Miguel Lazaro-Gredilla, Dileep George

We provide a benchmark of more than 200 PushWorld puzzles in PDDL and in an OpenAI Gym environment.

OpenAI Gym Starcraft

Paper
Code

LISA: Learning Interpretable Skill Abstractions from Language

1 code implementation • 28 Feb 2022 • Divyansh Garg, Skanda Vaidyanath, Kuno Kim, Jiaming Song, Stefano Ermon

Learning policies that effectively utilize language instructions in complex, multi-task environments is an important problem in sequential decision-making.

Imitation Learning Quantization

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.