Search Results for author: Skanda Vaidyanath

Found 5 papers, 3 papers with code

WILT: A Multi-Turn, Memorization-Robust Inductive Logic Benchmark for LLMs

no code implementations14 Oct 2024 Eryk Banatt, Jonathan Cheng, Skanda Vaidyanath, Tiffany Hwu

These challenges present significant obstacles for LLM chat user interfaces, which rely on multi-turn interactions to facilitate effective collaboration.

Memorization

Differentiable Weight Masks for Domain Transfer

no code implementations26 Aug 2023 Samar Khanna, Skanda Vaidyanath, Akash Velu

For instance, given a network that has been trained on a source task, we would like to re-train this network on a similar, yet different, target task while maintaining its performance on the source task.

Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning

1 code implementation21 Jul 2023 Akash Velu, Skanda Vaidyanath, Dilip Arumugam

Oftentimes, environments for sequential decision-making problems can be quite sparse in the provision of evaluative feedback to guide reinforcement-learning agents.

Decision Making Deep Reinforcement Learning +4

LISA: Learning Interpretable Skill Abstractions from Language

1 code implementation28 Feb 2022 Divyansh Garg, Skanda Vaidyanath, Kuno Kim, Jiaming Song, Stefano Ermon

Learning policies that effectively utilize language instructions in complex, multi-task environments is an important problem in sequential decision-making.

Imitation Learning Quantization +1

Cannot find the paper you are looking for? You can Submit a new open access paper.