no code implementations • 14 Oct 2024 • Eryk Banatt, Jonathan Cheng, Skanda Vaidyanath, Tiffany Hwu
These challenges present significant obstacles for LLM chat user interfaces, which rely on multi-turn interactions to facilitate effective collaboration.
no code implementations • 26 Aug 2023 • Samar Khanna, Skanda Vaidyanath, Akash Velu
For instance, given a network that has been trained on a source task, we would like to re-train this network on a similar, yet different, target task while maintaining its performance on the source task.
1 code implementation • 21 Jul 2023 • Akash Velu, Skanda Vaidyanath, Dilip Arumugam
Environments for sequential decision-making problems often provide only sparse evaluative feedback to guide reinforcement-learning agents.
1 code implementation • 24 Jan 2023 • Ken Kansky, Skanda Vaidyanath, Scott Swingle, Xinghua Lou, Miguel Lazaro-Gredilla, Dileep George
We provide a benchmark of more than 200 PushWorld puzzles in PDDL and in an OpenAI Gym environment.
1 code implementation • 28 Feb 2022 • Divyansh Garg, Skanda Vaidyanath, Kuno Kim, Jiaming Song, Stefano Ermon
Learning policies that effectively utilize language instructions in complex, multi-task environments is an important problem in sequential decision-making.