no code implementations • 2 Feb 2024 • Subbarao Kambhampati, Karthik Valmeekam, Lin Guan, Kaya Stechly, Mudit Verma, Siddhant Bhambri, Lucas Saldyt, Anil Murthy
On the other side are perhaps over-pessimistic claims that all that LLMs are good for in planning/reasoning tasks are as mere translators of the problem specification from one syntactic format to another, and ship the problem off to external symbolic solvers.
no code implementations • 10 Jan 2024 • Mudit Verma, Siddhant Bhambri, Subbarao Kambhampati
In this work, we explore the task of Perceived Behavior Recognition, where a robot employs a Large Language Model (LLM) to assess the robot's generated behavior in a manner similar to human observer.
no code implementations • 21 Dec 2023 • Siddhant Bhambri, Mudit Verma, Anil Murthy, Subbarao Kambhampati
We introduce the notion of Human-Flexibility, i. e. whether the human partner is amenable to multiple team strategies, with a special case being Specified Orchestration where the human has a single team policy in mind (most constrained case).
no code implementations • 17 Feb 2023 • Mudit Verma, Siddhant Bhambri, Subbarao Kambhampati
Preference Based Reinforcement Learning has shown much promise for utilizing human binary feedback on queried trajectory pairs to recover the underlying reward model of the Human in the Loop (HiL).
no code implementations • 15 Nov 2022 • Siddhant Bhambri, Amrita Bhattacharjee, Dimitri Bertsekas
In this paper we address the solution of the popular Wordle puzzle, using new reinforcement learning methods, which apply more generally to adaptive control of dynamic systems and to classes of Partially Observable Markov Decision Process (POMDP) problems.
1 code implementation • 2 Apr 2021 • Yantian Zha, Siddhant Bhambri, Lin Guan
In this work, our goal is instead to fill the gap between affordance discovery and affordance-based policy learning by integrating the two objectives in an end-to-end imitation learning framework based on deep neural networks.
no code implementations • 29 Sep 2020 • Saurabh Gupta, Siddhant Bhambri, Karan Dhingra, Arun Balaji Buduru, Ponnurangam Kumaraguru
We experiment on real-world smart home data, and show that the multi-objective approaches: i) establish trade-off between the two objectives, ii) achieve better combined user satisfaction and power consumption than single-objective approaches.
no code implementations • 6 Dec 2019 • Mudit Verma, Siddhant Bhambri, Saurabh Gupta, Arun Balaji Buduru
Rapid advancements in the Internet of Things (IoT) have facilitated more efficient deployment of smart environment solutions for specific user requirement.
no code implementations • 3 Dec 2019 • Siddhant Bhambri, Sumanyu Muku, Avinash Tulasi, Arun Balaji Buduru
Machine learning has seen tremendous advances in the past few years, which has lead to deep learning models being deployed in varied applications of day-to-day life.