Search Results for author: Karthik Valmeekam

Found 8 papers, 2 papers with code

On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks

no code implementations12 Feb 2024 Kaya Stechly, Karthik Valmeekam, Subbarao Kambhampati

While the initial optimism that reasoning might emerge automatically with scale has been tempered thanks to a slew of counterexamples--ranging from multiplication to simple planning--there persists a wide spread belief that LLMs can self-critique and improve their own solutions in an iterative fashion.

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

no code implementations2 Feb 2024 Subbarao Kambhampati, Karthik Valmeekam, Lin Guan, Kaya Stechly, Mudit Verma, Siddhant Bhambri, Lucas Saldyt, Anil Murthy

On the other side are perhaps over-pessimistic claims that all that LLMs are good for in planning/reasoning tasks are as mere translators of the problem specification from one syntactic format to another, and ship the problem off to external symbolic solvers.

Can Large Language Models Really Improve by Self-critiquing Their Own Plans?

no code implementations12 Oct 2023 Karthik Valmeekam, Matthew Marquez, Subbarao Kambhampati

We evaluate a planning system that employs LLMs for both plan generation and verification.

On the Planning Abilities of Large Language Models : A Critical Investigation

2 code implementations25 May 2023 Karthik Valmeekam, Matthew Marquez, Sarath Sreedharan, Subbarao Kambhampati

We aim to evaluate (1) the effectiveness of LLMs in generating plans autonomously in commonsense planning tasks and (2) the potential of LLMs in LLM-Modulo settings where they act as a source of heuristic guidance for external planners and verifiers.

Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences

no code implementations28 Oct 2022 Lin Guan, Karthik Valmeekam, Subbarao Kambhampati

We propose two practical methods that can learn to model any kind of behavioral attributes from ordered behavior clips.

Cannot find the paper you are looking for? You can Submit a new open access paper.