Search Results for author: Karthik Valmeekam

Found 8 papers, 2 papers with code

On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks

no code implementations • 12 Feb 2024 • Kaya Stechly, Karthik Valmeekam, Subbarao Kambhampati

While the initial optimism that reasoning might emerge automatically with scale has been tempered thanks to a slew of counterexamples, ranging from multiplication to simple planning, there persists a widespread belief that LLMs can self-critique and improve their own solutions in an iterative fashion.

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

no code implementations • 2 Feb 2024 • Subbarao Kambhampati, Karthik Valmeekam, Lin Guan, Kaya Stechly, Mudit Verma, Siddhant Bhambri, Lucas Saldyt, Anil Murthy

On the other side are perhaps over-pessimistic claims that all LLMs are good for in planning/reasoning tasks is to act as mere translators of the problem specification from one syntactic format to another and to ship the problem off to external symbolic solvers.

Can Large Language Models Really Improve by Self-critiquing Their Own Plans?

no code implementations • 12 Oct 2023 • Karthik Valmeekam, Matthew Marquez, Subbarao Kambhampati

We evaluate a planning system that employs LLMs for both plan generation and verification.

On the Planning Abilities of Large Language Models: A Critical Investigation

2 code implementations • 25 May 2023 • Karthik Valmeekam, Matthew Marquez, Sarath Sreedharan, Subbarao Kambhampati

We aim to evaluate (1) the effectiveness of LLMs in generating plans autonomously in commonsense planning tasks and (2) the potential of LLMs in LLM-Modulo settings where they act as a source of heuristic guidance for external planners and verifiers.
