no code implementations • 11 Oct 2024 • Rushang Karia, Daniel Bramblett, Daksh Dobhal, Siddharth Srivastava
This paper presents $\forall$uto$\exists$$\lor\!\land$L, a novel benchmark for scaling Large Language Model (LLM) assessment in formal tasks with clear notions of correctness, such as truth maintenance in translation and logical reasoning.
1 code implementation • 27 Mar 2024 • Rushang Karia, Daniel Bramblett, Daksh Dobhal, Pulkit Verma, Siddharth Srivastava
This paper presents $\forall$uto$\exists$val, a new approach for scaling LLM assessment in translating formal syntax -- such as first-order logic, regular expressions, etc. -- to natural language (interpretation) or vice versa (compilation), thereby facilitating their use in applications such as generating/explaining logic and control flow for programs.
1 code implementation • 13 Feb 2024 • Rushang Karia, Pulkit Verma, Alberto Speranzon, Siddharth Srivastava
This paper introduces a new approach for continual planning and model learning in relational, non-stationary stochastic environments.
1 code implementation • 7 Jun 2023 • Pulkit Verma, Rushang Karia, Siddharth Srivastava
It is essential for users to understand what their AI systems can and can't do in order to use them safely.
no code implementations • 27 Apr 2022 • Rushang Karia, Siddharth Srivastava
Reinforcement learning in problems with symbolic state spaces is challenging due to the need for reasoning over long horizons.
10 code implementations • 16 Apr 2022 • Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, Pulkit Verma, Ravsehaj Singh Puri, Rushang Karia, Shailaja Keyur Sampat, Savan Doshi, Siddhartha Mishra, Sujan Reddy, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi, Daniel Khashabi
This large and diverse collection of tasks enables rigorous benchmarking of cross-task generalization under instructions -- training models to follow instructions on a subset of tasks and evaluating them on the remaining unseen ones.
1 code implementation • 8 Apr 2022 • Rushang Karia, Rashmeet Kaur Nayyar, Siddharth Srivastava
Several goal-oriented problems in the real world can be naturally expressed as Stochastic Shortest Path Problems (SSPs).
no code implementations • 10 Jul 2020 • Rushang Karia, Siddharth Srivastava
Computing goal-directed behavior is essential to designing efficient AI systems.