no code implementations • 13 Feb 2023 • Karthik Valmeekam, Sarath Sreedharan, Matthew Marquez, Alberto Olmo, Subbarao Kambhampati
On this benchmark, we evaluate LLMs in three modes: autonomous, heuristic and human-in-the-loop.
2 code implementations • NeurIPS 2023 • Karthik Valmeekam, Matthew Marquez, Alberto Olmo, Sarath Sreedharan, Subbarao Kambhampati
PlanBench provides sufficient diversity in both the task domains and the specific planning capabilities.
1 code implementation • 14 Jun 2021 • Alberto Olmo, Sarath Sreedharan, Subbarao Kambhampati
Operations in many essential industries including finance and banking are often characterized by the need to perform repetitive sequential tasks.
no code implementations • 26 Jun 2020 • Alberto Olmo, Sailik Sengupta, Subbarao Kambhampati
classifying the image of a dog to an airplane) can perplex humans and result in the loss of human trust in the system.
no code implementations • 26 Jan 2020 • Niharika Jain, Alberto Olmo, Sailik Sengupta, Lydia Manikonda, Subbarao Kambhampati
In this paper, we show that popular Generative Adversarial Networks (GANs) exacerbate biases along the axes of gender and skin tone when given a skewed distribution of face-shots.
no code implementations • 17 Mar 2019 • Sarath Sreedharan, Alberto Olmo, Aditya Prasad Mishra, Subbarao Kambhampati
One such approach has been the idea of {\em explanation as model-reconciliation}.