1 code implementation • 18 Dec 2023 • Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer
LLM chains enable complex tasks by decomposing work into a sequence of sub-tasks.
no code implementations • 13 Dec 2022 • Helena Vasconcelos, Matthew Jörke, Madeleine Grunde-McLaughlin, Tobias Gerstenberg, Michael Bernstein, Ranjay Krishna
Prior work has identified a resilient phenomenon that threatens the performance of human-AI decision-making teams: overreliance, when people agree with an AI, even when it is incorrect.
no code implementations • CVPR 2022 • Mona Gandhi, Mustafa Omer Gul, Eva Prakash, Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala
Recent video question answering benchmarks indicate that state-of-the-art models struggle to answer compositional questions.
no code implementations • 12 Apr 2022 • Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala
Prior benchmarks have analyzed models' answers to questions about videos in order to measure visual compositional reasoning.
no code implementations • CVPR 2021 • Madeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala
AGQA contains $192M$ unbalanced question answer pairs for $9. 6K$ videos.