1 code implementation • 6 Mar 2024 • Adithya Bhaskar, Dan Friedman, Danqi Chen
Instead of finding competing subnetworks, we find that all subnetworks -- whether they generalize or not -- share a set of attention heads, which we refer to as the heuristic core.
1 code implementation • 21 Feb 2024 • Tianyu Gao, ZiRui Wang, Adithya Bhaskar, Danqi Chen
An emerging family of language models (LMs), capable of processing both text and images within a single visual view, has the promise to unlock complex tasks such as chart understanding and UI navigation.
1 code implementation • 20 Oct 2023 • Adithya Bhaskar, Tushar Tomar, Ashutosh Sathe, Sunita Sarawagi
Research in Text-to-SQL conversion has been largely benchmarked against datasets where each text query corresponds to one correct SQL.
1 code implementation • 29 Nov 2022 • Adithya Bhaskar, Alexander R. Fabbri, Greg Durrett
Large language models have shown impressive performance across a wide variety of tasks, including text summarization.