Search Results for author: Christopher Potts

Found 90 papers, 61 papers with code

Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP

2 code implementations 28 Dec 2022 Omar Khattab, Keshav Santhanam, Xiang Lisa Li, David Hall, Percy Liang, Christopher Potts, Matei Zaharia

Retrieval-augmented in-context learning has emerged as a powerful approach for addressing knowledge-intensive tasks using frozen language models (LM) and retrieval models (RM).

In-Context Learning Language Modelling +2
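The DSP pipeline composes a frozen retrieval model with a frozen language model. A toy sketch of that retrieve-then-read pattern (illustrative stubs only; this is not the actual DSP library API, and the corpus and scoring function are made up):

```python
# Toy retrieval-augmented pipeline in the spirit of DSP (stubs only).
corpus = [
    "ColBERT is a late interaction retrieval model.",
    "The Eiffel Tower is in Paris.",
    "DSP composes a frozen language model with a retrieval model.",
]

def retrieve(query, k=1):
    """Toy RM: rank passages by word overlap with the query."""
    q = set(query.lower().replace(".", "").split())
    scored = sorted(
        corpus,
        key=lambda p: -len(q & set(p.lower().replace(".", "").split())),
    )
    return scored[:k]

def answer(query):
    """Toy frozen-LM step: condition a prompt on the retrieved context."""
    context = retrieve(query)[0]
    return f"Context: {context}\nQuestion: {query}\nAnswer:"
```

In the real system, `retrieve` would be a learned retriever and the prompt would be completed by a language model; here the point is only the composition of the two frozen components.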

DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines

1 code implementation 20 Dec 2023 Arnav Singhvi, Manish Shetty, Shangyin Tan, Christopher Potts, Koushik Sen, Matei Zaharia, Omar Khattab

We integrate our constructs into the recent DSPy programming model for LMs, and present new strategies that allow DSPy to compile programs with LM Assertions into more reliable and accurate systems.

Language Modelling Prompt Engineering +2

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

3 code implementations9 Jun 2022 Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, Ambrose Slone, Ameet Rahane, Anantharaman S. Iyer, Anders Andreassen, Andrea Madotto, Andrea Santilli, Andreas Stuhlmüller, Andrew Dai, Andrew La, Andrew Lampinen, Andy Zou, Angela Jiang, Angelica Chen, Anh Vuong, Animesh Gupta, Anna Gottardi, Antonio Norelli, Anu Venkatesh, Arash Gholamidavoodi, Arfa Tabassum, Arul Menezes, Arun Kirubarajan, Asher Mullokandov, Ashish Sabharwal, Austin Herrick, Avia Efrat, Aykut Erdem, Ayla Karakaş, B. Ryan Roberts, Bao Sheng Loe, Barret Zoph, Bartłomiej Bojanowski, Batuhan Özyurt, Behnam Hedayatnia, Behnam Neyshabur, Benjamin Inden, Benno Stein, Berk Ekmekci, Bill Yuchen Lin, Blake Howald, Bryan Orinion, Cameron Diao, Cameron Dour, Catherine Stinson, Cedrick Argueta, César Ferri Ramírez, Chandan Singh, Charles Rathkopf, Chenlin Meng, Chitta Baral, Chiyu Wu, Chris Callison-Burch, Chris Waites, Christian Voigt, Christopher D. Manning, Christopher Potts, Cindy Ramirez, Clara E. 
Rivera, Clemencia Siro, Colin Raffel, Courtney Ashcraft, Cristina Garbacea, Damien Sileo, Dan Garrette, Dan Hendrycks, Dan Kilman, Dan Roth, Daniel Freeman, Daniel Khashabi, Daniel Levy, Daniel Moseguí González, Danielle Perszyk, Danny Hernandez, Danqi Chen, Daphne Ippolito, Dar Gilboa, David Dohan, David Drakard, David Jurgens, Debajyoti Datta, Deep Ganguli, Denis Emelin, Denis Kleyko, Deniz Yuret, Derek Chen, Derek Tam, Dieuwke Hupkes, Diganta Misra, Dilyar Buzan, Dimitri Coelho Mollo, Diyi Yang, Dong-Ho Lee, Dylan Schrader, Ekaterina Shutova, Ekin Dogus Cubuk, Elad Segal, Eleanor Hagerman, Elizabeth Barnes, Elizabeth Donoway, Ellie Pavlick, Emanuele Rodola, Emma Lam, Eric Chu, Eric Tang, Erkut Erdem, Ernie Chang, Ethan A. Chi, Ethan Dyer, Ethan Jerzak, Ethan Kim, Eunice Engefu Manyasi, Evgenii Zheltonozhskii, Fanyue Xia, Fatemeh Siar, Fernando Martínez-Plumed, Francesca Happé, Francois Chollet, Frieda Rong, Gaurav Mishra, Genta Indra Winata, Gerard de Melo, Germán Kruszewski, Giambattista Parascandolo, Giorgio Mariani, Gloria Wang, Gonzalo Jaimovitch-López, Gregor Betz, Guy Gur-Ari, Hana Galijasevic, Hannah Kim, Hannah Rashkin, Hannaneh Hajishirzi, Harsh Mehta, Hayden Bogar, Henry Shevlin, Hinrich Schütze, Hiromu Yakura, Hongming Zhang, Hugh Mee Wong, Ian Ng, Isaac Noble, Jaap Jumelet, Jack Geissinger, Jackson Kernion, Jacob Hilton, Jaehoon Lee, Jaime Fernández Fisac, James B. Simon, James Koppel, James Zheng, James Zou, Jan Kocoń, Jana Thompson, Janelle Wingfield, Jared Kaplan, Jarema Radom, Jascha Sohl-Dickstein, Jason Phang, Jason Wei, Jason Yosinski, Jekaterina Novikova, Jelle Bosscher, Jennifer Marsh, Jeremy Kim, Jeroen Taal, Jesse Engel, Jesujoba Alabi, Jiacheng Xu, Jiaming Song, Jillian Tang, Joan Waweru, John Burden, John Miller, John U. Balis, Jonathan Batchelder, Jonathan Berant, Jörg Frohberg, Jos Rozen, Jose Hernandez-Orallo, Joseph Boudeman, Joseph Guerr, Joseph Jones, Joshua B. Tenenbaum, Joshua S. 
Rule, Joyce Chua, Kamil Kanclerz, Karen Livescu, Karl Krauth, Karthik Gopalakrishnan, Katerina Ignatyeva, Katja Markert, Kaustubh D. Dhole, Kevin Gimpel, Kevin Omondi, Kory Mathewson, Kristen Chiafullo, Ksenia Shkaruta, Kumar Shridhar, Kyle McDonell, Kyle Richardson, Laria Reynolds, Leo Gao, Li Zhang, Liam Dugan, Lianhui Qin, Lidia Contreras-Ochando, Louis-Philippe Morency, Luca Moschella, Lucas Lam, Lucy Noble, Ludwig Schmidt, Luheng He, Luis Oliveros Colón, Luke Metz, Lütfi Kerem Şenel, Maarten Bosma, Maarten Sap, Maartje ter Hoeve, Maheen Farooqi, Manaal Faruqui, Mantas Mazeika, Marco Baturan, Marco Marelli, Marco Maru, Maria Jose Ramírez Quintana, Marie Tolkiehn, Mario Giulianelli, Martha Lewis, Martin Potthast, Matthew L. Leavitt, Matthias Hagen, Mátyás Schubert, Medina Orduna Baitemirova, Melody Arnaud, Melvin McElrath, Michael A. Yee, Michael Cohen, Michael Gu, Michael Ivanitskiy, Michael Starritt, Michael Strube, Michał Swędrowski, Michele Bevilacqua, Michihiro Yasunaga, Mihir Kale, Mike Cain, Mimee Xu, Mirac Suzgun, Mitch Walker, Mo Tiwari, Mohit Bansal, Moin Aminnaseri, Mor Geva, Mozhdeh Gheini, Mukund Varma T, Nanyun Peng, Nathan A. Chi, Nayeon Lee, Neta Gur-Ari Krakover, Nicholas Cameron, Nicholas Roberts, Nick Doiron, Nicole Martinez, Nikita Nangia, Niklas Deckers, Niklas Muennighoff, Nitish Shirish Keskar, Niveditha S. Iyer, Noah Constant, Noah Fiedel, Nuan Wen, Oliver Zhang, Omar Agha, Omar Elbaghdadi, Omer Levy, Owain Evans, Pablo Antonio Moreno Casares, Parth Doshi, Pascale Fung, Paul Pu Liang, Paul Vicol, Pegah Alipoormolabashi, Peiyuan Liao, Percy Liang, Peter Chang, Peter Eckersley, Phu Mon Htut, Pinyu Hwang, Piotr Miłkowski, Piyush Patil, Pouya Pezeshkpour, Priti Oli, Qiaozhu Mei, Qing Lyu, Qinlang Chen, Rabin Banjade, Rachel Etta Rudolph, Raefer Gabriel, Rahel Habacker, Ramon Risco, Raphaël Millière, Rhythm Garg, Richard Barnes, Rif A. 
Saurous, Riku Arakawa, Robbe Raymaekers, Robert Frank, Rohan Sikand, Roman Novak, Roman Sitelew, Ronan LeBras, Rosanne Liu, Rowan Jacobs, Rui Zhang, Ruslan Salakhutdinov, Ryan Chi, Ryan Lee, Ryan Stovall, Ryan Teehan, Rylan Yang, Sahib Singh, Saif M. Mohammad, Sajant Anand, Sam Dillavou, Sam Shleifer, Sam Wiseman, Samuel Gruetter, Samuel R. Bowman, Samuel S. Schoenholz, Sanghyun Han, Sanjeev Kwatra, Sarah A. Rous, Sarik Ghazarian, Sayan Ghosh, Sean Casey, Sebastian Bischoff, Sebastian Gehrmann, Sebastian Schuster, Sepideh Sadeghi, Shadi Hamdan, Sharon Zhou, Shashank Srivastava, Sherry Shi, Shikhar Singh, Shima Asaadi, Shixiang Shane Gu, Shubh Pachchigar, Shubham Toshniwal, Shyam Upadhyay, Shyamolima, Debnath, Siamak Shakeri, Simon Thormeyer, Simone Melzi, Siva Reddy, Sneha Priscilla Makini, Soo-Hwan Lee, Spencer Torene, Sriharsha Hatwar, Stanislas Dehaene, Stefan Divic, Stefano Ermon, Stella Biderman, Stephanie Lin, Stephen Prasad, Steven T. Piantadosi, Stuart M. Shieber, Summer Misherghi, Svetlana Kiritchenko, Swaroop Mishra, Tal Linzen, Tal Schuster, Tao Li, Tao Yu, Tariq Ali, Tatsu Hashimoto, Te-Lin Wu, Théo Desbordes, Theodore Rothschild, Thomas Phan, Tianle Wang, Tiberius Nkinyili, Timo Schick, Timofei Kornev, Titus Tunduny, Tobias Gerstenberg, Trenton Chang, Trishala Neeraj, Tushar Khot, Tyler Shultz, Uri Shaham, Vedant Misra, Vera Demberg, Victoria Nyamai, Vikas Raunak, Vinay Ramasesh, Vinay Uday Prabhu, Vishakh Padmakumar, Vivek Srikumar, William Fedus, William Saunders, William Zhang, Wout Vossen, Xiang Ren, Xiaoyu Tong, Xinran Zhao, Xinyi Wu, Xudong Shen, Yadollah Yaghoobzadeh, Yair Lakretz, Yangqiu Song, Yasaman Bahri, Yejin Choi, Yichi Yang, Yiding Hao, Yifu Chen, Yonatan Belinkov, Yu Hou, Yufang Hou, Yuntao Bai, Zachary Seid, Zhuoye Zhao, Zijian Wang, Zijie J. Wang, ZiRui Wang, Ziyi Wu

BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models.

Common Sense Reasoning Math +1

Relevance-guided Supervision for OpenQA with ColBERT

5 code implementations 1 Jul 2020 Omar Khattab, Christopher Potts, Matei Zaharia

In much recent work, the retriever is a learned component that uses coarse-grained vector representations of questions and passages.

Natural Questions Open-Domain Question Answering +2

Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval

2 code implementations NeurIPS 2021 Omar Khattab, Christopher Potts, Matei Zaharia

Multi-hop reasoning (i.e., reasoning across two or more documents) is a key ingredient for NLP models that leverage large corpora to exhibit broad knowledge.

Claim Verification Question Answering +1

PLAID: An Efficient Engine for Late Interaction Retrieval

1 code implementation 19 May 2022 Keshav Santhanam, Omar Khattab, Christopher Potts, Matei Zaharia

PLAID uses centroid interaction as well as centroid pruning, a mechanism for sparsifying the bag of centroids, within a highly optimized engine to reduce late interaction search latency by up to 7× on a GPU and 45× on a CPU against vanilla ColBERTv2, while continuing to deliver state-of-the-art retrieval quality.

Information Retrieval Retrieval
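The centroid-pruning step described above can be sketched in a few lines of numpy. This is an illustrative toy only: the shapes and threshold are made up, and the real PLAID engine works over compressed residuals with optimized kernels.

```python
import numpy as np

# Toy sketch of centroid pruning: score every centroid against every
# query token (centroid interaction), then drop centroids whose best
# score falls below a threshold before late interaction search.
rng = np.random.default_rng(0)
centroids = rng.normal(size=(32, 8))   # 32 centroids, embedding dim 8
queries = rng.normal(size=(4, 8))      # 4 query token embeddings

scores = queries @ centroids.T         # (4, 32) centroid interaction scores
best = scores.max(axis=0)              # best score per centroid
keep = best >= np.quantile(best, 0.75) # prune the bottom 75% of centroids

pruned = centroids[keep]               # surviving candidates
```

Sparsifying the candidate set this way is what lets the engine skip most of the index before the exact (and expensive) late interaction scoring.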

On the Opportunities and Risks of Foundation Models

2 code implementations16 Aug 2021 Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, Li Fei-Fei, Chelsea Finn, Trevor Gale, Lauren Gillespie, Karan Goel, Noah Goodman, Shelby Grossman, Neel Guha, Tatsunori Hashimoto, Peter Henderson, John Hewitt, Daniel E. Ho, Jenny Hong, Kyle Hsu, Jing Huang, Thomas Icard, Saahil Jain, Dan Jurafsky, Pratyusha Kalluri, Siddharth Karamcheti, Geoff Keeling, Fereshte Khani, Omar Khattab, Pang Wei Koh, Mark Krass, Ranjay Krishna, Rohith Kuditipudi, Ananya Kumar, Faisal Ladhak, Mina Lee, Tony Lee, Jure Leskovec, Isabelle Levent, Xiang Lisa Li, Xuechen Li, Tengyu Ma, Ali Malik, Christopher D. Manning, Suvir Mirchandani, Eric Mitchell, Zanele Munyikwa, Suraj Nair, Avanika Narayan, Deepak Narayanan, Ben Newman, Allen Nie, Juan Carlos Niebles, Hamed Nilforoshan, Julian Nyarko, Giray Ogut, Laurel Orr, Isabel Papadimitriou, Joon Sung Park, Chris Piech, Eva Portelance, Christopher Potts, aditi raghunathan, Rob Reich, Hongyu Ren, Frieda Rong, Yusuf Roohani, Camilo Ruiz, Jack Ryan, Christopher Ré, Dorsa Sadigh, Shiori Sagawa, Keshav Santhanam, Andy Shih, Krishnan Srinivasan, Alex Tamkin, Rohan Taori, Armin W. Thomas, Florian Tramèr, Rose E. Wang, William Wang, Bohan Wu, Jiajun Wu, Yuhuai Wu, Sang Michael Xie, Michihiro Yasunaga, Jiaxuan You, Matei Zaharia, Michael Zhang, Tianyi Zhang, Xikun Zhang, Yuhui Zhang, Lucia Zheng, Kaitlyn Zhou, Percy Liang

AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks.

Transfer Learning

pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

3 code implementations 12 Mar 2024 Zhengxuan Wu, Atticus Geiger, Aryaman Arora, Jing Huang, Zheng Wang, Noah D. Goodman, Christopher D. Manning, Christopher Potts

Interventions on model-internal states are fundamental operations in many areas of AI, including model editing, steering, robustness, and interpretability.

Model Editing

ReFT: Representation Finetuning for Language Models

2 code implementations 4 Apr 2024 Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D. Manning, Christopher Potts

LoReFT is a drop-in replacement for existing PEFTs and learns interventions that are 10x-50x more parameter-efficient than prior state-of-the-art PEFTs.

Arithmetic Reasoning

Interpretability at Scale: Identifying Causal Mechanisms in Alpaca

1 code implementation NeurIPS 2023 Zhengxuan Wu, Atticus Geiger, Thomas Icard, Christopher Potts, Noah D. Goodman

With Boundless DAS, we discover that Alpaca does this by implementing a causal model with two interpretable boolean variables.

In-Context Learning for Extreme Multi-Label Classification

2 code implementations 22 Jan 2024 Karel D'Oosterlinck, Omar Khattab, François Remy, Thomas Demeester, Chris Develder, Christopher Potts

Multi-label classification problems with thousands of classes are hard to solve with in-context learning alone, as language models (LMs) might lack prior knowledge about the precise classes or how to assign them, and it is generally infeasible to demonstrate every class in a prompt.

Classification Extreme Multi-Label Classification +2

ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems

1 code implementation 16 Nov 2023 Jon Saad-Falcon, Omar Khattab, Christopher Potts, Matei Zaharia

Evaluating retrieval-augmented generation (RAG) systems traditionally relies on hand annotations for input queries, passages to retrieve, and responses to generate.

Retrieval

Mittens: An Extension of GloVe for Learning Domain-Specialized Representations

1 code implementation NAACL 2018 Nicholas Dingwall, Christopher Potts

We present a simple extension of the GloVe representation learning model that begins with general-purpose representations and updates them based on data from a specialized domain.

Representation Learning
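The Mittens extension can be pictured as a GloVe-style least-squares objective plus a penalty mu * sum_i ||w_i - r_i||^2 that keeps the specialized vectors near their pretrained counterparts. A toy gradient-descent sketch of that idea (not the released package; sizes, hyperparameters, and the simplified single-embedding-matrix objective are all made up for illustration):

```python
import numpy as np

# Toy Mittens-style retrofitting: fit log co-occurrences while staying
# close to pretrained vectors R via an L2 penalty weighted by mu.
rng = np.random.default_rng(0)
n, d, mu, lr = 4, 5, 0.1, 1e-3
logX = np.log(np.abs(rng.normal(size=(n, n))) + 1.0)  # toy log co-occurrences
R = rng.normal(size=(n, d))              # pretrained general-purpose vectors
W = R + 0.5 * rng.normal(size=(n, d))    # vectors being specialized
b = np.zeros(n)                          # shared bias terms

def loss(W, b):
    err = W @ W.T + b[:, None] + b[None, :] - logX
    return np.sum(err ** 2) + mu * np.sum((W - R) ** 2)

start = loss(W, b)
for _ in range(300):
    err = W @ W.T + b[:, None] + b[None, :] - logX
    W -= lr * (2 * (err + err.T) @ W + 2 * mu * (W - R))  # analytic gradient
    b -= lr * 2 * (err.sum(axis=0) + err.sum(axis=1))
```

Setting mu = 0 recovers a plain GloVe-style fit; larger mu trades domain fit for closeness to the general-purpose representations.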

Retrofitting Distributional Embeddings to Knowledge Graphs with Functional Relations

1 code implementation COLING 2018 Benjamin J. Lengerich, Andrew L. Maas, Christopher Potts

Knowledge graphs are a versatile framework to encode richly structured data relationships, but it can be challenging to combine these graphs with unstructured data.

DynaSent: A Dynamic Benchmark for Sentiment Analysis

1 code implementation ACL 2021 Christopher Potts, Zhengxuan Wu, Atticus Geiger, Douwe Kiela

We introduce DynaSent ('Dynamic Sentiment'), a new English-language benchmark task for ternary (positive/negative/neutral) sentiment analysis.

Sentiment Analysis

Tree-structured composition in neural networks without tree-structured architectures

1 code implementation 16 Jun 2015 Samuel R. Bowman, Christopher D. Manning, Christopher Potts

We hypothesize that neural sequence models like LSTMs are in fact able to discover and implicitly use recursive compositional structure, at least for tasks with clear cues to that structure in the data.

Sentence

MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

2 code implementations 24 May 2023 Zexuan Zhong, Zhengxuan Wu, Christopher D. Manning, Christopher Potts, Danqi Chen

The information stored in large language models (LLMs) falls out of date quickly, and retraining from scratch is often not an option.

knowledge editing Language Modelling +2

A large annotated corpus for learning natural language inference

3 code implementations EMNLP 2015 Samuel R. Bowman, Gabor Angeli, Christopher Potts, Christopher D. Manning

Understanding entailment and contradiction is fundamental to understanding natural language, and inference about entailment and contradiction is a valuable testing ground for the development of semantic representations.

Image Captioning Natural Language Inference +1

BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance

1 code implementation 22 May 2023 Karel D'Oosterlinck, François Remy, Johannes Deleu, Thomas Demeester, Chris Develder, Klim Zaporojets, Aneiss Ghodsi, Simon Ellershaw, Jack Collins, Christopher Potts

We introduce BioDEX, a large-scale resource for Biomedical adverse Drug Event Extraction, rooted in the historical output of drug safety reporting in the U.S. BioDEX consists of 65k abstracts and 19k full-text biomedical papers with 256k associated document-level safety reports created by medical experts.

Event Extraction

I am a Strange Dataset: Metalinguistic Tests for Language Models

1 code implementation 10 Jan 2024 Tristan Thrush, Jared Moore, Miguel Monares, Christopher Potts, Douwe Kiela

We also provide minimally different metalinguistic non-self-reference examples to complement the main dataset by probing for whether models can handle metalinguistic language at all.

Sentence

ReaSCAN: Compositional Reasoning in Language Grounding

3 code implementations 18 Sep 2021 Zhengxuan Wu, Elisa Kreiss, Desmond C. Ong, Christopher Potts

The ability to compositionally map language to referents, relations, and actions is an essential component of language understanding.

Inducing Causal Structure for Interpretable Neural Networks

2 code implementations 1 Dec 2021 Atticus Geiger, Zhengxuan Wu, Hanson Lu, Josh Rozner, Elisa Kreiss, Thomas Icard, Noah D. Goodman, Christopher Potts

In IIT, we (1) align variables in a causal model (e.g., a deterministic program or Bayesian network) with representations in a neural model and (2) train the neural model to match the counterfactual behavior of the causal model on a base input when aligned representations in both models are set to be the value they would be for a source input.

counterfactual Data Augmentation +1
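The intervention at the heart of IIT can be shown on a toy "network" (illustrative only; the paper trains a neural model against this counterfactual signal, while this sketch just performs the swap by hand). The program computes y = (x0 + x1) * x2, and we align hidden unit h0 with the high-level variable x0 + x1:

```python
import numpy as np

# Toy interchange intervention: run the model on a base input while
# forcing one hidden unit to the value it takes on a source input.
def layer1(x):
    return np.array([x[0] + x[1], x[2]])   # h = (x0 + x1, x2)

def layer2(h):
    return h[0] * h[1]                     # y = h0 * h1

base = np.array([1.0, 2.0, 3.0])           # normally y = (1+2)*3 = 9
source = np.array([5.0, 5.0, 3.0])         # x0 + x1 = 10 under source

h = layer1(base)
h[0] = layer1(source)[0]                   # intervene on the aligned unit
y = layer2(h)                              # counterfactual output: 10*3 = 30
```

If the alignment is faithful, the intervened model's output matches what the high-level causal model predicts for the same counterfactual, which is exactly the behavior IIT trains toward.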

CausalGym: Benchmarking causal interpretability methods on linguistic tasks

1 code implementation 19 Feb 2024 Aryaman Arora, Dan Jurafsky, Christopher Potts

Language models (LMs) have proven to be powerful tools for psycholinguistic research, but most prior work has focused on purely behavioural measures (e.g., surprisal comparisons).

Benchmarking Interpretability Techniques for Deep Learning

Concadia: Towards Image-Based Text Generation with a Purpose

1 code implementation 16 Apr 2021 Elisa Kreiss, Fei Fang, Noah D. Goodman, Christopher Potts

Current deep learning models often achieve excellent results on benchmark image-to-text datasets but fail to generate texts that are useful in practice.

Image Captioning Text Generation

Psychologically-informed chain-of-thought prompts for metaphor understanding in large language models

1 code implementation 16 Sep 2022 Ben Prystawski, Paul Thibodeau, Christopher Potts, Noah D. Goodman

Probabilistic models of language understanding are valuable tools for investigating human language use.

Generating Bilingual Pragmatic Color References

1 code implementation NAACL 2018 Will Monroe, Jennifer Hu, Andrew Jong, Christopher Potts

Contextual influences on language often exhibit substantial cross-lingual regularities; for example, we are more verbose in situations that require finer distinctions.

Colors in Context: A Pragmatic Neural Model for Grounded Language Understanding

1 code implementation TACL 2017 Will Monroe, Robert X. D. Hawkins, Noah D. Goodman, Christopher Potts

We present a model of pragmatic referring expression interpretation in a grounded communication task (identifying colors from descriptions) that draws upon predictions from two recurrent neural network classifiers, a speaker and a listener, unified by a recursive pragmatic reasoning framework.

Referring Expression

Representing Social Media Users for Sarcasm Detection

1 code implementation EMNLP 2018 Y. Alex Kolchinski, Christopher Potts

We explore two methods for representing authors in the context of textual sarcasm detection: a Bayesian approach that directly represents authors' propensities to be sarcastic, and a dense embedding approach that can learn interactions between the author and the text.

Sarcasm Detection

Mission: Impossible Language Models

1 code implementation 12 Jan 2024 Julie Kallini, Isabel Papadimitriou, Richard Futrell, Kyle Mahowald, Christopher Potts

Chomsky and others have very directly claimed that large language models (LLMs) are equally capable of learning languages that are possible and impossible for humans to learn.

TalkDown: A Corpus for Condescension Detection in Context

1 code implementation IJCNLP 2019 Zijian Wang, Christopher Potts

Condescending language use is caustic; it can bring dialogues to an end and bifurcate communities.

Decrypting Cryptic Crosswords: Semantically Complex Wordplay Puzzles as a Target for NLP

1 code implementation NeurIPS 2021 Joshua Rozner, Christopher Potts, Kyle Mahowald

Cryptic crosswords, the dominant crossword variety in the UK, are a promising target for advancing NLP systems that seek to process semantically complex, highly compositional language.

Language Modelling

CAW-coref: Conjunction-Aware Word-level Coreference Resolution

1 code implementation 9 Oct 2023 Karel D'Oosterlinck, Semere Kiros Bitew, Brandon Papineau, Christopher Potts, Thomas Demeester, Chris Develder

State-of-the-art coreference resolution systems depend on multiple LLM calls per document and are thus prohibitively expensive for many use cases (e.g., information extraction with large corpora).

coreference-resolution

Causal Abstractions of Neural Networks

1 code implementation NeurIPS 2021 Atticus Geiger, Hanson Lu, Thomas Icard, Christopher Potts

Structural analysis methods (e.g., probing and feature attribution) are increasingly important tools for neural network analysis.

Natural Language Inference

GIO: Gradient Information Optimization for Training Dataset Selection

1 code implementation 20 Jun 2023 Dante Everaert, Christopher Potts

It is often advantageous to train models on a subset of the available train examples, because the examples are of variable quality or because one would like to train with fewer examples, without sacrificing performance.

Machine Translation Spelling Correction

Building Efficient and Effective OpenQA Systems for Low-Resource Languages

1 code implementation 7 Jan 2024 Emrah Budur, Rıza Özçelik, Dilara Soylu, Omar Khattab, Tunga Güngör, Christopher Potts

We present SQuAD-TR, a machine translation of SQuAD2.0, and we build our OpenQA system by adapting ColBERT-QA for Turkish.

Machine Translation Question Answering

Communication-based Evaluation for Natural Language Generation

1 code implementation SCiL 2020 Benjamin Newman, Reuben Cohn-Gordon, Christopher Potts

Natural language generation (NLG) systems are commonly evaluated using n-gram overlap measures (e.g., BLEU, ROUGE).

Text Generation

ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning

1 code implementation 30 May 2023 Jingyuan Selena She, Christopher Potts, Samuel R. Bowman, Atticus Geiger

For in-context learning, we test InstructGPT models and find that most prompt strategies are not successful, including those using step-by-step reasoning.

Benchmarking In-Context Learning +3

A Reply to Makelov et al. (2023)'s "Interpretability Illusion" Arguments

1 code implementation 23 Jan 2024 Zhengxuan Wu, Atticus Geiger, Jing Huang, Aryaman Arora, Thomas Icard, Christopher Potts, Noah D. Goodman

We respond to the recent paper by Makelov et al. (2023), which reviews subspace interchange intervention methods like distributed alignment search (DAS; Geiger et al. 2023) and claims that these methods potentially cause "interpretability illusions".

Neural Natural Language Inference Models Partially Embed Theories of Lexical Entailment and Negation

1 code implementation EMNLP (BlackboxNLP) 2020 Atticus Geiger, Kyle Richardson, Christopher Potts

We address whether neural models for Natural Language Inference (NLI) can learn the compositional interactions between lexical entailment and negation, using four methods: the behavioral evaluation methods of (1) challenge test sets and (2) systematic generalization tasks, and the structural evaluation methods of (3) probes and (4) interventions.

Lexical Entailment Natural Language Inference +2

Pragmatic Issue-Sensitive Image Captioning

1 code implementation Findings of the Association for Computational Linguistics 2020 Allen Nie, Reuben Cohn-Gordon, Christopher Potts

Image captioning systems have recently improved dramatically, but they still tend to produce captions that are insensitive to the communicative goals that captions should meet.

Descriptive Image Captioning +2

Color Overmodification Emerges from Data-Driven Learning and Pragmatic Reasoning

1 code implementation 18 May 2022 Fei Fang, Kunal Sinha, Noah D. Goodman, Christopher Potts, Elisa Kreiss

It seems likely that these patterns are shaped by the environment a speaker is exposed to in complex ways.

Language Acquisition

ContextRef: Evaluating Referenceless Metrics For Image Description Generation

1 code implementation 21 Sep 2023 Elisa Kreiss, Eric Zelikman, Christopher Potts, Nick Haber

None of the methods is successful with ContextRef, but we show that careful fine-tuning yields substantial improvements.

Identifying the Limits of Cross-Domain Knowledge Transfer for Pretrained Models

1 code implementation RepL4NLP (ACL) 2022 Zhengxuan Wu, Nelson F. Liu, Christopher Potts

There is growing evidence that pretrained language models improve task-specific fine-tuning not just for the languages seen in pretraining, but also for new languages and even non-linguistic data.

Transfer Learning

CommVQA: Situating Visual Question Answering in Communicative Contexts

1 code implementation 22 Feb 2024 Nandita Shankar Naik, Christopher Potts, Elisa Kreiss

Current visual question answering (VQA) models tend to be trained and evaluated on image-question pairs in isolation.

Question Answering Visual Question Answering

Pragmatically Informative Image Captioning with Character-Level Inference

no code implementations NAACL 2018 Reuben Cohn-Gordon, Noah Goodman, Christopher Potts

We combine a neural image captioner with a Rational Speech Acts (RSA) model to make a system that is pragmatically informative: its objective is to produce captions that are not merely true but also distinguish their inputs from similar images.

Image Captioning Rolling Shutter Correction

On the Effective Use of Pretraining for Natural Language Inference

no code implementations 5 Oct 2017 Ignacio Cases, Minh-Thang Luong, Christopher Potts

Neural networks have excelled at many NLP tasks, but there remain open questions about the performance of pretrained distributed word representations and their interaction with weight initialization and other hyperparameters.

Natural Language Inference

Learning in the Rational Speech Acts Model

no code implementations 23 Oct 2015 Will Monroe, Christopher Potts

The Rational Speech Acts (RSA) model treats language use as a recursive process in which probabilistic speaker and listener agents reason about each other's intentions to enrich the literal semantics of their language along broadly Gricean lines.

Text Generation
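The RSA recursion described above is small enough to compute directly. A minimal sketch (illustrative; the paper studies learning within RSA, while this fixes a tiny hand-built lexicon, with rows as utterances and columns as world states):

```python
import numpy as np

# Minimal Rational Speech Acts recursion over a toy lexicon of
# literal truth values.
lexicon = np.array([
    [1.0, 1.0, 0.0],   # "hat": true of states 0 and 1
    [0.0, 1.0, 1.0],   # "glasses": true of states 1 and 2
    [1.0, 1.0, 1.0],   # "friend": true of every state
])
alpha = 1.0  # speaker rationality

def normalize(m, axis):
    return m / m.sum(axis=axis, keepdims=True)

L0 = normalize(lexicon, axis=1)      # literal listener P(state | utterance)
S1 = normalize(L0 ** alpha, axis=0)  # pragmatic speaker P(utterance | state)
L1 = normalize(S1, axis=1)           # pragmatic listener P(state | utterance)
```

Hearing "hat", the literal listener is at chance between states 0 and 1, but the pragmatic listener shifts probability toward state 0, the state for which "hat" is the most distinctive choice: the Gricean enrichment the model is built to capture.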

Text to 3D Scene Generation with Rich Lexical Grounding

no code implementations IJCNLP 2015 Angel Chang, Will Monroe, Manolis Savva, Christopher Potts, Christopher D. Manning

The ability to map descriptions of scenes to 3D geometric representations has many applications in areas such as art, education, and robotics.

Scene Generation Text to 3D

Recursive Neural Networks Can Learn Logical Semantics

no code implementations WS 2015 Samuel R. Bowman, Christopher Potts, Christopher D. Manning

Tree-structured recursive neural networks (TreeRNNs) for sentence meaning have been successful for many applications, but it remains an open question whether the fixed-length representations that they learn can support tasks as demanding as logical deduction.

Open-Ended Question Answering Relational Reasoning +2

Learning Distributed Word Representations for Natural Logic Reasoning

no code implementations 15 Oct 2014 Samuel R. Bowman, Christopher Potts, Christopher D. Manning

Natural logic offers a powerful relational conception of meaning that is a natural counterpart to distributed semantic representations, which have proven valuable in a wide range of sophisticated language tasks.

Logical Reasoning Open-Ended Question Answering +1

Exploiting Social Network Structure for Person-to-Person Sentiment Analysis

no code implementations TACL 2014 Robert West, Hristo S. Paskov, Jure Leskovec, Christopher Potts

Person-to-person evaluations are prevalent in all kinds of discourse and important for establishing reputations, building social bonds, and shaping public opinion.

Decision Making Sentiment Analysis

A case for deep learning in semantics

no code implementations 10 Sep 2018 Christopher Potts

Pater's target article builds a persuasive case for establishing stronger ties between theoretical linguistics and connectionism (deep learning).

An Incremental Iterated Response Model of Pragmatics

no code implementations WS 2019 Reuben Cohn-Gordon, Noah D. Goodman, Christopher Potts

Recent Iterated Response (IR) models of pragmatics conceptualize language use as a recursive process in which agents reason about each other to increase communicative efficiency.

Referring Expression Referring expression generation

Stress-Testing Neural Models of Natural Language Inference with Multiply-Quantified Sentences

no code implementations 30 Oct 2018 Atticus Geiger, Ignacio Cases, Lauri Karttunen, Christopher Potts

Standard evaluations of deep learning models for semantics using naturalistic corpora are limited in what they can tell us about the fidelity of the learned representations, because the corpora rarely come with good measures of semantic complexity.

Natural Language Inference

Effective Feature Representation for Clinical Text Concept Extraction

no code implementations WS 2019 Yifeng Tao, Bruno Godefroy, Guillaume Genthial, Christopher Potts

Crucial information about the practice of healthcare is recorded only in free-form text, which creates an enormous opportunity for high-impact NLP.

Modeling Drug-Disease Relations with Linguistic and Knowledge Graph Constraints

no code implementations 31 Mar 2019 Bruno Godefroy, Christopher Potts

FDA drug labels are rich sources of information about drugs and drug-disease relations, but their complexity makes them challenging texts to analyze in isolation.

Knowledge Graphs

Modeling Subjective Assessments of Guilt in Newspaper Crime Narratives

1 code implementation CONLL 2020 Elisa Kreiss, Zijian Wang, Christopher Potts

Crime reporting is a prevalent form of journalism with the power to shape public perceptions and social policies.

Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking

no code implementations NeurIPS 2021 Zhiyi Ma, Kawin Ethayarajh, Tristan Thrush, Somya Jain, Ledell Wu, Robin Jia, Christopher Potts, Adina Williams, Douwe Kiela

We introduce Dynaboard, an evaluation-as-a-service framework for hosting benchmarks and conducting holistic model comparison, integrated with the Dynabench platform.

Benchmarking

Hindsight: Posterior-guided training of retrievers for improved open-ended generation

no code implementations ICLR 2022 Ashwin Paranjape, Omar Khattab, Christopher Potts, Matei Zaharia, Christopher D. Manning

Many text generation systems benefit from using a retriever to retrieve passages from a textual knowledge corpus (e.g., Wikipedia) which are then provided as additional context to the generator.

Text Generation

Systematicity in GPT-3's Interpretation of Novel English Noun Compounds

no code implementations 18 Oct 2022 Siyan Li, Riley Carlson, Christopher Potts

However, this evidence is consistent with GPT-3 reasoning only about specific lexical items rather than the more abstract conceptual categories of Levin et al.'s theory.

Language Modelling Large Language Model

Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

no code implementations 2 Dec 2022 Keshav Santhanam, Jon Saad-Falcon, Martin Franz, Omar Khattab, Avirup Sil, Radu Florian, Md Arafat Sultan, Salim Roukos, Matei Zaharia, Christopher Potts

Neural information retrieval (IR) systems have progressed rapidly in recent years, in large part due to the release of publicly available benchmarking tasks.

Benchmarking Information Retrieval +1

Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

1 code implementation 19 Dec 2022 Jing Huang, Zhengxuan Wu, Kyle Mahowald, Christopher Potts

Language tasks involving character-level manipulations (e.g., spelling corrections, arithmetic operations, word games) are challenging for models operating on subword units.

Spelling Correction

Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations

no code implementations 5 Mar 2023 Atticus Geiger, Zhengxuan Wu, Christopher Potts, Thomas Icard, Noah D. Goodman

In DAS, we find the alignment between high-level and low-level models using gradient descent rather than conducting a brute-force search, and we allow individual neurons to play multiple distinct roles by analyzing representations in non-standard bases (distributed representations).

Explainable artificial intelligence

Rigorously Assessing Natural Language Explanations of Neurons

no code implementations 19 Sep 2023 Jing Huang, Atticus Geiger, Karel D'Oosterlinck, Zhengxuan Wu, Christopher Potts

Natural language is an appealing medium for explaining how large language models process and store information, but evaluating the faithfulness of such explanations is challenging.

Multi-teacher Distillation for Multilingual Spelling Correction

no code implementations 20 Nov 2023 Jingfen Zhang, Xuan Guo, Sravan Bodapati, Christopher Potts

Accurate spelling correction is a critical step in modern search interfaces, especially in an era of mobile devices and speech-to-text interfaces.

Multilingual NLP Spelling Correction

Mapping the Increasing Use of LLMs in Scientific Papers

no code implementations 1 Apr 2024 Weixin Liang, Yaohui Zhang, Zhengxuan Wu, Haley Lepp, Wenlong Ji, Xuandong Zhao, Hancheng Cao, Sheng Liu, Siyu He, Zhi Huang, Diyi Yang, Christopher Potts, Christopher D. Manning, James Y. Zou

To address this gap, we conduct the first systematic, large-scale analysis across 950,965 papers published between January 2020 and February 2024 on the arXiv, bioRxiv, and Nature portfolio journals, using a population-level statistical framework to measure the prevalence of LLM-modified content over time.
