Search Results for author: Emma Strubell

Found 45 papers, 19 papers with code

Training for Fast Sequential Prediction Using Dynamic Feature Selection

no code implementations30 Oct 2014 Emma Strubell, Luke Vilnis, Andrew McCallum

We present paired learning and inference algorithms for significantly reducing computation and increasing speed of the vector dot products in the classifiers that are at the heart of many NLP components.

feature selection, Part-Of-Speech Tagging

Learning Dynamic Feature Selection for Fast Sequential Prediction

no code implementations IJCNLP 2015 Emma Strubell, Luke Vilnis, Kate Silverstein, Andrew McCallum

We present paired learning and inference algorithms for significantly reducing computation and increasing speed of the vector dot products in the classifiers that are at the heart of many NLP components.

Benchmarking, feature selection +7
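The two entries above describe the same line of work; as a rough illustration of the idea in the abstract (stop computing a linear classifier's dot product once the remaining features can no longer change the decision), here is a minimal Python sketch. The early-stopping rule and names are illustrative assumptions, not the paper's learned ordering and margins.

```python
import numpy as np

def early_stop_score(weights, feature_ids, margin=2.0):
    """Accumulate a sparse linear score feature-by-feature (features ordered from
    most to least informative) and stop as soon as the remaining features can no
    longer flip the decision. Illustrative only: the paper learns the feature
    ordering and per-template confidence margins jointly with the model."""
    max_abs_w = np.abs(weights).max()        # loose bound on any single feature's effect
    score = 0.0
    for i, f in enumerate(feature_ids):
        score += weights[f]
        remaining = len(feature_ids) - (i + 1)
        if abs(score) - remaining * max_abs_w > margin:
            break                            # decision can no longer change sign
    return score
```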

Multilingual Relation Extraction using Compositional Universal Schema

1 code implementation NAACL 2016 Patrick Verga, David Belanger, Emma Strubell, Benjamin Roth, Andrew McCallum

In response, this paper introduces significant further improvements to the coverage and flexibility of universal schema relation extraction: predictions for entities unseen in training and multilingual transfer learning to domains with no annotation.

Relation, Relation Extraction +4

Dependency Parsing with Dilated Iterated Graph CNNs

no code implementations WS 2017 Emma Strubell, Andrew McCallum

Dependency parses are an effective way to inject linguistic knowledge into many downstream tasks, and many practitioners wish to efficiently parse sentences at scale.

Dependency Parsing, Sentence

Automatically Extracting Action Graphs from Materials Science Synthesis Procedures

no code implementations18 Nov 2017 Sheshera Mysore, Edward Kim, Emma Strubell, Ao Liu, Haw-Shiuan Chang, Srikrishna Kompella, Kevin Huang, Andrew McCallum, Elsa Olivetti

In this work, we present a system for automatically extracting structured representations of synthesis procedures from the texts of materials science journal articles that describe explicit, experimental syntheses of inorganic compounds.

Simultaneously Self-Attending to All Mentions for Full-Abstract Biological Relation Extraction

1 code implementation NAACL 2018 Patrick Verga, Emma Strubell, Andrew McCallum

Most work in relation extraction forms a prediction by looking at a short span of text within a single sentence containing a single entity pair mention.

Relation, Relation Extraction +1

Linguistically-Informed Self-Attention for Semantic Role Labeling

1 code implementation EMNLP 2018 Emma Strubell, Patrick Verga, Daniel Andor, David Weiss, Andrew McCallum

Unlike previous models which require significant pre-processing to prepare linguistic features, LISA can incorporate syntax using merely raw tokens as input, encoding the sequence only once to simultaneously perform parsing, predicate detection and role labeling for all predicates.

Dependency Parsing, Multi-Task Learning +4
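For context on the LISA entry above, a rough PyTorch-style sketch of its central idea: supervise one self-attention head with gold dependency heads so parsing happens inside the encoder rather than as pre-processing. This is an approximation for illustration (tensor shapes and the loss form are assumptions); the actual model also performs predicate detection and role labeling in the same pass.

```python
import torch
import torch.nn.functional as F

def syntax_attention_loss(attn_weights, head_indices):
    """attn_weights: (batch, seq, seq) attention distribution of the designated
    'syntax' head; head_indices: (batch, seq) gold dependency head per token,
    -1 for padding. Cross-entropy pushes that head to attend to each token's
    syntactic head. Sketch of the idea, not the paper's exact formulation."""
    b, n, _ = attn_weights.shape
    logp = torch.log(attn_weights + 1e-9).view(b * n, n)
    return F.nll_loss(logp, head_indices.view(b * n), ignore_index=-1)

# total_loss = srl_loss + predicate_loss + syntax_weight * syntax_attention_loss(attn, heads)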

Syntax Helps ELMo Understand Semantics: Is Syntax Still Relevant in a Deep Neural Architecture for SRL?

no code implementations WS 2018 Emma Strubell, Andrew McCallum

Do unsupervised methods for learning rich, contextualized token representations obviate the need for explicit modeling of linguistic structure in neural network models for semantic role labeling (SRL)?

Semantic Role Labeling, Word Embeddings

Energy and Policy Considerations for Deep Learning in NLP

3 code implementations ACL 2019 Emma Strubell, Ananya Ganesh, Andrew McCallum

Recent progress in hardware and methodology for training neural networks has ushered in a new generation of large networks trained on abundant data.
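The estimates in this paper come from a simple accounting of measured power draw over training time; a sketch of that style of calculation is below. The PUE and grid-intensity constants are approximately the U.S. averages the paper cites and should be swapped for values matching your own hardware and grid.

```python
def estimate_co2e_lbs(gpu_watts, cpu_watts, dram_watts, num_gpus, hours,
                      pue=1.58, lbs_co2e_per_kwh=0.954):
    """Rough training-footprint estimate in the style of Strubell et al. (2019):
    total measured power (CPU + DRAM + all GPUs), scaled by datacenter overhead
    (PUE), integrated over training time, then converted to CO2e with an average
    grid intensity. Constants are approximations, not exact reproductions."""
    kwh = pue * hours * (cpu_watts + dram_watts + num_gpus * gpu_watts) / 1000.0
    return kwh * lbs_co2e_per_kwh

# e.g. 8 GPUs at 300 W each for 84 hours:
# estimate_co2e_lbs(gpu_watts=300, cpu_watts=100, dram_watts=50, num_gpus=8, hours=84)
```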

End-to-end Quantized Training via Log-Barrier Extensions

no code implementations1 Jan 2021 Juncheng B Li, Shuhui Qu, Xinjian Li, Emma Strubell, Florian Metze

Quantization of neural network parameters and activations has emerged as a successful approach to reducing the model size and inference time on hardware that supports native low-precision arithmetic.

Quantization

Unsupervised Domain Adaptation Via Pseudo-labels And Objectness Constraints

no code implementations29 Sep 2021 Rajshekhar Das, Jonathan Francis, Sanket Vaibhav Mehta, Jean Oh, Emma Strubell, Jose Moura

Crucially, the objectness constraint is agnostic to the ground-truth semantic segmentation labels and, therefore, remains appropriate for unsupervised adaptation settings.

Object, Pseudo Label +4

An Empirical Investigation of the Role of Pre-training in Lifelong Learning

1 code implementation NeurIPS 2023 Sanket Vaibhav Mehta, Darshan Patil, Sarath Chandar, Emma Strubell

The lifelong learning paradigm in machine learning is an attractive alternative to the more prominent isolated learning scheme, not only due to its resemblance to biological learning but also due to its potential to reduce energy waste by obviating excessive model re-training.

Continual Learning, Image Classification

Train Flat, Then Compress: Sharpness-Aware Minimization Learns More Compressible Models

no code implementations25 May 2022 Clara Na, Sanket Vaibhav Mehta, Emma Strubell

Model compression by way of parameter pruning, quantization, or distillation has recently gained popularity as an approach for reducing the computational requirements of modern deep neural network models for NLP.

Model Compression, Quantization +3
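The "flat" part of the title refers to Sharpness-Aware Minimization (SAM, Foret et al.); a minimal sketch of one SAM update is below, with compression (e.g. pruning or quantization) applied after training. The sketch assumes every parameter receives a gradient and skips production bookkeeping.

```python
import torch

def sam_step(model, loss_fn, batch, optimizer, rho=0.05):
    """One Sharpness-Aware Minimization update: perturb weights toward the
    worst-case direction within an L2 ball of radius rho, take the gradient
    there, then update the original weights. Sketch only."""
    # 1) gradient at the current weights
    loss_fn(model(batch["x"]), batch["y"]).backward()
    grads = [p.grad.clone() for p in model.parameters()]
    grad_norm = torch.sqrt(sum((g ** 2).sum() for g in grads))
    # 2) ascend to the worst-case nearby weights
    with torch.no_grad():
        for p, g in zip(model.parameters(), grads):
            p.add_(rho * g / (grad_norm + 1e-12))
    # 3) gradient at the perturbed weights drives the actual update
    optimizer.zero_grad()
    loss_fn(model(batch["x"]), batch["y"]).backward()
    with torch.no_grad():
        for p, g in zip(model.parameters(), grads):
            p.sub_(rho * g / (grad_norm + 1e-12))   # restore original weights
    optimizer.step()
```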

Measuring the Carbon Intensity of AI in Cloud Instances

no code implementations10 Jun 2022 Jesse Dodge, Taylor Prewitt, Remi Tachet des Combes, Erika Odmark, Roy Schwartz, Emma Strubell, Alexandra Sasha Luccioni, Noah A. Smith, Nicole DeCario, Will Buchanan

By providing unprecedented access to computational resources, cloud computing has enabled rapid growth in technologies such as machine learning, the computational demands of which incur a high energy cost and a commensurate carbon footprint.

Cloud Computing, Language Modelling
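A key point of this paper is that the same kWh can carry very different emissions depending on when and where a cloud job runs; a small sketch of that time-resolved accounting is below, with made-up per-interval numbers for illustration.

```python
def job_emissions_g(energy_kwh_per_interval, grid_gco2_per_kwh):
    """Emissions of a cloud job when grid carbon intensity varies over time:
    pair each measurement interval's energy with the grid intensity during that
    interval, instead of using a single yearly average. Data below is hypothetical."""
    return sum(e * c for e, c in zip(energy_kwh_per_interval, grid_gco2_per_kwh))

# hypothetical 4-hour job with hourly readings
energy = [1.2, 1.3, 1.1, 1.2]             # kWh drawn in each hour
intensity = [430.0, 380.0, 250.0, 510.0]  # gCO2e/kWh of the local grid in each hour
print(job_emissions_g(energy, intensity))
```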

SQuAT: Sharpness- and Quantization-Aware Training for BERT

no code implementations13 Oct 2022 Zheng Wang, Juncheng B Li, Shuhui Qu, Florian Metze, Emma Strubell

Quantization is an effective technique to reduce memory footprint, inference latency, and power consumption of deep learning models.

Quantization
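Not SQuAT itself, but as background for this entry, a generic quantization-aware-training building block: fake-quantize weights in the forward pass and pass gradients straight through in the backward pass. The sharpness-aware part of SQuAT is not shown.

```python
import torch

class FakeQuant(torch.autograd.Function):
    """Uniform fake quantization with a straight-through estimator (STE):
    round to the low-precision grid on the forward pass, treat the rounding
    as the identity on the backward pass. Generic QAT sketch, not SQuAT."""
    @staticmethod
    def forward(ctx, w, num_bits=8):
        qmax = 2 ** (num_bits - 1) - 1
        scale = w.abs().max() / qmax + 1e-12
        return torch.clamp(torch.round(w / scale), -qmax - 1, qmax) * scale

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output, None   # straight-through: gradient w.r.t. w unchanged

# usage: w_q = FakeQuant.apply(linear.weight); out = x @ w_q.t()
```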

Mention Annotations Alone Enable Efficient Domain Adaptation for Coreference Resolution

1 code implementation14 Oct 2022 Nupoor Gandhi, Anjalie Field, Emma Strubell

Although recent neural models for coreference resolution have led to substantial improvements on benchmark datasets, transferring these models to new target domains containing out-of-vocabulary spans and requiring differing annotation schemes remains challenging.

coreference-resolution, Domain Adaptation

A Survey of Active Learning for Natural Language Processing

1 code implementation18 Oct 2022 Zhisong Zhang, Emma Strubell, Eduard Hovy

In this work, we provide a survey of active learning (AL) for its applications in natural language processing (NLP).

Active Learning, Structured Prediction

Bridging Fairness and Environmental Sustainability in Natural Language Processing

no code implementations8 Nov 2022 Marius Hessenthaler, Emma Strubell, Dirk Hovy, Anne Lauscher

Fairness and environmental impact are important research directions for the sustainable development of artificial intelligence.

Dimensionality Reduction, Fairness +4

Error-aware Quantization through Noise Tempering

no code implementations11 Dec 2022 Zheng Wang, Juncheng B Li, Shuhui Qu, Florian Metze, Emma Strubell

In this work, we incorporate exponentially decaying quantization-error-aware noise together with a learnable scale of task loss gradient to approximate the effect of a quantization operator.

Model Compression, Quantization

DSI++: Updating Transformer Memory with New Documents

no code implementations19 Dec 2022 Sanket Vaibhav Mehta, Jai Gupta, Yi Tay, Mostafa Dehghani, Vinh Q. Tran, Jinfeng Rao, Marc Najork, Emma Strubell, Donald Metzler

In this work, we introduce DSI++, a continual learning challenge for DSI to incrementally index new documents while being able to answer queries related to both previously and newly indexed documents.

Continual Learning, Natural Questions +1

The Framework Tax: Disparities Between Inference Efficiency in NLP Research and Deployment

1 code implementation13 Feb 2023 Jared Fernandez, Jacob Kahn, Clara Na, Yonatan Bisk, Emma Strubell

In this work, we examine this phenomenon through a series of case studies analyzing the effects of model design decisions, framework paradigms, and hardware platforms on total model latency.

Computational Efficiency

Regularizing Self-training for Unsupervised Domain Adaptation via Structural Constraints

no code implementations29 Apr 2023 Rajshekhar Das, Jonathan Francis, Sanket Vaibhav Mehta, Jean Oh, Emma Strubell, Jose Moura

Self-training based on pseudo-labels has emerged as a dominant approach for addressing conditional distribution shifts in unsupervised domain adaptation (UDA) for semantic segmentation problems.

Object, Semantic Segmentation +1
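For context on the self-training setup this entry builds on, a minimal sketch of confidence-thresholded pseudo-labelling for segmentation is below; the paper's actual contribution, the structural/objectness regularization of those labels, is not shown.

```python
import torch

def make_pseudo_labels(seg_logits, threshold=0.9, ignore_index=255):
    """Turn target-domain predictions into self-training targets: keep the argmax
    class where the model is confident, mark everything else as ignored so it is
    skipped by the loss (e.g. cross_entropy(..., ignore_index=255)). Generic
    pseudo-labelling sketch, not the paper's full method."""
    probs = torch.softmax(seg_logits, dim=1)   # (B, C, H, W)
    conf, labels = probs.max(dim=1)            # confidences and classes, (B, H, W)
    labels[conf < threshold] = ignore_index    # low-confidence pixels are ignored
    return labels
```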

Data-efficient Active Learning for Structured Prediction with Partial Annotation and Self-Training

1 code implementation22 May 2023 Zhisong Zhang, Emma Strubell, Eduard Hovy

To address this challenge, we adopt an error estimator to adaptively decide the partial selection ratio according to the current model's capability.

Active Learning, Structured Prediction
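A hedged sketch of the idea described in this abstract: annotate only the sub-structures (e.g. tokens) the model is least sure about, and let an estimate of the model's current error rate set how many. The selection rule and estimator below are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np

def select_partial(token_margins, estimated_error_rate, min_ratio=0.1, max_ratio=0.9):
    """Choose which tokens of a sentence to send for annotation.
    token_margins: per-token margin between the top-2 label scores (low = uncertain).
    The selection ratio grows with the estimated error rate, so a weak model asks
    for more supervision and a strong one for less."""
    ratio = float(np.clip(estimated_error_rate, min_ratio, max_ratio))
    k = max(1, int(round(ratio * len(token_margins))))
    return list(np.argsort(token_margins)[:k])   # indices of the k most uncertain tokens
```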

How To Train Your (Compressed) Large Language Model

no code implementations24 May 2023 Ananya Harsh Jha, Tom Sherborne, Evan Pete Walsh, Dirk Groeneveld, Emma Strubell, Iz Beltagy

With the increase in the size of large language models (LLMs), we need compression methods that can reduce the model size while preserving the generality and zero-shot promptability of the model.

Knowledge Distillation, Language Modelling +1

Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research

no code implementations29 Jun 2023 Ji-Ung Lee, Haritz Puerto, Betty van Aken, Yuki Arase, Jessica Zosa Forde, Leon Derczynski, Andreas Rücklé, Iryna Gurevych, Roy Schwartz, Emma Strubell, Jesse Dodge

Many recent improvements in NLP stem from the development and use of large pre-trained language models (PLMs) with billions of parameters.

Queer People are People First: Deconstructing Sexual Identity Stereotypes in Large Language Models

no code implementations30 Jun 2023 Harnoor Dhingra, Preetiha Jayashanker, Sayali Moghe, Emma Strubell

Large Language Models (LLMs) are trained primarily on minimally processed web text, which exhibits the same wide range of social biases held by the humans who created that content.

Sentence

To Build Our Future, We Must Know Our Past: Contextualizing Paradigm Shifts in Natural Language Processing

no code implementations11 Oct 2023 Sireesh Gururaja, Amanda Bertsch, Clara Na, David Gray Widder, Emma Strubell

NLP is in a period of disruptive change that is impacting our methodologies, funding sources, and public perception.

Energy and Carbon Considerations of Fine-Tuning BERT

no code implementations17 Nov 2023 Xiaorong Wang, Clara Na, Emma Strubell, Sorelle Friedler, Sasha Luccioni

Despite the popularity of the 'pre-train then fine-tune' paradigm in the NLP community, existing work quantifying energy costs and associated carbon emissions has largely focused on language model pre-training.

Language Modelling

Power Hungry Processing: Watts Driving the Cost of AI Deployment?

no code implementations28 Nov 2023 Alexandra Sasha Luccioni, Yacine Jernite, Emma Strubell

Recent years have seen a surge in the popularity of commercial AI products based on generative, multi-purpose AI systems promising a unified approach to building machine learning (ML) models into technology.

Understanding the Effect of Model Compression on Social Bias in Large Language Models

1 code implementation9 Dec 2023 Gustavo Gonçalves, Emma Strubell

Large Language Models (LLMs) trained with self-supervision on vast corpora of web text fit to the social biases of that text.

Knowledge Distillation, Model Compression +1

Source-Aware Training Enables Knowledge Attribution in Language Models

1 code implementation1 Apr 2024 Muhammad Khalifa, David Wadden, Emma Strubell, Honglak Lee, Lu Wang, Iz Beltagy, Hao Peng

We investigate the problem of intrinsic source citation, where LLMs are required to cite the pretraining source supporting a generated response.

Data Augmentation

Comparing Span Extraction Methods for Semantic Role Labeling

1 code implementation ACL (spnlp) 2021 Zhisong Zhang, Emma Strubell, Eduard Hovy

In this work, we empirically compare span extraction methods for the task of semantic role labeling (SRL).

Semantic Role Labeling

Evaluating Gender Bias Transfer from Film Data

no code implementations NAACL (GeBNLP) 2022 Amanda Bertsch, Ashley Oh, Sanika Natu, Swetha Gangu, Alan W. Black, Emma Strubell

We extend our analysis to a longitudinal study of bias in film dialogue over the last 110 years and find that continued pre-training on OpenSubtitles encodes additional bias into BERT.

Dialogue Generation, Machine Translation +4

On the Benefit of Syntactic Supervision for Cross-lingual Transfer in Semantic Role Labeling

1 code implementation EMNLP 2021 Zhisong Zhang, Emma Strubell, Eduard Hovy

Although recent developments in neural architectures and pre-trained representations have greatly increased state-of-the-art model performance on fully-supervised semantic role labeling (SRL), the task remains challenging for languages where supervised SRL training data are not abundant.

Cross-Lingual Transfer, Semantic Role Labeling
