Search Results for author: Roy Schwartz

Found 58 papers, 26 papers with code

Automatic Selection of Context Configurations for Improved Class-Specific Word Representations

no code implementations CoNLL 2017 Ivan Vulić, Roy Schwartz, Ari Rappoport, Roi Reichart, Anna Korhonen

With our selected context configurations, we train on only 14% (A), 26.2% (V), and 33.6% (N) of all dependency-based contexts, resulting in a reduced training time.

Word Similarity

Story Cloze Task: UW NLP System

no code implementations WS 2017 Roy Schwartz, Maarten Sap, Ioannis Konstas, Leila Zilles, Yejin Choi, Noah A. Smith

This paper describes University of Washington NLP's submission for the Linking Models of Lexical, Sentential and Discourse-level Semantics (LSDSem 2017) shared task: the Story Cloze Task.

Language Modelling

Annotation Artifacts in Natural Language Inference Data

no code implementations NAACL 2018 Suchin Gururangan, Swabha Swayamdipta, Omer Levy, Roy Schwartz, Samuel R. Bowman, Noah A. Smith

Large-scale datasets for natural language inference are created by presenting crowd workers with a sentence (premise), and asking them to generate three new sentences (hypotheses) that it entails, contradicts, or is logically neutral with respect to.

Natural Language Inference Negation +2

A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications

1 code implementation NAACL 2018 Dongyeop Kang, Waleed Ammar, Bhavana Dalvi, Madeleine van Zuylen, Sebastian Kohlmeier, Eduard Hovy, Roy Schwartz

In the first task, we show that simple models can predict whether a paper is accepted with up to 21% error reduction compared to the majority baseline.

SoPa: Bridging CNNs, RNNs, and Weighted Finite-State Machines

2 code implementations 15 May 2018 Roy Schwartz, Sam Thomson, Noah A. Smith

Recurrent and convolutional neural networks comprise two distinct families of models that have proven to be useful for encoding natural language utterances.

Explainable artificial intelligence General Classification +3

LSTMs Exploit Linguistic Attributes of Data

no code implementations WS 2018 Nelson F. Liu, Omer Levy, Roy Schwartz, Chenhao Tan, Noah A. Smith

While recurrent neural networks have found success in a variety of natural language processing applications, they are general models of sequential data.

Memorization Open-Ended Question Answering

Bridging CNNs, RNNs, and Weighted Finite-State Machines

no code implementations ACL 2018 Roy Schwartz, Sam Thomson, Noah A. Smith

Recurrent and convolutional neural networks comprise two distinct families of models that have proven to be useful for encoding natural language utterances.

General Classification Representation Learning +3

SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference

1 code implementation EMNLP 2018 Rowan Zellers, Yonatan Bisk, Roy Schwartz, Yejin Choi

Given a partial description like "she opened the hood of the car," humans can reason about the situation and anticipate what might come next ("then, she examined the engine").

Common Sense Reasoning Multiple-choice +2

Rational Recurrences

1 code implementation EMNLP 2018 Hao Peng, Roy Schwartz, Sam Thomson, Noah A. Smith

We characterize this connection formally, defining rational recurrences to be recurrent hidden state update functions that can be written as the Forward calculation of a finite set of WFSAs.

Language Modelling text-classification +1
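
To make the WFSA connection concrete, here is a minimal sketch (toy automaton, made-up weights, not the paper's code): the forward calculation over a weighted finite-state automaton is itself a recurrent hidden-state update, which is the kind of recurrence the paper characterizes.

```python
import numpy as np

# Toy weighted finite-state automaton (WFSA) over a binary alphabet.
# T[x] holds the transition weights applied when symbol x is read;
# all weights here are made up for illustration.
T = {0: np.array([[0.9, 0.1, 0.0],
                  [0.0, 0.8, 0.2],
                  [0.0, 0.0, 1.0]]),
     1: np.array([[0.5, 0.5, 0.0],
                  [0.0, 0.3, 0.7],
                  [0.0, 0.0, 1.0]])}

initial = np.array([1.0, 0.0, 0.0])  # all mass on the start state
final = np.array([0.0, 0.0, 1.0])    # state 2 is the accepting state

def forward_score(sequence):
    """Forward calculation: h_t = h_{t-1} @ T[x_t] is a (linear)
    recurrent hidden-state update, i.e. a rational recurrence."""
    h = initial
    for x in sequence:
        h = h @ T[x]      # one recurrent update per input symbol
    return h @ final      # total weight of accepting paths

print(forward_score([0, 1, 1]))  # ~0.406
```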

Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets

no code implementations NAACL 2019 Nelson F. Liu, Roy Schwartz, Noah A. Smith

Several datasets have recently been constructed to expose brittleness in models trained on existing benchmarks.

Green AI

2 code implementations 22 Jul 2019 Roy Schwartz, Jesse Dodge, Noah A. Smith, Oren Etzioni

Moreover, the financial cost of the computations can make it difficult for academics, students, and researchers, in particular those from emerging economies, to engage in deep learning research.

Show Your Work: Improved Reporting of Experimental Results

4 code implementations IJCNLP 2019 Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith

Research in natural language processing proceeds, in part, by demonstrating that new models achieve superior performance (e.g., accuracy) on held-out test data, compared to previous results.

Knowledge Enhanced Contextual Word Representations

1 code implementation IJCNLP 2019 Matthew E. Peters, Mark Neumann, Robert L. Logan IV, Roy Schwartz, Vidur Joshi, Sameer Singh, Noah A. Smith

Contextual word representations, typically trained on unstructured, unlabeled text, do not contain any explicit grounding to real world entities and are often unable to remember facts about those entities.

Entity Linking Entity Typing +3

Fair Correlation Clustering

no code implementations 10 Feb 2020 Saba Ahmadi, Sainyam Galhotra, Barna Saha, Roy Schwartz

We consider two variations of fairness constraint for the problem of correlation clustering where each node has a color, and the goal is to form clusters that do not over-represent vertices of any color.

Clustering Fairness
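
As a concrete reading of the constraint, here is a small sketch that checks one natural formalization: a cap on the fraction any single color may occupy within a cluster. The cap `alpha` and all names are illustrative, not from the paper, which studies two specific variants of the constraint.

```python
from collections import Counter

def is_fair(clusters, colors, alpha=0.5):
    """Return True if no color exceeds an alpha fraction of any cluster.

    clusters: list of lists of node ids; colors: node id -> color.
    The cap alpha is one possible formalization of "do not
    over-represent vertices of any color".
    """
    for cluster in clusters:
        counts = Counter(colors[v] for v in cluster)
        if any(c / len(cluster) > alpha for c in counts.values()):
            return False
    return True

colors = {1: "red", 2: "red", 3: "blue", 4: "blue"}
print(is_fair([[1, 3], [2, 4]], colors))  # True: each cluster is half/half
print(is_fair([[1, 2], [3, 4]], colors))  # False: monochromatic clusters
```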

Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping

4 code implementations 15 Feb 2020 Jesse Dodge, Gabriel Ilharco, Roy Schwartz, Ali Farhadi, Hannaneh Hajishirzi, Noah Smith

We publicly release all of our experimental data, including training and validation scores for 2,100 trials, to encourage further analysis of training dynamics during fine-tuning.

The Right Tool for the Job: Matching Model and Instance Complexities

1 code implementation ACL 2020 Roy Schwartz, Gabriel Stanovsky, Swabha Swayamdipta, Jesse Dodge, Noah A. Smith

Our method presents a favorable speed/accuracy tradeoff in almost all cases, producing models which are up to five times faster than the state of the art, while preserving their accuracy.

Natural Language Inference text-classification +1
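
The speed/accuracy tradeoff comes from exiting the network early on easy instances. Here is a minimal sketch of that control flow with stand-in layers and per-layer classifiers; the paper's actual exit criterion is a calibrated confidence, and the fixed threshold below is illustrative.

```python
import torch

def early_exit_predict(layers, exit_classifiers, x, threshold=0.9):
    """Run the encoder layer by layer, returning the prediction of the
    first exit classifier whose confidence clears the threshold."""
    h = x
    for layer, clf in zip(layers, exit_classifiers):
        h = layer(h)
        probs = torch.softmax(clf(h), dim=-1)
        conf, pred = probs.max(dim=-1)
        if conf.item() >= threshold:  # easy instance: stop computing here
            return pred.item()
    return pred.item()                # hard instance: used every layer

# Stand-in modules; a real setup would attach classifiers to transformer layers.
layers = [torch.nn.Linear(16, 16) for _ in range(4)]
clfs = [torch.nn.Linear(16, 3) for _ in range(4)]
print(early_exit_predict(layers, clfs, torch.randn(1, 16)))
```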

A Formal Hierarchy of RNN Architectures

no code implementations ACL 2020 William Merrill, Gail Weiss, Yoav Goldberg, Roy Schwartz, Noah A. Smith, Eran Yahav

While formally extending these findings to unsaturated RNNs is left to future work, we hypothesize that the practical learnable capacity of unsaturated RNNs obeys a similar hierarchy.

A Mixture of $h-1$ Heads is Better than $h$ Heads

no code implementations 13 May 2020 Hao Peng, Roy Schwartz, Dianqi Li, Noah A. Smith

Multi-head attentive neural architectures have achieved state-of-the-art results on a variety of natural language processing tasks.

Language Modelling Machine Translation +1

A Mixture of h - 1 Heads is Better than h Heads

no code implementations ACL 2020 Hao Peng, Roy Schwartz, Dianqi Li, Noah A. Smith

Multi-head attentive neural architectures have achieved state-of-the-art results on a variety of natural language processing tasks.

Language Modelling Machine Translation +1

Extracting a Knowledge Base of Mechanisms from COVID-19 Papers

3 code implementations NAACL 2021 Tom Hope, Aida Amini, David Wadden, Madeleine van Zuylen, Sravanthi Parasa, Eric Horvitz, Daniel Weld, Roy Schwartz, Hannaneh Hajishirzi

The COVID-19 pandemic has spawned a diverse body of scientific literature that is challenging to navigate, stimulating interest in automated tools to help find useful knowledge.

Navigate

Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent

1 code implementation EMNLP 2021 William Merrill, Vivek Ramanujan, Yoav Goldberg, Roy Schwartz, Noah Smith

To better understand this bias, we study the tendency for transformer parameters to grow in magnitude ($\ell_2$ norm) during training, and its implications for the emergent representations within self attention layers.

Inductive Bias
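
The quantity under study is straightforward to track. A toy sketch that logs the global $\ell_2$ parameter norm during training (stand-in model and objective; on this toy loss the norm happens to shrink, whereas the paper's interest is its growth during transformer training):

```python
import torch

def param_l2_norm(model):
    """Global l2 norm of all parameters: the magnitude tracked above."""
    return torch.sqrt(sum(p.detach().pow(2).sum() for p in model.parameters()))

model = torch.nn.Linear(10, 10)                     # stand-in for a transformer
opt = torch.optim.SGD(model.parameters(), lr=0.1)
for step in range(101):
    loss = model(torch.randn(8, 10)).pow(2).mean()  # toy objective
    opt.zero_grad(); loss.backward(); opt.step()
    if step % 25 == 0:
        print(step, round(param_l2_norm(model).item(), 4))
```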

A Refined Analysis of Submodular Greedy

no code implementations 25 Feb 2021 Ariel Kulik, Roy Schwartz, Hadas Shachnai

Many algorithms for maximizing a monotone submodular function subject to a knapsack constraint rely on the natural greedy heuristic.

Data Structures and Algorithms
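
The "natural greedy heuristic" picks, at each step, the feasible element with the largest marginal gain per unit cost. A minimal sketch with a toy coverage function (illustrative only; analyses of this heuristic often pair it with partial enumeration to obtain approximation guarantees):

```python
def greedy_knapsack(elements, f, cost, budget):
    """Natural greedy heuristic: repeatedly add the feasible element
    with the largest marginal gain per unit cost."""
    chosen, spent = set(), 0.0
    while True:
        best, best_ratio = None, 0.0
        for e in elements - chosen:
            if spent + cost[e] > budget:
                continue  # element no longer fits the knapsack
            gain = f(chosen | {e}) - f(chosen)
            if gain / cost[e] > best_ratio:
                best, best_ratio = e, gain / cost[e]
        if best is None:
            return chosen
        chosen.add(best)
        spent += cost[best]

# Toy monotone submodular function: coverage of ground-set items.
sets = {"a": {1, 2}, "b": {2, 3, 4}, "c": {5}}
f = lambda S: len(set().union(*(sets[e] for e in S))) if S else 0
print(greedy_knapsack(set(sets), f, {"a": 1, "b": 2, "c": 1}, budget=3))
```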

Random Feature Attention

no code implementations ICLR 2021 Hao Peng, Nikolaos Pappas, Dani Yogatama, Roy Schwartz, Noah A. Smith, Lingpeng Kong

RFA can be used as a drop-in replacement for conventional softmax attention and offers a straightforward way of learning with recency bias through an optional gating mechanism.

Language Modelling Machine Translation +3
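
The drop-in replacement works because exp(q·k) can be approximated by an inner product of random feature maps, making attention linear in sequence length. A minimal numpy sketch in that spirit, using trigonometric random features for unit-norm queries and keys; the paper's gating mechanism is omitted and all sizes are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
d, D, n = 16, 256, 10        # head dim, number of random features, seq length
W = rng.normal(size=(D, d))  # random projections
b = rng.uniform(0, 2 * np.pi, size=D)

def phi(x):
    """Random Fourier features: phi(x) @ phi(y) ~ exp(-||x-y||^2 / 2),
    which for unit-norm x, y is proportional to exp(x . y)."""
    return np.sqrt(2.0 / D) * np.cos(x @ W.T + b)

def unit(x):
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

q = unit(rng.normal(size=d))
K, V = unit(rng.normal(size=(n, d))), rng.normal(size=(n, d))

# Exact softmax attention, for reference.
w = np.exp(K @ q)
exact = (w[:, None] * V).sum(0) / w.sum()

# Linear-time approximation: the sums over phi(k_i) are computed once,
# so the cost is O(n*D) instead of the O(n^2) of softmax attention.
pk = phi(K)                      # (n, D)
approx = (phi(q) @ (pk.T @ V)) / (phi(q) @ pk.sum(0))
print(np.abs(exact - approx).max())  # small approximation error
```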

Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA

2 code implementations NAACL 2021 Yonatan Bitton, Gabriel Stanovsky, Roy Schwartz, Michael Elhadad

Recent works have shown that supervised models often exploit data artifacts to achieve good test scores while their performance severely degrades on samples outside their training distribution.

Question Answering Relational Reasoning +1

Provable Limitations of Acquiring Meaning from Ungrounded Form: What Will Future Language Models Understand?

no code implementations 22 Apr 2021 William Merrill, Yoav Goldberg, Roy Schwartz, Noah A. Smith

We study whether assertions enable a system to emulate representations preserving semantic relations like equivalence.

Data Efficient Masked Language Modeling for Vision and Language

1 code implementation Findings (EMNLP) 2021 Yonatan Bitton, Gabriel Stanovsky, Michael Elhadad, Roy Schwartz

We investigate a range of alternative masking strategies specific to the cross-modal setting that address these shortcomings, aiming for better fusion of text and image in the learned representation.

Language Modelling Masked Language Modeling +1

Expected Validation Performance and Estimation of a Random Variable's Maximum

no code implementations 1 Oct 2021 Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith

We find that the two biased estimators lead to the fewest incorrect conclusions, which hints at the importance of minimizing variance and MSE.
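
The object being estimated is the expected maximum of k validation scores drawn from a pool of trials. A small sketch of one standard plug-in estimator built on the empirical CDF, in the spirit of the biased estimators discussed; the scores below are synthetic.

```python
import numpy as np

def expected_max(scores, k):
    """Plug-in estimate of E[max of k draws] from n observed scores:
    P(max <= v_(i)) = (i/n)^k under the empirical CDF. Plugging in the
    empirical CDF is what makes the estimator biased but low-variance."""
    v = np.sort(scores)
    n = len(v)
    cdf_pow = (np.arange(1, n + 1) / n) ** k
    weights = np.diff(np.concatenate(([0.0], cdf_pow)))
    return float(v @ weights)

# Synthetic validation accuracies from 50 hyperparameter assignments.
scores = np.random.default_rng(0).uniform(0.6, 0.9, size=50)
for k in (1, 5, 25, 50):
    print(k, round(expected_max(scores, k), 4))  # increases with budget k
```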

Data Contamination: From Memorization to Exploitation

1 code implementation ACL 2022 Inbal Magar, Roy Schwartz

Experiments with two models and three downstream tasks show that exploitation exists in some cases, but in others the models memorize the contaminated data, but do not exploit it.

Memorization

A deep learning framework for the detection and quantification of drusen and reticular pseudodrusen on optical coherence tomography

no code implementations 5 Apr 2022 Roy Schwartz, Hagar Khalid, Sandra Liakopoulos, Yanling Ouyang, Coen de Vente, Cristina González-Gonzalo, Aaron Y. Lee, Robyn Guymer, Emily Y. Chew, Catherine Egan, Zhichao Wu, Himeesh Kumar, Joseph Farrington, Clara I. Sánchez, Adnan Tufail

Methods - A DL framework was developed consisting of a classification model and an out-of-distribution (OOD) detection model for the identification of ungradable scans; a classification model to identify scans with drusen or RPD; and an image segmentation model to independently segment lesions as RPD or drusen.

Classification Image Segmentation +4

TangoBERT: Reducing Inference Cost by using Cascaded Architecture

no code implementations 13 Apr 2022 Jonathan Mamou, Oren Pereg, Moshe Wasserblat, Roy Schwartz

In order to reduce this computational load at inference time, we present TangoBERT, a cascaded model architecture in which instances are first processed by an efficient but less accurate first-tier model, and only some of those instances are additionally processed by a less efficient but more accurate second-tier model.

Reading Comprehension SST-2 +2
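
The cascade's control flow fits in a few lines. A sketch with hypothetical stand-ins for the two tiers; routing on first-tier confidence is one natural rule, and the threshold would be tuned on held-out data.

```python
import torch

def cascade_predict(tier1, tier2, x, threshold=0.85):
    """Two-tier cascade: every instance pays for the cheap first tier;
    only instances the first tier is unsure about are escalated.
    The threshold sets the cost/accuracy tradeoff."""
    probs = torch.softmax(tier1(x), dim=-1)
    conf, pred = probs.max(dim=-1)
    if conf.item() >= threshold:
        return pred.item()                 # cheap path, most instances stop here
    return tier2(x).argmax(dim=-1).item()  # accurate path

# Hypothetical stand-ins for the efficient and accurate tiers.
tier1 = torch.nn.Linear(16, 3)
tier2 = torch.nn.Sequential(torch.nn.Linear(16, 64), torch.nn.ReLU(),
                            torch.nn.Linear(64, 3))
print(cascade_predict(tier1, tier2, torch.randn(1, 16)))
```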

On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations

no code implementations Findings (NAACL) 2022 Roy Schwartz, Gabriel Stanovsky

Recent work has shown that deep learning models in NLP are highly sensitive to low-level correlations between simple features and specific output labels, leading to overfitting and lack of generalization.

Common Sense Reasoning World Knowledge

Measuring the Carbon Intensity of AI in Cloud Instances

no code implementations 10 Jun 2022 Jesse Dodge, Taylor Prewitt, Remi Tachet des Combes, Erika Odmark, Roy Schwartz, Emma Strubell, Alexandra Sasha Luccioni, Noah A. Smith, Nicole DeCario, Will Buchanan

By providing unprecedented access to computational resources, cloud computing has enabled rapid growth in technologies such as machine learning, the computational demands of which incur a high energy cost and a commensurate carbon footprint.

Cloud Computing Language Modelling
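
The underlying accounting is simple arithmetic: energy drawn by the job times the carbon intensity of the grid that powered it. A toy worked example with made-up numbers; real intensities vary by region and hour, which is a central point of the paper.

```python
# Toy emissions estimate; every number below is made up for illustration.
gpu_power_kw = 0.3        # average draw of one GPU, kW
gpus, hours = 8, 24       # a hypothetical one-day, 8-GPU training run
pue = 1.1                 # datacenter overhead (power usage effectiveness)
intensity = 400           # grid carbon intensity, gCO2eq per kWh

energy_kwh = gpu_power_kw * gpus * hours * pue
emissions_kg = energy_kwh * intensity / 1000
print(f"{energy_kwh:.1f} kWh -> {emissions_kg:.1f} kg CO2eq")  # 63.4 kWh -> 25.3 kg
```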

Fewer Errors, but More Stereotypes? The Effect of Model Size on Gender Bias

1 code implementation NAACL (GeBNLP) 2022 Yarden Tal, Inbal Magar, Roy Schwartz

We find that while larger models outperform smaller ones, the probability that their mistakes are caused by gender bias is higher.

Language Modelling Memorization

WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models

1 code implementation 25 Jul 2022 Yonatan Bitton, Nitzan Bitton Guetta, Ron Yosef, Yuval Elovici, Mohit Bansal, Gabriel Stanovsky, Roy Schwartz

While vision-and-language models perform well on tasks such as visual question answering, they struggle when it comes to basic human commonsense reasoning skills.

Common Sense Reasoning General Knowledge +4

How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers

1 code implementation 7 Nov 2022 Michael Hassid, Hao Peng, Daniel Rotem, Jungo Kasai, Ivan Montero, Noah A. Smith, Roy Schwartz

Our results motivate research on simpler alternatives to input-dependent attention, as well as on methods for better utilization of this mechanism in the Transformer architecture.

VASR: Visual Analogies of Situation Recognition

1 code implementation 8 Dec 2022 Yonatan Bitton, Ron Yosef, Eli Strugo, Dafna Shahaf, Roy Schwartz, Gabriel Stanovsky

We leverage situation recognition annotations and the CLIP model to generate a large set of 500k candidate analogies.

Common Sense Reasoning Visual Analogies +1

Fighting Bias with Bias: Promoting Model Robustness by Amplifying Dataset Biases

1 code implementation 30 May 2023 Yuval Reif, Roy Schwartz

We suggest that in order to drive the development of models robust to subtle biases, dataset biases should be amplified in the training set.

Morphosyntactic probing of multilingual BERT models

1 code implementation 9 Jun 2023 Judit Acs, Endre Hamerlik, Roy Schwartz, Noah A. Smith, Andras Kornai

We introduce an extensive dataset for multilingual probing of morphological information in language models (247 tasks across 42 languages from 10 families), each consisting of a sentence with a target word and a morphological tag as the desired label, derived from the Universal Dependencies treebanks.

Sentence TAG

Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research

no code implementations 29 Jun 2023 Ji-Ung Lee, Haritz Puerto, Betty van Aken, Yuki Arase, Jessica Zosa Forde, Leon Derczynski, Andreas Rücklé, Iryna Gurevych, Roy Schwartz, Emma Strubell, Jesse Dodge

Many recent improvements in NLP stem from the development and use of large pre-trained language models (PLMs) with billions of parameters.

Read, Look or Listen? What's Needed for Solving a Multimodal Dataset

no code implementations 6 Jul 2023 Netta Madvil, Yonatan Bitton, Roy Schwartz

We propose a two-step method to analyze multimodal datasets, which leverages a small seed of human annotation to map each multimodal instance to the modalities required to process it.

Question Answering Speaker Identification +1

Transformers are Multi-State RNNs

1 code implementation 11 Jan 2024 Matanel Oren, Michael Hassid, Yossi Adi, Roy Schwartz

We further show that pretrained transformers can be converted into finite multi-state RNNs by fixing the size of their hidden state.
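
Fixing the hidden-state size amounts to capping the key/value cache at a constant number of entries. A minimal sketch of the simplest possible eviction policy, dropping the oldest entry; the paper evaluates several policies, including attention-based ones.

```python
from collections import deque

class FixedSizeKVCache:
    """Key/value cache capped at max_states entries: with a constant-size
    cache, the transformer behaves like a finite multi-state RNN."""
    def __init__(self, max_states):
        self.keys = deque(maxlen=max_states)
        self.values = deque(maxlen=max_states)

    def append(self, k, v):
        # deque(maxlen=...) silently evicts the oldest entry when full;
        # dropping the oldest token is the simplest possible policy.
        self.keys.append(k)
        self.values.append(v)

cache = FixedSizeKVCache(max_states=4)
for t in range(10):
    cache.append(f"k{t}", f"v{t}")
print(list(cache.keys))  # only the 4 most recent states survive
```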

The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

no code implementations 31 Mar 2024 Michael Hassid, Tal Remez, Jonas Gehring, Roy Schwartz, Yossi Adi

On the other hand, in scenarios where unit-tests are unavailable, a ranking-based selection of candidates from the smaller model falls short of the performance of a single output from larger ones.

Code Generation
