Search Results for author: Gabriel Stanovsky

Found 60 papers, 34 papers with code

Leveraging Collection-Wide Similarities for Unsupervised Document Structure Extraction

no code implementations21 Feb 2024 Gili Lior, Yoav Goldberg, Gabriel Stanovsky

Document collections of various domains, e. g., legal, medical, or financial, often share some underlying collection-wide structure, which captures information that can aid both human users and structure-aware models.

K-QA: A Real-World Medical Q&A Benchmark

1 code implementation25 Jan 2024 Itay Manes, Naama Ronn, David Cohen, Ran Ilan Ber, Zehavi Horowitz-Kugler, Gabriel Stanovsky

Ensuring the accuracy of responses provided by large language models (LLMs) is crucial, particularly in clinical settings where incorrect information may directly impact patient health.

Hallucination In-Context Learning +1

State of What Art? A Call for Multi-Prompt LLM Evaluation

1 code implementation31 Dec 2023 Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky

Recent advances in large language models (LLMs) have led to the development of various evaluation benchmarks.

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias

1 code implementation1 Aug 2023 Itay Itzhak, Gabriel Stanovsky, Nir Rosenfeld, Yonatan Belinkov

Recent studies show that instruction tuning (IT) and reinforcement learning from human feedback (RLHF) improve the abilities of large language models (LMs) dramatically.

Decision Making

Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents

1 code implementation1 Jun 2023 Catherine Chen, Zejiang Shen, Dan Klein, Gabriel Stanovsky, Doug Downey, Kyle Lo

Recent work has shown that infusing layout features into language models (LMs) improves processing of visually-rich documents such as scientific papers.

Schema-Driven Information Extraction from Heterogeneous Tables

1 code implementation23 May 2023 Fan Bai, Junmo Kang, Gabriel Stanovsky, Dayne Freitag, Alan Ritter

We use this collection of annotated tables to evaluate the ability of open-source and API-based language models to extract information from tables covering diverse domains and data formats.

Attribute Extraction Instruction Following +1

The Perfect Victim: Computational Analysis of Judicial Attitudes towards Victims of Sexual Violence

no code implementations9 May 2023 Eliya Habba, Renana Keydar, Dan Bareket, Gabriel Stanovsky

Second, we curate a manually annotated dataset for judicial assessments of victim's credibility in the Hebrew language, as well as a model that can extract credibility labels from court cases.

Evaluating and Improving the Coreference Capabilities of Machine Translation Models

no code implementations16 Feb 2023 Asaf Yehudai, Arie Cattan, Omri Abend, Gabriel Stanovsky

Machine translation (MT) requires a wide range of linguistic capabilities, which current end-to-end models are expected to learn implicitly by observing aligned sentences in bilingual corpora.

coreference-resolution Machine Translation +1

A Large-Scale Multilingual Study of Visual Constraints on Linguistic Selection of Descriptions

no code implementations9 Feb 2023 Uri Berger, Lea Frermann, Gabriel Stanovsky, Omri Abend

We study the relation between visual input and linguistic choices by training classifiers to predict the probability of expressing a property from raw images, and find evidence supporting the claim that linguistic properties are constrained by visual context across languages.

Text Generation

VASR: Visual Analogies of Situation Recognition

1 code implementation8 Dec 2022 Yonatan Bitton, Ron Yosef, Eli Strugo, Dafna Shahaf, Roy Schwartz, Gabriel Stanovsky

We leverage situation recognition annotations and the CLIP model to generate a large set of 500k candidate analogies.

Common Sense Reasoning Visual Analogies +1

"Covid vaccine is against Covid but Oxford vaccine is made at Oxford!" Semantic Interpretation of Proper Noun Compounds

1 code implementation24 Oct 2022 Keshav Kolluru, Gabriel Stanovsky, Mausam

Proper noun compounds, e. g., "Covid vaccine", convey information in a succinct manner (a "Covid vaccine" is a "vaccine that immunizes against the Covid disease").

Proper Noun

You Can Have Your Data and Balance It Too: Towards Balanced and Efficient Multilingual Models

no code implementations13 Oct 2022 Tomasz Limisiewicz, Dan Malkin, Gabriel Stanovsky

Our method outperforms standard training methods in low-resource languages and retrains performance on high-resource languages while using the same amount of data.

Cross-Lingual Transfer Knowledge Distillation

WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models

1 code implementation25 Jul 2022 Yonatan Bitton, Nitzan Bitton Guetta, Ron Yosef, Yuval Elovici, Mohit Bansal, Gabriel Stanovsky, Roy Schwartz

While vision-and-language models perform well on tasks such as visual question answering, they struggle when it comes to basic human commonsense reasoning skills.

Common Sense Reasoning General Knowledge +4

A Computational Acquisition Model for Multimodal Word Categorization

1 code implementation NAACL 2022 Uri Berger, Gabriel Stanovsky, Omri Abend, Lea Frermann

Recent advances in self-supervised modeling of text and images open new opportunities for computational models of child language acquisition, which is believed to rely heavily on cross-modal signals.

Language Acquisition Object Recognition

On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations

no code implementations Findings (NAACL) 2022 Roy Schwartz, Gabriel Stanovsky

Recent work has shown that deep learning models in NLP are highly sensitive to low-level correlations between simple features and specific output labels, leading to overfitting and lack of generalization.

Common Sense Reasoning World Knowledge

Filling the Gaps in Ancient Akkadian Texts: A Masked Language Modelling Approach

1 code implementation EMNLP 2021 Koren Lazar, Benny Saret, Asaf Yehudai, Wayne Horowitz, Nathan Wasserman, Gabriel Stanovsky

We present models which complete missing text given transliterations of ancient Mesopotamian documents, originally written on cuneiform clay tablets (2500 BCE - 100 CE).

Language Modelling

Data Efficient Masked Language Modeling for Vision and Language

1 code implementation Findings (EMNLP) 2021 Yonatan Bitton, Gabriel Stanovsky, Michael Elhadad, Roy Schwartz

We investigate a range of alternative masking strategies specific to the cross-modal setting that address these shortcomings, aiming for better fusion of text and image in the learned representation.

Language Modelling Masked Language Modeling +1

Realistic Evaluation Principles for Cross-document Coreference Resolution

1 code implementation Joint Conference on Lexical and Computational Semantics 2021 Arie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi, Ido Dagan

We point out that common evaluation practices for cross-document coreference resolution have been unrealistically permissive in their assumed settings, yielding inflated results.

coreference-resolution Cross Document Coreference Resolution

Cross-document Coreference Resolution over Predicted Mentions

1 code implementation Findings (ACL) 2021 Arie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi, Ido Dagan

Here, we introduce the first end-to-end model for CD coreference resolution from raw text, which extends the prominent model for within-document coreference to the CD setting.

coreference-resolution Cross Document Coreference Resolution

Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA

2 code implementations NAACL 2021 Yonatan Bitton, Gabriel Stanovsky, Roy Schwartz, Michael Elhadad

Recent works have shown that supervised models often exploit data artifacts to achieve good test scores while their performance severely degrades on samples outside their training distribution.

Question Answering Relational Reasoning +1

Process-Level Representation of Scientific Protocols with Interactive Annotation

2 code implementations EACL 2021 Ronen Tamari, Fan Bai, Alan Ritter, Gabriel Stanovsky

We develop Process Execution Graphs (PEG), a document-level representation of real-world wet lab biochemistry protocols, addressing challenges such as cross-sentence relations, long-range coreference, grounding, and implicit arguments.

Relation Extraction Sentence

Gender Coreference and Bias Evaluation at WMT 2020

1 code implementation WMT (EMNLP) 2020 Tom Kocmi, Tomasz Limisiewicz, Gabriel Stanovsky

Our work presents the largest evidence for the phenomenon in more than 19 systems submitted to the WMT over four diverse target languages: Czech, German, Polish, and Russian.

Machine Translation Translation

MOCHA: A Dataset for Training and Evaluating Generative Reading Comprehension Metrics

1 code implementation EMNLP 2020 Anthony Chen, Gabriel Stanovsky, Sameer Singh, Matt Gardner

Posing reading comprehension as a generation problem provides a great deal of flexibility, allowing for open-ended questions with few restrictions on possible answers.

Question Answering Reading Comprehension

Streamlining Cross-Document Coreference Resolution: Evaluation and Modeling

2 code implementations23 Sep 2020 Arie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi, Ido Dagan

Recent evaluation protocols for Cross-document (CD) coreference resolution have often been inconsistent or lenient, leading to incomparable results across works and overestimation of performance.

coreference-resolution Cross Document Coreference Resolution +2

Active Learning for Coreference Resolution using Discrete Annotation

1 code implementation ACL 2020 Belinda Z. Li, Gabriel Stanovsky, Luke Zettlemoyer

We improve upon pairwise annotation for active learning in coreference resolution, by asking annotators to identify mention antecedents if a presented mention pair is deemed not coreferent.

Active Learning Clustering +1

The Right Tool for the Job: Matching Model and Instance Complexities

1 code implementation ACL 2020 Roy Schwartz, Gabriel Stanovsky, Swabha Swayamdipta, Jesse Dodge, Noah A. Smith

Our method presents a favorable speed/accuracy tradeoff in almost all cases, producing models which are up to five times faster than the state of the art, while preserving their accuracy.

Natural Language Inference text-classification +1

Ecological Semantics: Programming Environments for Situated Language Understanding

no code implementations10 Mar 2020 Ronen Tamari, Gabriel Stanovsky, Dafna Shahaf, Reut Tsarfaty

Large-scale natural language understanding (NLU) systems have made impressive progress: they can be applied flexibly across a variety of tasks, and employ minimal structural assumptions.

Common Sense Reasoning Grounded language learning +1

Y'all should read this! Identifying Plurality in Second-Person Personal Pronouns in English Texts

no code implementations WS 2019 Gabriel Stanovsky, Ronen Tamari

Distinguishing between singular and plural {``}you{''} in English is a challenging task which has potential for downstream applications, such as machine translation or coreference resolution.

coreference-resolution Machine Translation +1

Evaluating Question Answering Evaluation

no code implementations WS 2019 Anthony Chen, Gabriel Stanovsky, Sameer Singh, Matt Gardner

Our study suggests that while current metrics may be suitable for existing QA datasets, they limit the complexity of QA datasets that can be created.

Answer Generation Multiple-choice +1

Yall should read this! Identifying Plurality in Second-Person Personal Pronouns in English Texts

1 code implementation26 Oct 2019 Gabriel Stanovsky, Ronen Tamari

Distinguishing between singular and plural "you" in English is a challenging task which has potential for downstream applications, such as machine translation or coreference resolution.

coreference-resolution Machine Translation +1

Evaluating Gender Bias in Machine Translation

1 code implementation ACL 2019 Gabriel Stanovsky, Noah A. Smith, Luke Zettlemoyer

We present the first challenge set and evaluation protocol for the analysis of gender bias in machine translation (MT).

coreference-resolution Machine Translation +2

Semantics as a Foreign Language

no code implementations EMNLP 2018 Gabriel Stanovsky, Ido Dagan

We propose a novel approach to semantic dependency parsing (SDP) by casting the task as an instance of multi-lingual machine translation, where each semantic representation is a different foreign dialect.

Dependency Parsing Machine Translation +3

Crowdsourcing Question-Answer Meaning Representations

1 code implementation NAACL 2018 Julian Michael, Gabriel Stanovsky, Luheng He, Ido Dagan, Luke Zettlemoyer

We introduce Question-Answer Meaning Representations (QAMRs), which represent the predicate-argument structure of a sentence as a set of question-answer pairs.

Sentence

Integrating Deep Linguistic Features in Factuality Prediction over Unified Datasets

no code implementations ACL 2017 Gabriel Stanovsky, Judith Eckle-Kohler, Yevgeniy Puzikov, Ido Dagan, Iryna Gurevych

Previous models for the assessment of commitment towards a predicate in a sentence (also known as factuality prediction) were trained and tested against a specific annotated dataset, subsequently limiting the generality of their results.

Knowledge Base Population Question Answering +1

A Consolidated Open Knowledge Representation for Multiple Texts

1 code implementation WS 2017 Rachel Wities, Vered Shwartz, Gabriel Stanovsky, Meni Adler, Ori Shapira, Shyam Upadhyay, Dan Roth, Eugenio Martinez Camara, Iryna Gurevych, Ido Dagan

We propose to move from Open Information Extraction (OIE) ahead to Open Knowledge Representation (OKR), aiming to represent information conveyed jointly in a set of texts in an open text-based manner.

Lexical Entailment Open Information Extraction

Recognizing Mentions of Adverse Drug Reaction in Social Media Using Knowledge-Infused Recurrent Models

no code implementations EACL 2017 Gabriel Stanovsky, Daniel Gruhl, Pablo Mendes

Recognizing mentions of Adverse Drug Reactions (ADR) in social media is challenging: ADR mentions are context-dependent and include long, varied and unconventional descriptions as compared to more formal medical symptom terminology.

Active Learning Knowledge Graph Embeddings

Modeling Extractive Sentence Intersection via Subtree Entailment

no code implementations COLING 2016 Omer Levy, Ido Dagan, Gabriel Stanovsky, Judith Eckle-Kohler, Iryna Gurevych

Sentence intersection captures the semantic overlap of two texts, generalizing over paradigms such as textual entailment and semantic text similarity.

Abstractive Text Summarization Natural Language Inference +2

Getting More Out Of Syntax with PropS

no code implementations4 Mar 2016 Gabriel Stanovsky, Jessica Ficler, Ido Dagan, Yoav Goldberg

Semantic NLP applications often rely on dependency trees to recognize major elements of the proposition structure of sentences.

Open Information Extraction

Cannot find the paper you are looking for? You can Submit a new open access paper.