Search Results for author: Rotem Dror

Found 15 papers, 9 papers with code

RESIN-11: Schema-guided Event Prediction for 11 Newsworthy Scenarios

1 code implementation • NAACL (ACL) 2022 • Xinya Du, Zixuan Zhang, Sha Li, Pengfei Yu, Hongwei Wang, Tuan Lai, Xudong Lin, Ziqi Wang, Iris Liu, Ben Zhou, Haoyang Wen, Manling Li, Darryl Hannan, Jie Lei, Hyounghun Kim, Rotem Dror, Haoyu Wang, Michael Regan, Qi Zeng, Qing Lyu, Charles Yu, Carl Edwards, Xiaomeng Jin, Yizhu Jiao, Ghazaleh Kazeminejad, Zhenhailong Wang, Chris Callison-Burch, Mohit Bansal, Carl Vondrick, Jiawei Han, Dan Roth, Shih-Fu Chang, Martha Palmer, Heng Ji

We introduce RESIN-11, a new schema-guided event extraction&prediction framework that can be applied to a large variety of newsworthy scenarios.

Event Extraction

Paper
Code

State of What Art? A Call for Multi-Prompt LLM Evaluation

1 code implementation • 31 Dec 2023 • Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky

Recent advances in large language models (LLMs) have led to the development of various evaluation benchmarks.

Paper
Code

DMLR: Data-centric Machine Learning Research -- Past, Present and Future

no code implementations • 21 Nov 2023 • Luis Oala, Manil Maskey, Lilith Bat-Leah, Alicia Parrish, Nezihe Merve Gürel, Tzu-Sheng Kuo, Yang Liu, Rotem Dror, Danilo Brajovic, Xiaozhe Yao, Max Bartolo, William A Gaviria Rojas, Ryan Hileman, Rainier Aliment, Michael W. Mahoney, Meg Risdal, Matthew Lease, Wojciech Samek, Debojyoti Dutta, Curtis G Northcutt, Cody Coleman, Braden Hancock, Bernard Koch, Girmaw Abebe Tadesse, Bojan Karlaš, Ahmed Alaa, Adji Bousso Dieng, Natasha Noy, Vijay Janapa Reddi, James Zou, Praveen Paritosh, Mihaela van der Schaar, Kurt Bollacker, Lora Aroyo, Ce Zhang, Joaquin Vanschoren, Isabelle Guyon, Peter Mattson

Drawing from discussions at the inaugural DMLR workshop at ICML 2023 and meetings prior, in this report we outline the relevance of community engagement and infrastructure development for the creation of next-generation public datasets that will advance machine learning science.

Paper
Add Code

The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics

1 code implementation • 30 Oct 2023 • Christoph Leiter, Juri Opitz, Daniel Deutsch, Yang Gao, Rotem Dror, Steffen Eger

Specifically, we propose a novel competition setting in which we select a list of allowed LLMs and disallow fine-tuning to ensure a focus on prompting.

Machine Translation Text Generation

Paper
Code

Human-in-the-Loop Schema Induction

no code implementations • 25 Feb 2023 • Tianyi Zhang, Isaac Tham, Zhaoyi Hou, Jiaxuan Ren, Liyang Zhou, Hainiu Xu, Li Zhang, Lara J. Martin, Rotem Dror, Sha Li, Heng Ji, Martha Palmer, Susan Brown, Reece Suchocki, Chris Callison-Burch

Schema induction builds a graph representation explaining how events unfold in a scenario.

Information Retrieval Retrieval

Paper
Add Code

On the Limitations of Reference-Free Evaluations of Generated Text

no code implementations • 22 Oct 2022 • Daniel Deutsch, Rotem Dror, Dan Roth

There is significant interest in developing evaluation metrics which accurately estimate the quality of generated text without the aid of a human-written reference text, which can be time consuming and expensive to collect or entirely unavailable in online applications.

Machine Translation

Paper
Add Code

Zero-Shot On-the-Fly Event Schema Induction

no code implementations • 12 Oct 2022 • Rotem Dror, Haoyu Wang, Dan Roth

The answers to these questions can be found by collecting many documents on the complex event of interest, extracting relevant information, and analyzing it.

Paper
Add Code

Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics

no code implementations • NAACL 2022 • Daniel Deutsch, Rotem Dror, Dan Roth

How reliably an automatic summarization evaluation metric replicates human judgments of summary quality is quantified by system-level correlations.

Paper
Add Code

A Statistical Analysis of Summarization Evaluation Metrics using Resampling Methods

1 code implementation • 31 Mar 2021 • Daniel Deutsch, Rotem Dror, Dan Roth

After evaluating which of the proposed methods is most appropriate for summarization through two simulation experiments, we analyze the results of applying these methods to several different automatic evaluation metrics across three sets of human annotations.

Paper
Code

The Structured Weighted Violations MIRA

1 code implementation • 9 May 2020 • Dor Ringel, Rotem Dror, Roi Reichart

We present the Structured Weighted Violation MIRA (SWVM), a new structured prediction algorithm that is based on an hybridization between MIRA (Crammer and Singer, 2003) and the structured weighted violations perceptron (SWVP) (Dror and Reichart, 2016).

Chunking named-entity-recognition +3

Paper
Code

Deep Dominance - How to Properly Compare Deep Neural Models

1 code implementation • ACL 2019 • Rotem Dror, Segev Shlomov, Roi Reichart

Comparing between Deep Neural Network (DNN) models based on their performance on unseen data is crucial for the progress of the NLP field.

Paper
Code

Appendix - Recommended Statistical Significance Tests for NLP Tasks

1 code implementation • 5 Sep 2018 • Rotem Dror, Roi Reichart

Statistical significance testing plays an important role when drawing conclusions from experimental results in NLP papers.

valid

223

Paper
Code

The Hitchhiker's Guide to Testing Statistical Significance in Natural Language Processing

1 code implementation • ACL 2018 • Rotem Dror, Gili Baumer, Segev Shlomov, Roi Reichart

We establish the fundamental concepts of significance testing and discuss the specific aspects of NLP tasks, experimental setups and evaluation measures that affect the choice of significance tests in NLP research.

223

Paper
Code

Replicability Analysis for Natural Language Processing: Testing Significance with Multiple Datasets

1 code implementation • TACL 2017 • Rotem Dror, Gili Baumer, Marina Bogomolov, Roi Reichart

With the ever-growing amounts of textual data from a large variety of languages, domains, and genres, it has become standard to evaluate NLP algorithms on multiple datasets in order to ensure consistent performance across heterogeneous setups.

Dependency Parsing General Classification +5

Paper
Code

The Structured Weighted Violations Perceptron Algorithm

no code implementations • EMNLP 2016 • Rotem Dror, Roi Reichart

We present the Structured Weighted Violations Perceptron (SWVP) algorithm, a new structured prediction algorithm that generalizes the Collins Structured Perceptron (CSP).

Dependency Parsing Generalization Bounds +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.