1 code implementation • 1 Nov 2024 • Yang Janet Liu, Tatsuya Aoyama, Wesley Scivetti, YIlun Zhu, Shabnam Behzad, Lauren Elizabeth Levine, Jessica Lin, Devika Tiwari, Amir Zeldes
Work on shallow discourse parsing in English has focused on the Wall Street Journal corpus, the only large-scale dataset for the language in the PDTB framework.
1 code implementation • 6 Aug 2024 • Heidi C. Zhang, Shabnam Behzad, Kawin Ethayarajh, Dan Jurafsky
Model checklists (Ribeiro et al., 2020) have emerged as a useful tool for understanding the behavior of LLMs, analogous to unit-testing in software engineering.
1 code implementation • 29 Jan 2024 • William Gantt, Shabnam Behzad, Hannah Youngeun An, Yunmo Chen, Aaron Steven White, Benjamin Van Durme, Mahsa Yarmohammadi
We introduce MultiMUC, the first multilingual parallel corpus for template filling, comprising translations of the classic MUC-4 template filling benchmark into five languages: Arabic, Chinese, Farsi, Korean, and Russian.
1 code implementation • 3 Jun 2023 • Tatsuya Aoyama, Shabnam Behzad, Luke Gessler, Lauren Levine, Jessica Lin, Yang Janet Liu, Siyao Peng, YIlun Zhu, Amir Zeldes
We evaluate state-of-the-art NLP systems on GENTLE and find severe degradation for at least some genres in their performance on all tasks, which indicates GENTLE's utility as an evaluation dataset for NLP systems.
no code implementations • 18 Dec 2022 • Shabnam Behzad, Amir Zeldes, Nathan Schneider
In this paper, we present strong baselines for the task of Feedback Comment Generation for Writing Learning.
1 code implementation • 1 May 2022 • Shabnam Behzad, Keisuke Sakaguchi, Nathan Schneider, Amir Zeldes
We present ELQA, a corpus of questions and answers in and about the English language.
1 code implementation • EMNLP (DISRPT) 2021 • Luke Gessler, Shabnam Behzad, Yang Janet Liu, Siyao Peng, YIlun Zhu, Amir Zeldes
This paper describes our submission to the DISRPT2021 Shared Task on Discourse Unit Segmentation, Connective Detection, and Relation Classification.
no code implementations • SEMEVAL 2020 • Michael Kranzlein, Shabnam Behzad, Nazli Goharian
This paper presents our systems for SemEval 2020 Shared Task 11: Detection of Propaganda Techniques in News Articles.
1 code implementation • LREC 2020 • Luke Gessler, Siyao Peng, Yang Liu, YIlun Zhu, Shabnam Behzad, Amir Zeldes
We present a freely available, genre-balanced English web corpus totaling 4M tokens and featuring a large number of high-quality automatic annotation layers, including dependency trees, non-named entity annotations, coreference resolution, and discourse trees in Rhetorical Structure Theory.
1 code implementation • LREC 2020 • Shabnam Behzad, Amir Zeldes
However, when these models are applied to other corpora with different genres, and especially user-generated data from the Web, we see substantial drops in performance.