Search Results for author: Amir Zeldes

Found 35 papers, 8 papers with code

The Making of Coptic Wordnet

no code implementations GWC 2019 Laura Slaughter, Luis Morgado Da Costa, So Miyagawa, Marco Büchler, Amir Zeldes, Heike Behlmer

With the increasing availability of wordnets for ancient languages, such as Ancient Greek and Latin, gaps remain in the coverage of less studied languages of antiquity.

The DISRPT 2021 Shared Task on Elementary Discourse Unit Segmentation, Connective Detection, and Relation Classification

no code implementations EMNLP (DISRPT) 2021 Amir Zeldes, Yang Janet Liu, Mikel Iruskieta, Philippe Muller, Chloé Braud, Sonia Badene

In 2021, we organized the second iteration of a shared task dedicated to the underlying units used in discourse parsing across formalisms: the DISRPT Shared Task (Discourse Relation Parsing and Treebanking).

connective detection Relation Classification

A Balanced and Broadly Targeted Computational Linguistics Curriculum

no code implementations NAACL (TeachingNLP) 2021 Emma Manning, Nathan Schneider, Amir Zeldes

This paper describes the primarily-graduate computational linguistics and NLP curriculum at Georgetown University, a U.S. university that has seen significant growth in these areas in recent years.

ELQA: A Corpus of Questions and Answers about the English Language

1 code implementation 1 May 2022 Shabnam Behzad, Keisuke Sakaguchi, Nathan Schneider, Amir Zeldes

We introduce a community-sourced dataset for English Language Question Answering (ELQA), which consists of more than 180k questions and answers on numerous topics about English language such as grammar, meaning, fluency, and etymology.

Answer Generation Question Answering

Can we Fix the Scope for Coreference? Problems and Solutions for Benchmarks beyond OntoNotes

no code implementations 17 Dec 2021 Amir Zeldes

Current work on automatic coreference resolution has focused on the OntoNotes benchmark dataset, due to both its size and consistency.

Coreference Resolution

WikiGUM: Exhaustive Entity Linking for Wikification in 12 Genres

no code implementations EMNLP (LAW, DMR) 2021 Jessica Lin, Amir Zeldes

Previous work on Entity Linking has focused on resources targeting non-nested proper named entity mentions, often in data from Wikipedia, i.e. Wikification.

Entity Linking

Mischievous Nominal Constructions in Universal Dependencies

no code implementations UDW (SyntaxFest) 2021 Nathan Schneider, Amir Zeldes

While the highly multilingual Universal Dependencies (UD) project provides extensive guidelines for clausal structure as well as structure within canonical nominal phrases, a standard treatment is lacking for many "mischievous" nominal phenomena that break the mold.

Treebanking User-Generated Content: a UD Based Overview of Guidelines, Corpora and Unified Recommendations

no code implementations 3 Nov 2020 Manuela Sanguinetti, Lauren Cassidy, Cristina Bosco, Özlem Çetinoğlu, Alessandra Teresa Cignarella, Teresa Lynn, Ines Rehbein, Josef Ruppenhofer, Djamé Seddah, Amir Zeldes

This article presents a discussion on the main linguistic phenomena which cause difficulties in the analysis of user-generated texts found on the web and in social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework of syntactic analysis.

Exhaustive Entity Recognition for Coptic: Challenges and Solutions

no code implementations COLING (LaTeCHCLfL, CLFL, LaTeCH) 2020 Amir Zeldes, Lance Martin, Sichang Tu

Entity recognition provides semantic access to ancient materials in the Digital Humanities: it exposes people and places of interest in texts that cannot be read exhaustively, facilitates linking resources and can provide a window into text contents, even for texts with no translations.

Entity Linking NER

AMALGUM -- A Free, Balanced, Multilayer English Web Corpus

1 code implementation LREC 2020 Luke Gessler, Siyao Peng, Yang Liu, Yilun Zhu, Shabnam Behzad, Amir Zeldes

We present a freely available, genre-balanced English web corpus totaling 4M tokens and featuring a large number of high-quality automatic annotation layers, including dependency trees, non-named entity annotations, coreference resolution, and discourse trees in Rhetorical Structure Theory.

Coreference Resolution Natural Language Processing

Treebanking User-Generated Content: A Proposal for a Unified Representation in Universal Dependencies

no code implementations LREC 2020 Manuela Sanguinetti, Cristina Bosco, Lauren Cassidy, Özlem Çetinoğlu, Alessandra Teresa Cignarella, Teresa Lynn, Ines Rehbein, Josef Ruppenhofer, Djamé Seddah, Amir Zeldes

The paper presents a discussion on the main linguistic phenomena of user-generated texts found in web and social media, and proposes a set of annotation guidelines for their treatment within the Universal Dependencies (UD) framework.

A Cross-Genre Ensemble Approach to Robust Reddit Part of Speech Tagging

1 code implementation LREC 2020 Shabnam Behzad, Amir Zeldes

However, when these models are applied to other corpora with different genres, and especially user-generated data from the Web, we see substantial drops in performance.

Part-Of-Speech Tagging

A Neural Approach to Discourse Relation Signal Detection

no code implementations 8 Jan 2020 Amir Zeldes, Yang Liu

Previous data-driven work investigating the types and distributions of discourse relation signals, including discourse markers such as 'however' or phrases such as 'as a result' has focused on the relative frequencies of signal words within and outside text from each discourse relation.

Relation Classification

A Collaborative Ecosystem for Digital Coptic Studies

no code implementations 11 Dec 2019 Caroline T. Schroeder, Amir Zeldes

Scholarship on underresourced languages brings with it a variety of challenges which make access to the full spectrum of source materials and their evaluation difficult.

Natural Language Processing

All Roads Lead to UD: Converting Stanford and Penn Parses to English Universal Dependencies with Multilayer Annotations

no code implementations COLING 2018 Siyao Peng, Amir Zeldes

We describe and evaluate different approaches to the conversion of gold standard corpus data from Stanford Typed Dependencies (SD) and Penn-style constituent trees to the latest English Universal Dependencies representation (UD 2.2).

Coreference Resolution NER

A Discourse Signal Annotation System for RST Trees

1 code implementation WS 2019 Luke Gessler, Yang Liu, Amir Zeldes

This paper presents a new system for open-ended discourse relation signal annotation in the framework of Rhetorical Structure Theory (RST), implemented on top of an online tool for RST annotation.

The DISRPT 2019 Shared Task on Elementary Discourse Unit Segmentation and Connective Detection

no code implementations WS 2019 Amir Zeldes, Debopam Das, Erick Galani Maziero, Juliano Antonio, Mikel Iruskieta

In 2019, we organized the first iteration of a shared task dedicated to the underlying units used in discourse parsing across formalisms: the DISRPT Shared Task on Elementary Discourse Unit Segmentation and Connective Detection.

connective detection

The Coptic Universal Dependency Treebank

no code implementations WS 2018 Amir Zeldes, Mitchell Abrams

This paper presents the Coptic Universal Dependency Treebank, the first dependency treebank within the Egyptian subfamily of the Afro-Asiatic languages.

A Characterwise Windowed Approach to Hebrew Morphological Segmentation

1 code implementation WS 2018 Amir Zeldes

This paper presents a novel approach to the segmentation of orthographic word forms in contemporary Hebrew, focusing purely on splitting without carrying out morphological analysis or disambiguation.

General Classification Morphological Analysis

A Linked Coptic Dictionary Online

no code implementations COLING 2018 Frank Feder, Maxim Kupreyev, Emma Manning, Caroline T. Schroeder, Amir Zeldes

We describe a new project publishing a freely available online dictionary for Coptic.

A Predictive Model for Notional Anaphora in English

no code implementations WS 2018 Amir Zeldes

Notional anaphors are pronouns which disagree with their antecedents' grammatical categories for notional reasons, such as plural to singular agreement in: 'the government ... they'.

Coreference Resolution Referring Expression

A Deeper Look into Dependency-Based Word Embeddings

no code implementations NAACL 2018 Sean MacAvaney, Amir Zeldes

We investigate the effect of various dependency-based word embeddings on distinguishing between functional and domain similarity, word similarity rankings, and two downstream tasks in English.

Word Embeddings Word Similarity

Serialising the ISO SynAF Syntactic Object Model

no code implementations 2 Aug 2011 Laurent Romary, Amir Zeldes, Florian Zipser

This paper introduces <tiger2/>, an XML format developed to serialise the object model defined by the ISO Syntactic Annotation Framework SynAF.
