no code implementations • WMT (EMNLP) 2020 • Haijiang Wu, Zixuan Wang, Qingsong Ma, Xinjie Wen, Ruichen Wang, Xiaoli Wang, Yulin Zhang, Zhipeng Yao, Siyao Peng
This paper presents Tencent’s submission to the WMT20 Quality Estimation (QE) Shared Task: Sentence-Level Post-editing Effort for English-Chinese in Task 2.
1 code implementation • 20 Sep 2023 • YIlun Zhu, Siyao Peng, Sameer Pradhan, Amir Zeldes
Previous attempts to incorporate a mention detection step into end-to-end neural coreference resolution for English have been hampered by the lack of singleton mention span data as well as other entity information.
Ranked #1 on
Coreference Resolution
on OntoGUM
no code implementations • 3 Jun 2023 • Tatsuya Aoyama, Shabnam Behzad, Luke Gessler, Lauren Levine, Jessica Lin, Yang Janet Liu, Siyao Peng, YIlun Zhu, Amir Zeldes
We evaluate state-of-the-art NLP systems on GENTLE and find severe degradation for at least some genres in their performance on all tasks, which indicates GENTLE's utility as an evaluation dataset for NLP systems.
1 code implementation • 19 Oct 2022 • Siyao Peng, Yang Janet Liu, Amir Zeldes
A lack of large-scale human-annotated data has hampered the hierarchical discourse parsing of Chinese.
no code implementations • 11 Oct 2022 • Siyao Peng, Yang Janet Liu, Amir Zeldes
This document provides extensive guidelines and examples for Rhetorical Structure Theory (RST) annotation in Mandarin Chinese.
1 code implementation • COLING (LAW) 2020 • Michael Kranzlein, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, Bradford Salen, Nathan Schneider
We present the Prepositions Annotated with Supersense Tags in Reddit International English ("PASTRIE") corpus, a new dataset containing manually annotated preposition supersenses of English data from presumed speakers of four L1s: English, French, German, and Spanish.
1 code implementation • EMNLP (DISRPT) 2021 • Luke Gessler, Shabnam Behzad, Yang Janet Liu, Siyao Peng, YIlun Zhu, Amir Zeldes
This paper describes our submission to the DISRPT2021 Shared Task on Discourse Unit Segmentation, Connective Detection, and Relation Classification.
1 code implementation • LREC 2020 • Luke Gessler, Siyao Peng, Yang Liu, YIlun Zhu, Shabnam Behzad, Amir Zeldes
We present a freely available, genre-balanced English web corpus totaling 4M tokens and featuring a large number of high-quality automatic annotation layers, including dependency trees, non-named entity annotations, coreference resolution, and discourse trees in Rhetorical Structure Theory.
no code implementations • LREC 2020 • Siyao Peng, Yang Liu, YIlun Zhu, Austin Blodgett, Yushi Zhao, Nathan Schneider
Adpositions are frequent markers of semantic relations, but they are highly ambiguous and vary significantly from language to language.
no code implementations • 2 Sep 2019 • Yue Yu, Siyao Peng, Grace Hui Yang
Previous work on DA recognition either assumes one DA per utterance or fails to realize the sequential nature of dialogues.
no code implementations • COLING 2018 • Siyao Peng, Amir Zeldes
We describe and evaluate different approaches to the conversion of gold standard corpus data from Stanford Typed Dependencies (SD) and Penn-style constituent trees to the latest English Universal Dependencies representation (UD 2. 2).
1 code implementation • WS 2019 • Yue Yu, YIlun Zhu, Yang Liu, Yan Liu, Siyao Peng, Mackenzie Gong, Amir Zeldes
In this paper we present GumDrop, Georgetown University's entry at the DISRPT 2019 Shared Task on automatic discourse unit segmentation and connective detection.
no code implementations • 6 Dec 2018 • YIlun Zhu, Yang Liu, Siyao Peng, Austin Blodgett, Yushi Zhao, Nathan Schneider
This study adapts Semantic Network of Adposition and Case Supersenses (SNACS) annotation to Mandarin Chinese and demonstrates that the same supersense categories are appropriate for Chinese adposition semantics.