no code implementations • ACL (CASE) 2021 • Elizabeth Boschee
I will compare them with approaches based on machine translation (as well as with models trained using in-language training data, where available), and discuss their strengths and weaknesses in different contexts, including the amount of English/foreign bitext available and the nature of the target event ontology.
no code implementations • 9 Aug 2024 • Steven Fincke, Elizabeth Boschee
The task of deciding whether two documents are written by the same author is challenging for both machines and humans.
no code implementations • 1 Aug 2024 • Steven Fincke, Adrien Bibal, Elizabeth Boschee
Large Language Models (LLMs) such as GPT-4 have shown enough promise in the few-shot learning context to suggest use in the generation of "silver" data and refinement of new ontologies through iterative application and review.
1 code implementation • 24 Jun 2024 • Sahana Ramnath, Kartik Pandey, Elizabeth Boschee, Xiang Ren
Authorship Verification (AV) (do two documents have the same author?)
no code implementations • 17 May 2023 • Chris Jenkins, Shantanu Agarwal, Joel Barry, Steven Fincke, Elizabeth Boschee
In this paper, we present ISI-Clear, a state-of-the-art, cross-lingual, zero-shot event extraction system and accompanying user interface for event visualization & search.
1 code implementation • 22 Feb 2023 • Shantanu Agarwal, Steven Fincke, Chris Jenkins, Scott Miller, Elizabeth Boschee
Taking the task of cross-lingual event detection as a motivating example, we show that the choice of pooling strategy can have a significant impact on the target language performance.
no code implementations • 25 Sep 2021 • Steven Fincke, Shantanu Agarwal, Scott Miller, Elizabeth Boschee
We show that by enabling the language model to better compensate for the deficits of sparse and noisy training data, our approach improves both trigger and argument detection and classification significantly over the state of the art in a zero-shot cross-lingual setting.
no code implementations • 10 Sep 2021 • Dong-Ho Lee, Ravi Kiran Selvam, Sheikh Muhammad Sarwar, Bill Yuchen Lin, Fred Morstatter, Jay Pujara, Elizabeth Boschee, James Allan, Xiang Ren
Deep neural models for named entity recognition (NER) have shown impressive results in overcoming label scarcity and generalizing to unseen entities by leveraging distant supervision and auxiliary information such as explanations.
Low Resource Named Entity Recognition
named-entity-recognition
+2
2 code implementations • NAACL 2022 • I-Hung Hsu, Kuan-Hao Huang, Elizabeth Boschee, Scott Miller, Prem Natarajan, Kai-Wei Chang, Nanyun Peng
Given a passage and a manually designed prompt, DEGREE learns to summarize the events mentioned in the passage into a natural sentence that follows a predefined pattern.
2 code implementations • Findings of the Association for Computational Linguistics 2020 • Qinyuan Ye, Xiao Huang, Elizabeth Boschee, Xiang Ren
Advances in machine reading comprehension (MRC) rely heavily on the collection of large scale human-annotated examples in the form of (question, paragraph, answer) triples.
no code implementations • LREC 2020 • Joel Barry, Elizabeth Boschee, Marjorie Freedman, Scott Miller
We describe an approach to cross lingual information retrieval that does not rely on explicit translation of either document or query terms.
no code implementations • ACL 2020 • Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Jamin Chen, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves, Xiang Ren
Successfully training a deep neural network demands a huge corpus of labeled data.
1 code implementation • CONLL 2019 • Xiao Huang, Li Dong, Elizabeth Boschee, Nanyun Peng
Named entity recognition (NER) identifies typed entity mentions in raw text.
Ranked #11 on
Named Entity Recognition (NER)
on NCBI-disease
no code implementations • ACL 2019 • Elizabeth Boschee, Joel Barry, Jayadev Billa, Marjorie Freedman, Thamme Gowda, Constantine Lignos, Chester Palen-Michel, Michael Pust, Banriskhem Kayang Khonglah, Srikanth Madikeri, Jonathan May, Scott Miller
In this paper we present an end-to-end cross-lingual information retrieval (CLIR) and summarization system for low-resource languages that 1) enables English speakers to search foreign language repositories of text and audio using English queries, 2) summarizes the retrieved documents in English with respect to a particular information need, and 3) provides complete transcriptions and translations as needed.
no code implementations • CL 2018 • Ralph Weischedel, Elizabeth Boschee
Though information extraction (IE) research has more than a 25-year history, F1 scores remain low.
no code implementations • EMNLP 2016 • Ferhan Ture, Elizabeth Boschee
In multilingual question answering, either the question needs to be translated into the document language, or vice versa.