1 code implementation • NAACL (SIGTYP) 2021 • Hemanth Kandula, Bonan Min
Sentiment analysis has come a long way for high-resource languages due to the availability of large annotated corpora.
no code implementations • NAACL (HCINLP) 2022 • Mihai Surdeanu, John Hungerford, Yee Seng Chan, Jessica MacBride, Benjamin Gyori, Andrew Zupon, Zheng Tang, Haoling Qiu, Bonan Min, Yan Zverev, Caitlin Hilverman, Max Thomas, Walter Andrews, Keith Alcock, Zeyu Zhang, Michael Reynolds, Steven Bethard, Rebecca Sharp, Egoitz Laparra
An existing domain taxonomy for normalizing content is often assumed when discussing approaches to information extraction, yet often in real-world scenarios there is none. When one does exist, as the information needs shift, it must be continually extended.
1 code implementation • EMNLP (NLP-COVID19) 2020 • Panagiotis Lymperopoulos, Haoling Qiu, Bonan Min
In this paper, we address the problem of concept wikification for COVID-19, which is to automatically recognize mentions of concepts related to COVID-19 in text and resolve them into Wikipedia titles.
1 code implementation • CRAC (ACL) 2021 • Bonan Min
In this paper, we develop bilingual transfer learning approaches to improve Arabic coreference resolution by leveraging additional English annotation via bilingual or multilingual pre-trained transformers.
no code implementations • EMNLP 2021 • Minh Van Nguyen, Tuan Ngo Nguyen, Bonan Min, Thien Huu Nguyen
To address this issue, we propose a novel crosslingual alignment method that leverages class information of REE tasks for representation learning.
no code implementations • EMNLP 2020 • Wenlin Yao, Zeyu Dai, Maitreyi Ramaswamy, Bonan Min, Ruihong Huang
We first obtain the initial set of event pairs that are likely to have the subevent relation, by exploiting two observations that 1) subevents are temporally contained by the parent event, and 2) the definitions of the parent event can be used to further guide the identification of subevents.
no code implementations • EMNLP 2020 • Jiarui Yao, Haoling Qiu, Bonan Min, Nianwen Xue
We present the construction of a corpus of 500 Wikinews articles annotated with temporal dependency graphs (TDGs) that can be used to train systems to understand temporal relations in text.
no code implementations • EMNLP 2021 • Amir Pouran Ben Veyseh, Minh Van Nguyen, Nghia Ngo Trung, Bonan Min, Thien Huu Nguyen
To address this issue, we propose a novel method to model document-level context for ED that dynamically selects relevant sentences in the document for the event prediction of the target sentence.
no code implementations • Findings (ACL) 2022 • Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Bonan Min, Thien Nguyen
Event Argument Extraction (EAE) is one of the sub-tasks of event extraction, aiming to recognize the role of each entity mention toward a specific event trigger.
no code implementations • NAACL 2022 • Minh Van Nguyen, Bonan Min, Franck Dernoncourt, Thien Nguyen
However, previous JointIE models often assume heuristic manually-designed dependency between the task instances and mean-field factorization for the joint distribution of instance labels, thus unable to capture optimal dependencies among instances and labels to improve representation learning and IE performance.
1 code implementation • NAACL 2022 • Jiarui Yao, Nianwen Xue, Bonan Min
The task of modal dependency parsing aims to parse a text into its modal dependency structure, which is a representation for the factuality of events in the text.
no code implementations • 16 Oct 2024 • Siyi Liu, Qiang Ning, Kishaloy Halder, Wei Xiao, Zheng Qi, Phu Mon Htut, Yi Zhang, Neha Anna John, Bonan Min, Yassine Benajiba, Dan Roth
Open domain question answering systems frequently rely on information retrieved from large collections of text (such as the Web) to answer questions.
no code implementations • 7 Sep 2024 • Hemanth Kandula, Damianos Karakos, Haoling Qiu, Benjamin Rozonoyer, Ian Soboroff, Lee Tarlin, Bonan Min
We present a novel, interactive system called $\textit{QueryBuilder}$, which allows a novice, English-speaking user to create queries with a small amount of effort, through efficient exploration of an English development corpus in order to rapidly develop cross-lingual information retrieval queries corresponding to the user's information needs.
Cross-Lingual Information Retrieval Efficient Exploration +1
1 code implementation • 31 Jul 2024 • Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang
Surprisingly, we find that less is more, as training ReSet with high-quality, yet substantially smaller data (three-fold less) yields superior results.
1 code implementation • 19 Jul 2024 • Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, Vittorio Castelli
Question answering based on retrieval augmented generation (RAG-QA) is an important research topic in NLP and has a wide range of real-world applications.
no code implementations • 24 Apr 2024 • Jiaqing Yuan, Lin Pan, Chung-Wei Hang, Jiang Guo, Jiarong Jiang, Bonan Min, Patrick Ng, Zhiguo Wang
By further decoupling model known and unknown knowledge, we find the degradation is attributed to exemplars that contradict a model's known knowledge, as well as the number of such exemplars.
no code implementations • 10 Mar 2024 • Fei Wang, Chao Shang, Sarthak Jain, Shuai Wang, Qiang Ning, Bonan Min, Vittorio Castelli, Yassine Benajiba, Dan Roth
We investigate common constraints in NLP tasks, categorize them into three classes based on the types of their arguments, and propose a unified framework, ACT (Aligning to ConsTraints), to automatically produce supervision signals for user alignment with constraints.
no code implementations • 25 Oct 2023 • Yoshinari Fujinuma, Siddharth Varia, Nishant Sankaran, Srikar Appalaraju, Bonan Min, Yogarshi Vyas
Document image classification is different from plain-text document classification and consists of classifying a document by understanding the content and structure of documents such as forms, emails, and other such documents.
no code implementations • 10 Aug 2023 • Alexander Hanbo Li, Mingyue Shang, Evangelia Spiliopoulou, Jie Ma, Patrick Ng, Zhiguo Wang, Bonan Min, William Wang, Kathleen McKeown, Vittorio Castelli, Dan Roth, Bing Xiang
We present a novel approach for structured data-to-text generation that addresses the limitations of existing methods that primarily focus on specific types of structured data.
1 code implementation • Findings (NAACL) 2022 • Oscar Sainz, Itziar Gonzalez-Dios, Oier Lopez de Lacalle, Bonan Min, Eneko Agirre
In this work we show that entailment is also effective in Event Argument Extraction (EAE), reducing the need of manual annotation to 50% and 20% in ACE and WikiEvents respectively, while achieving the same performance as with full training.
Ranked #1 on Event Argument Extraction on WikiEvents
2 code implementations • NAACL (ACL) 2022 • Oscar Sainz, Haoling Qiu, Oier Lopez de Lacalle, Eneko Agirre, Bonan Min
The current workflow for Information Extraction (IE) analysts involves the definition of the entities/relations of interest and a training corpus with annotated examples.
1 code implementation • NAACL (ACL) 2022 • Minh Van Nguyen, Nghia Trung Ngo, Bonan Min, Thien Huu Nguyen
FAMIE is designed to address a fundamental problem in existing AL frameworks where annotators need to wait for a long time between annotation batches due to the time-consuming nature of model training and data selection at each AL iteration.
no code implementations • 1 Nov 2021 • Bonan Min, Hayley Ross, Elior Sulem, Amir Pouran Ben Veyseh, Thien Huu Nguyen, Oscar Sainz, Eneko Agirre, Ilana Heinz, Dan Roth
Large, pre-trained transformer-based language models such as BERT have drastically changed the Natural Language Processing (NLP) field.
1 code implementation • ACL 2021 • Jiarui Yao, Haoling Qiu, Jin Zhao, Bonan Min, Nianwen Xue
In this paper, we frame factuality assessment as a modal dependency parsing task that identifies the events and their sources, formally known as conceivers, and then determine the level of certainty that the sources are asserting with respect to the events.
no code implementations • EMNLP (ACL) 2021 • Bonan Min, Benjamin Rozonoyer, Haoling Qiu, Alexander Zamanian, Jessica MacBride
Timely responses from policy makers to mitigate the impact of the COVID-19 pandemic rely on a comprehensive grasp of events, their causes, and their impacts.
no code implementations • LREC 2020 • Bonan Min, Yee Seng Chan, Lingjun Zhao
Previous approaches treat event extraction as {``}one size fits all{''} with an ontology defined a priori.
1 code implementation • EMNLP 2020 • Hayley Ross, Jonathon Cai, Bonan Min
Extracting temporal relations between events and time expressions has many applications such as constructing event timelines and time-related question answering.
no code implementations • IJCNLP 2019 • Bonan Min, Xiaoxi Zhao
Socio-economic conditions are difficult to measure.
no code implementations • IJCNLP 2019 • Bonan Min, Yee Seng Chan, Haoling Qiu, Joshua Fasching
Solving long-lasting problems such as food insecurity requires a comprehensive understanding of interventions applied by governments and international humanitarian assistance organizations, and their results and consequences.
no code implementations • WS 2018 • Lisheng Fu, Bonan Min, Thien Huu Nguyen, Ralph Grishman
Typical relation extraction models are trained on a single corpus annotated with a pre-defined relation schema.
no code implementations • ACL 2019 • Yee Seng Chan, Joshua Fasching, Haoling Qiu, Bonan Min
We present a system for rapidly customizing event extraction capability to find new event types and their arguments.
no code implementations • IJCNLP 2017 • Bonan Min, Zhuolin Jiang, Marjorie Freedman, Ralph Weischedel
The learnt representation is discriminative and transferable between languages.
no code implementations • IJCNLP 2017 • Lisheng Fu, Thien Huu Nguyen, Bonan Min, Ralph Grishman
Our method is a joint model consisting of a CNN-based relation classifier and a domain-adversarial classifier.
no code implementations • EACL 2017 • Bonan Min, Marjorie Freedman, Talya Meltzer
Building knowledge bases (KB) automatically from text corpora is crucial for many applications such as question answering and web search.
no code implementations • LREC 2012 • Bonan Min, Ralph Grishman
The Knowledge Based Population (KBP) evaluation track of the Text Analysis Conferences (TAC) has been held for the past 3 years.