no code implementations • EMNLP 2020 • Rishi Bommasani, Claire Cardie
High quality data forms the bedrock for building meaningful statistical models in NLP.
no code implementations • FNP (COLING) 2020 • Siyan Zheng, Anneliese Lu, Claire Cardie
This paper describes the SUMSUM systems submitted to the Financial Narrative Summarization Shared Task (FNS-2020).
no code implementations • ICLR 2019 • Ryan Y. Benmalek, Madian Khabsa, Suma Desu, Claire Cardie, Michele Banko
In this paper we introduce the Scratchpad Encoder, a novel addition to the sequence to sequence (seq2seq) framework and explore its effectiveness in generating natural language questions from a given logical form.
no code implementations • LREC 2022 • Jennifer Tracey, Owen Rambow, Claire Cardie, Adam Dalton, Hoa Trang Dang, Mona Diab, Bonnie Dorr, Louise Guthrie, Magdalena Markowska, Smaranda Muresan, Vinodkumar Prabhakaran, Samira Shaikh, Tomek Strzalkowski
We present the BeSt corpus, which records cognitive state: who believes what (i. e., factuality), and who has what sentiment towards what.
no code implementations • 5 Sep 2024 • Yuntian Deng, Wenting Zhao, Jack Hessel, Xiang Ren, Claire Cardie, Yejin Choi
The increasing availability of real-world conversation data offers exciting opportunities for researchers to study user-chatbot interactions.
no code implementations • 24 Jul 2024 • Wenting Zhao, Tanya Goyal, Yu Ying Chiu, Liwei Jiang, Benjamin Newman, Abhilasha Ravichander, Khyathi Chandu, Ronan Le Bras, Claire Cardie, Yuntian Deng, Yejin Choi
While hallucinations of large language models (LLMs) prevail as a major challenge, existing evaluation benchmarks on factuality do not cover the diverse domains of knowledge that the real-world users of LLMs seek information about.
1 code implementation • 24 Jul 2024 • Wenting Zhao, Ge Gao, Claire Cardie, Alexander M. Rush
We curate CouldAsk, an evaluation benchmark composed of existing and new datasets for document-grounded question answering, specifically designed to study reformulating unanswerable questions.
1 code implementation • 7 Jun 2024 • Jinyan Su, John X. Morris, Preslav Nakov, Claire Cardie
Dense retrievers are widely used in information retrieval and have also been successfully extended to other knowledge intensive areas such as language models, e. g., Retrieval-Augmented Generation (RAG) systems.
no code implementations • 2 May 2024 • Wenting Zhao, Xiang Ren, Jack Hessel, Claire Cardie, Yejin Choi, Yuntian Deng
In addition to timestamped chat transcripts, we enrich the dataset with demographic data, including state, country, and hashed IP addresses, alongside request headers.
1 code implementation • 2 Nov 2023 • Jinyan Su, Claire Cardie, Preslav Nakov
With the proliferation of both human-written and machine-generated real and fake news, robustly and effectively discerning the veracity of news articles has become an intricate challenge.
1 code implementation • 23 Oct 2023 • Barry Wang, Xinya Du, Claire Cardie
This work is the first to apply the probing paradigm to representations learned for document-level information extraction (IE).
no code implementations • 6 Oct 2023 • Ge Gao, Jonathan D. Chang, Claire Cardie, Kianté Brantley, Thorsten Joachim
Text retrieval plays a crucial role in incorporating factual knowledge for decision making into language processing pipelines, ranging from chat-based web search to question answering systems.
no code implementations • 24 May 2023 • Wenting Zhao, Justin T. Chiu, Claire Cardie, Alexander M. Rush
Instead of using direct supervision, this work proposes an approach for abductive commonsense reasoning that exploits the fact that only a subset of explanations is correct for a given context.
no code implementations • 23 May 2023 • Wenting Zhao, Justin T. Chiu, Claire Cardie, Alexander M. Rush
Explainable multi-hop question answering (QA) not only predicts answers but also identifies rationales, i. e. subsets of input sentences used to derive the answers.
no code implementations • 3 May 2023 • Mengyun Shi, Claire Cardie, Serge Belongie
Does ads from other domains infer their fashion taste as well?
no code implementations • 3 May 2023 • Mengyun Shi, Serge Belongie, Claire Cardie
Existing fashion datasets do not consider the multi-facts that cause a consumer to like or dislike a fashion image.
1 code implementation • ACL 2022 • Aliva Das, Xinya Du, Barry Wang, Kejian Shi, Jiayuan Gu, Thomas Porter, Claire Cardie
Document-level information extraction (IE) tasks have recently begun to be revisited in earnest using the end-to-end neural network techniques that have been successful on their sentence-level IE counterparts.
1 code implementation • NAACL 2022 • Wenting Zhao, Konstantine Arkoudas, Weiqi Sun, Claire Cardie
Task-oriented parsing (TOP) aims to convert natural language into machine-readable representations of specific tasks, such as setting an alarm.
6 code implementations • 23 Mar 2022 • Menglin Jia, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie, Bharath Hariharan, Ser-Nam Lim
The current modus operandi in adapting pre-trained models involves updating all the backbone parameters, ie, full fine-tuning.
Ranked #2 on Prompt Engineering on ImageNet-21k
1 code implementation • 15 Dec 2021 • Menglin Jia, Bor-Chun Chen, Zuxuan Wu, Claire Cardie, Serge Belongie, Ser-Nam Lim
In this paper, we investigate $k$-Nearest-Neighbor (k-NN) classifiers, a classical model-free learning method from the pre-deep learning era, as an augmentation to modern neural network based approaches.
no code implementations • 29 Sep 2021 • Ryan Y Benmalek, Sabhya Chhabria, Pedro O. Pinheiro, Claire Cardie, Serge Belongie
These models outperform both previous work and static models under both \emph{static} and \emph{continual} semantic shifts, suggesting that ``learning to adapt'' is a useful capability for models and agents in a changing world.
1 code implementation • Findings (EMNLP) 2021 • Menglin Jia, Austin Reiter, Ser-Nam Lim, Yoav Artzi, Claire Cardie
We introduce Classification with Alternating Normalization (CAN), a non-parametric post-processing step for classification.
2 code implementations • ACL 2022 • Faisal Ladhak, Esin Durmus, He He, Claire Cardie, Kathleen McKeown
Despite recent progress in abstractive summarization, systems still suffer from faithfulness errors.
1 code implementation • NAACL 2021 • Xinya Du, Alexander Rush, Claire Cardie
Template filling is generally tackled by a pipeline of two separate supervised systems {--} one for role-filler extraction and another for template/event recognition.
1 code implementation • ICCV 2021 • Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim
Visual engagement in social media platforms comprises interactions with photo posts including comments, shares, and likes.
no code implementations • Findings (EMNLP) 2021 • Dian Yu, Kai Sun, Dong Yu, Claire Cardie
In spite of much recent research in the area, it is still unclear whether subject-area question-answering data is useful for machine reading comprehension (MRC) tasks.
no code implementations • Asian Chapter of the Association for Computational Linguistics 2020 • Xinya Du, Ahmed Hassan Awadallah, Adam Fourney, Robert Sim, Paul Bennett, Claire Cardie
We show that leveraging metadata information from web pages can improve the performance of models for answer passage selection/reranking.
1 code implementation • CVPR 2021 • Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim
Based on our findings, we conduct further study to quantify the effect of attending to object and context classes as well as textual information in the form of hashtags when training an intent classifier.
no code implementations • Findings of the Association for Computational Linguistics 2020 • Zonglin Yang, Xinya Du, Alexander Rush, Claire Cardie
End-to-end models in NLP rarely encode external world knowledge about length of time.
1 code implementation • NAACL 2021 • Kai Sun, Seungwhan Moon, Paul Crook, Stephen Roller, Becka Silvert, Bing Liu, Zhiguang Wang, Honglei Liu, Eunjoon Cho, Claire Cardie
Existing dialogue corpora and models are typically designed under two disjoint motives: while task-oriented systems focus on achieving functional goals (e. g., booking hotels), open-domain chatbots aim at making socially engaging conversations.
1 code implementation • Findings of the Association for Computational Linguistics 2020 • Faisal Ladhak, Esin Durmus, Claire Cardie, Kathleen McKeown
As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our dataset.
Abstractive Text Summarization Cross-Lingual Abstractive Summarization +2
no code implementations • EMNLP 2020 • Jialu Li, Esin Durmus, Claire Cardie
Online debate forums provide users a platform to express their opinions on controversial topics while being exposed to opinions from diverse set of viewpoints.
no code implementations • ACL 2022 • Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Claire Cardie
In this paper, we aim to extract commonsense knowledge to improve machine reading comprehension.
2 code implementations • EACL 2021 • Xinya Du, Alexander M. Rush, Claire Cardie
We revisit the classic problem of document-level role-filler entity extraction (REE) for template filling.
no code implementations • ACL 2020 • Rishi Bommasani, Kelly Davis, Claire Cardie
Contextualized representations (e. g. ELMo, BERT) have become the default pretrained representations for downstream NLP applications.
1 code implementation • ACL 2020 • Xinya Du, Claire Cardie
Few works in the literature of event extraction have gone beyond individual sentences to make extraction decisions.
3 code implementations • EMNLP 2020 • Xinya Du, Claire Cardie
The problem of event extraction requires detecting the event trigger and extracting its corresponding arguments.
5 code implementations • ECCV 2020 • Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui, Claire Cardie, Bharath Hariharan, Hartwig Adam, Serge Belongie
In this work we explore the task of instance segmentation with attribute localization, which unifies instance segmentation (detect and segment each object instance) and fine-grained visual attribute categorization (recognize one or multiple attributes).
3 code implementations • ACL 2020 • Dian Yu, Kai Sun, Claire Cardie, Dong Yu
We present the first human-annotated dialogue-based relation extraction (RE) dataset DialogRE, aiming to support the prediction of relation(s) between two arguments that appear in a dialogue.
Ranked #6 on Dialog Relation Extraction on DialogRE (F1c (v1) metric)
no code implementations • IJCNLP 2019 • Esin Durmus, Faisal Ladhak, Claire Cardie
Research in the social sciences and psychology has shown that the persuasiveness of an argument depends not only the language employed, but also on attributes of the source/communicator, the audience, and the appropriateness and strength of the argument's claims given the pragmatic and discourse context of the argument.
no code implementations • WS 2019 • Liane Longpre, Esin Durmus, Claire Cardie
In a study of users of a popular debate platform, we find first that different combinations of linguistic features are critical for predicting persuasion outcomes for UNDECIDED versus DECIDED members of the audience.
no code implementations • ACL 2019 • Ryan Benmalek, Madian Khabsa, Suma Desu, Claire Cardie, Michele Banko
We introduce the Scratchpad Mechanism, a novel addition to the sequence-to-sequence (seq2seq) neural network architecture and demonstrate its effectiveness in improving the overall fluency of seq2seq models for natural language generation tasks.
no code implementations • ACL 2019 • Esin Durmus, Faisal Ladhak, Claire Cardie
Systems for automatic argument generation and debate require the ability to (1) determine the stance of any claims employed in the argument and (2) assess the specificity of each claim relative to the argument context.
no code implementations • ACL 2019 • Esin Durmus, Claire Cardie
Existing argumentation datasets have succeeded in allowing researchers to develop computational methods for analyzing the content, structure and linguistic features of argumentative text.
no code implementations • NAACL 2018 • Esin Durmus, Claire Cardie
Public debate forums provide a common platform for exchanging opinions on a topic of interest.
1 code implementation • NAACL 2019 • Xinya Du, Bhavana Dalvi Mishra, Niket Tandon, Antoine Bosselut, Wen-tau Yih, Peter Clark, Claire Cardie
Our goal is procedural text comprehension, namely tracking how the properties of entities (e. g., their location) change with time given a procedural text (e. g., a paragraph about photosynthesis, a recipe).
1 code implementation • 12 Jun 2019 • Ryan Y. Benmalek, Madian Khabsa, Suma Desu, Claire Cardie, Michele Banko
We introduce the Scratchpad Mechanism, a novel addition to the sequence-to-sequence (seq2seq) neural network architecture and demonstrate its effectiveness in improving the overall fluency of seq2seq models for natural language generation tasks.
no code implementations • WS 2019 • Rishi Bommasani, Arzoo Katiyar, Claire Cardie
We apply our approach to the second-order structured prediction task studied in the 2016/2017 Belief and Sentiment analysis evaluations (BeSt): given a document and its entities, relations, and events (including metadata and mentions), determine the sentiment of each entity towards every relation and event in the document.
1 code implementation • TACL 2020 • Kai Sun, Dian Yu, Dong Yu, Claire Cardie
Machine reading comprehension tasks require a machine reader to answer questions relevant to the given document.
no code implementations • TACL 2019 • Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Yejin Choi, Claire Cardie
We present DREAM, the first dialogue-based multiple-choice reading comprehension data set.
1 code implementation • WS 2019 • Xiaoman Pan, Kai Sun, Dian Yu, Jianshu Chen, Heng Ji, Claire Cardie, Dong Yu
We focus on multiple-choice question answering (QA) tasks in subject areas such as science, where we require both broad background knowledge and the facts from the given subject-area reference corpus.
1 code implementation • 1 Feb 2019 • Kai Sun, Dian Yu, Jianshu Chen, Dong Yu, Yejin Choi, Claire Cardie
DREAM is likely to present significant challenges for existing reading comprehension systems: 84% of answers are non-extractive, 85% of questions require reasoning beyond a single sentence, and 34% of questions also involve commonsense knowledge.
1 code implementation • NAACL 2019 • Kai Sun, Dian Yu, Dong Yu, Claire Cardie
Reading strategies have been shown to improve comprehension levels, especially for readers lacking adequate prior knowledge.
Ranked #7 on Question Answering on StoryCloze
1 code implementation • ACL 2019 • Xilun Chen, Ahmed Hassan Awadallah, Hany Hassan, Wei Wang, Claire Cardie
In this work, we focus on the multilingual transfer setting where training data in multiple source languages is leveraged to further boost target language performance.
Ranked #10 on Cross-Lingual NER on CoNLL Dutch
no code implementations • 27 Sep 2018 • Xilun Chen, Ahmed Hassan Awadallah, Hany Hassan, Wei Wang, Claire Cardie
In this work, we propose a zero-resource multilingual transfer learning model that can utilize training data in multiple source languages, while not requiring target language training data nor cross-lingual supervision.
1 code implementation • EMNLP 2018 • Vlad Niculae, André F. T. Martins, Claire Cardie
Deep NLP models benefit from underlying structures in the data---e. g., parse trees---typically extracted using off-the-shelf parsers.
3 code implementations • EMNLP 2018 • Xilun Chen, Claire Cardie
Multilingual Word Embeddings (MWEs) represent words from multiple languages in a single distributional vector space.
no code implementations • 16 Jun 2018 • Ryan Y. Benmalek, Claire Cardie, Serge Belongie, Xiadong He, Jianfeng Gao
In this work we combine two research threads from Vision/ Graphics and Natural Language Processing to formulate an image generation task conditioned on attributes in a multi-turn setting.
no code implementations • NAACL 2018 • Arzoo Katiyar, Claire Cardie
We propose a novel recurrent neural network-based approach to simultaneously handle nested named entity recognition and nested entity mention detection.
no code implementations • WS 2018 • Esin Durmus, Claire Cardie
We use the gender and stance information to identify significant linguistic differences across individuals with different gender and stance.
1 code implementation • ACL 2018 • Xinya Du, Claire Cardie
We study the task of generating from Wikipedia articles question-answer pairs that cover content beyond a single sentence.
1 code implementation • NAACL 2018 • Xilun Chen, Claire Cardie
Many text classification tasks are known to be highly domain-dependent.
3 code implementations • ICML 2018 • Vlad Niculae, André F. T. Martins, Mathieu Blondel, Claire Cardie
Structured prediction requires searching over a combinatorial number of structures.
no code implementations • EMNLP 2017 • Xinya Du, Claire Cardie
A first step in the task of automatically generating questions for testing reading comprehension is to identify \textit{question-worthy} sentences, i. e. sentences in a text passage that humans find it worthwhile to ask questions about.
no code implementations • ACL 2017 • Arzoo Katiyar, Claire Cardie
We also compare our model with an end-to-end tree-based LSTM model (SPTree) by Miwa and Bansal (2016) and show that our model performs within 1{\%} on entity mentions and 2{\%} on relations.
Ranked #9 on Relation Extraction on ACE 2004
10 code implementations • ACL 2017 • Xinya Du, Junru Shao, Claire Cardie
We study automatic question generation for sentences from text passages in reading comprehension.
1 code implementation • ACL 2017 • Vlad Niculae, Joonsuk Park, Claire Cardie
We propose a novel factor graph model for argument mining, designed for settings in which the argumentative relations in a document do not necessarily form a tree structure.
no code implementations • 25 Jun 2016 • Lu Wang, Claire Cardie
This paper addresses the problem of summarizing decisions in spoken meetings: our goal is to produce a concise {\it decision abstract} for each meeting decision.
no code implementations • ACL 2013 • Lu Wang, Hema Raghavan, Vittorio Castelli, Radu Florian, Claire Cardie
We consider the problem of using sentence compression techniques to facilitate query-focused multi-document summarization.
no code implementations • WS 2012 • Lu Wang, Claire Cardie
We present a token-level decision summarization framework that utilizes the latent topic structures of utterances to identify "summary-worthy" words.
no code implementations • WS 2012 • Lu Wang, Claire Cardie
We present a novel unsupervised framework for focused meeting summarization that views the problem as an instance of relation extraction.
no code implementations • WS 2014 • Lu Wang, Claire Cardie
For example, the isotonic CRF model achieves F1 scores of 0. 74 and 0. 67 for agreement and disagreement detection, when a linear chain CRF obtains 0. 58 and 0. 56 for the discussions on Wikipedia Talk pages.
no code implementations • HLT 2015 • Lu Wang, Claire Cardie, Galen Marchetti
Existing timeline generation systems for complex events consider only information from traditional media, ignoring the rich social context provided by user-generated content that reveals representative public interests or insightful opinions.
no code implementations • COLING 2014 • Lu Wang, Hema Raghavan, Claire Cardie, Vittorio Castelli
We present a submodular function-based framework for query-focused opinion summarization.
no code implementations • ACL 2014 • Lu Wang, Claire Cardie
We investigate the novel task of online dispute detection and propose a sentiment analysis solution to the problem: we aim to identify the sequence of sentence-level sentiments expressed during a discussion and to use them as features in a classifier that predicts the DISPUTE/NON-DISPUTE label for the discussion as a whole.
2 code implementations • TACL 2018 • Xilun Chen, Yu Sun, Ben Athiwaratkun, Claire Cardie, Kilian Weinberger
To tackle the sentiment classification problem in low-resource languages without adequate annotated data, we propose an Adversarial Deep Averaging Network (ADAN) to transfer the knowledge learned from labeled data on a resource-rich source language to low-resource languages where only unlabeled data exists.
no code implementations • SEMEVAL 2015 • Eneko Agirre, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, I{\~n}igo Lopez-Gazpio, Montse Maritxalar, Rada Mihalcea, German Rigau, Larraitz Uria, Janyce Wiebe
no code implementations • TACL 2015 • Bishan Yang, Claire Cardie, Peter Frazier
We present a novel hierarchical distance-dependent Bayesian model for event coreference resolution.
no code implementations • 20 Dec 2014 • Ozan İrsoy, Claire Cardie
We present the multiplicative recurrent neural network as a general model for compositional meaning in language, and evaluate it on the task of fine-grained sentiment analysis.
no code implementations • NeurIPS 2014 • Ozan .Irsoy, Claire Cardie
In this work we introduce a new architecture --- a deep recursive neural network (deep RNN) --- constructed by stacking multiple recursive layers.
no code implementations • TACL 2014 • Bishan Yang, Claire Cardie
In this paper, we study the problems of opinion expression extraction and expression-level polarity and intensity classification.
no code implementations • 2 Dec 2013 • Ozan İrsoy, Claire Cardie
Recently, deep architectures, such as recurrent and recursive neural networks have been successfully applied to various natural language processing tasks.
no code implementations • 27 Sep 2013 • Jiwei Li, Claire Cardie
In this paper, we investigate the real-time flu detection problem on Twitter data by proposing Flu Markov Network (Flu-MN): a spatio-temporal unsupervised Bayesian algorithm based on a 4 phase Markov Network, trying to identify the flu breakout at the earliest stage.