Search Results for author: Bhuwan Dhingra

Found 48 papers, 23 papers with code

Bootstrapping Distantly Supervised IE using Joint Learning and Small Well-structured Corpora

no code implementations 10 Jun 2016 Lidong Bing, Bhuwan Dhingra, Kathryn Mazaitis, Jong Hyuk Park, William W. Cohen

We propose a framework to improve the performance of distantly-supervised relation extraction by jointly learning to solve two related tasks: concept-instance extraction and relation extraction.

Relation, Relation Extraction

Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access

1 code implementation ACL 2017 Bhuwan Dhingra, Lihong Li, Xiujun Li, Jianfeng Gao, Yun-Nung Chen, Faisal Ahmed, Li Deng

In this paper, we address this limitation by replacing symbolic queries with an induced "soft" posterior distribution over the KB that indicates which entities the user is interested in.
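
As a rough illustration of what a "soft" lookup over the KB means here, the toy sketch below turns per-slot beliefs inferred from the dialogue into a posterior over KB rows instead of issuing a hard symbolic query. The entities, slots, and belief values are all invented for illustration and are not taken from the paper.

    import numpy as np

    # Toy movie KB: each row is an entity with slot values.
    kb = [
        {"title": "Inception",  "year": "2010", "genre": "sci-fi"},
        {"title": "The Matrix", "year": "1999", "genre": "sci-fi"},
        {"title": "Amelie",     "year": "2001", "genre": "romance"},
    ]

    # Hypothetical beliefs over slot values inferred from the dialogue so far.
    slot_beliefs = {
        "genre": {"sci-fi": 0.8, "romance": 0.2},
        "year":  {"2010": 0.5, "1999": 0.3, "2001": 0.2},
    }

    def soft_posterior(kb, slot_beliefs):
        """Replace a hard KB lookup with a distribution over entities."""
        scores = np.array([
            np.prod([beliefs.get(row[slot], 1e-6)
                     for slot, beliefs in slot_beliefs.items()])
            for row in kb
        ])
        return scores / scores.sum()

    print(soft_posterior(kb, slot_beliefs))  # most mass on "Inception"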

reinforcement-learning, Reinforcement Learning (RL) +2

Words or Characters? Fine-grained Gating for Reading Comprehension

1 code implementation 6 Nov 2016 Zhilin Yang, Bhuwan Dhingra, Ye Yuan, Junjie Hu, William W. Cohen, Ruslan Salakhutdinov

Previous work combines word-level and character-level representations using concatenation or scalar weighting, which is suboptimal for high-level tasks like reading comprehension.
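The gating idea can be sketched in a few lines: a sigmoid gate computed from token-level features interpolates, dimension by dimension, between the word-level and character-level vectors. The sizes and random inputs below are placeholders, not the paper's configuration.

    import numpy as np

    rng = np.random.default_rng(0)
    d = 8                                  # toy embedding size
    word_vec = rng.normal(size=d)          # word-level embedding
    char_vec = rng.normal(size=d)          # character-level embedding (e.g. from a char-RNN)
    features = rng.normal(size=d)          # token features such as POS, NER, frequency
    W, b = rng.normal(size=(d, d)), np.zeros(d)

    gate = 1.0 / (1.0 + np.exp(-(W @ features + b)))   # per-dimension sigmoid gate
    combined = gate * char_vec + (1.0 - gate) * word_vec
    print(combined.round(2))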

Question Answering, Reading Comprehension +1

A Comparative Study of Word Embeddings for Reading Comprehension

no code implementations 2 Mar 2017 Bhuwan Dhingra, Hanxiao Liu, Ruslan Salakhutdinov, William W. Cohen

The focus of past machine learning research for Reading Comprehension tasks has been primarily on the design of novel deep learning architectures.

BIG-bench Machine Learning, Reading Comprehension +1

Linguistic Knowledge as Memory for Recurrent Neural Networks

no code implementations 7 Mar 2017 Bhuwan Dhingra, Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov

We introduce a model that encodes such graphs as explicit memory in recurrent neural networks, and use it to model coreference relations in text.

LAMBADA

Question Answering from Unstructured Text by Retrieval and Comprehension

no code implementations 26 Mar 2017 Yusuke Watanabe, Bhuwan Dhingra, Ruslan Salakhutdinov

Open domain Question Answering (QA) systems must interact with external knowledge sources, such as web pages, to find relevant information.
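A minimal retrieve-then-read pipeline of the kind studied here can be sketched as follows; the lexical-overlap retriever and the placeholder reader are toy stand-ins for the paper's neural components, not its actual method.

    def retrieve(question, passages):
        """Rank passages by word overlap with the question (toy retriever)."""
        q = set(question.lower().split())
        return max(passages, key=lambda p: len(q & set(p.lower().split())))

    def read(question, passage):
        """Placeholder reader: a real model would extract an answer span."""
        return passage.split(",")[0]

    passages = ["Kathmandu, the capital of Nepal, lies in a valley.",
                "Mount Everest is the highest mountain on Earth."]
    question = "What is the capital of Nepal?"
    print(read(question, retrieve(question, passages)))   # -> "Kathmandu"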

Open-Domain Question Answering, Retrieval

Simple and Effective Semi-Supervised Question Answering

no code implementations NAACL 2018 Bhuwan Dhingra, Danish Pruthi, Dheeraj Rajagopal

The recent success of deep learning models for the task of extractive Question Answering (QA) hinges on the availability of large annotated corpora.

Extractive Question-Answering, Question Answering +1

Neural Models for Reasoning over Multiple Mentions using Coreference

no code implementations NAACL 2018 Bhuwan Dhingra, Qiao Jin, Zhilin Yang, William W. Cohen, Ruslan Salakhutdinov

Many problems in NLP require aggregating information from multiple mentions of the same entity which may be far apart in the text.
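A toy version of the idea: at a coreferent mention, the recurrent update looks not only at the previous hidden state but also at the hidden state stored for the antecedent mention. The paper uses a gated GRU-style combination; the simple average and random weights below are only illustrative.

    import numpy as np

    def coref_step(h_prev, h_antecedent, x, W_h, W_x):
        """One recurrent step that also consults an antecedent's hidden state."""
        h_in = h_prev if h_antecedent is None else 0.5 * (h_prev + h_antecedent)
        return np.tanh(W_h @ h_in + W_x @ x)

    rng = np.random.default_rng(0)
    d = 4
    W_h, W_x = rng.normal(size=(d, d)), rng.normal(size=(d, d))
    h_mary = coref_step(np.zeros(d), None, rng.normal(size=d), W_h, W_x)           # "Mary ..."
    h_she = coref_step(rng.normal(size=d), h_mary, rng.normal(size=d), W_h, W_x)   # "... she ..."
    print(h_she.round(2))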

LAMBADA

Embedding Text in Hyperbolic Spaces

no code implementations WS 2018 Bhuwan Dhingra, Christopher J. Shallue, Mohammad Norouzi, Andrew M. Dai, George E. Dahl

Ideally, we could incorporate our prior knowledge of this hierarchical structure into unsupervised learning algorithms that work on text data.
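For context, the sketch below computes the distance function of the Poincare ball model, the standard way to measure distances in the hyperbolic spaces the paper embeds text into; the two example points are arbitrary and chosen only to show the hierarchy-like behavior.

    import numpy as np

    def poincare_distance(u, v, eps=1e-9):
        """Geodesic distance between two points inside the unit (Poincare) ball."""
        diff2 = np.dot(u - v, u - v)
        denom = (1.0 - np.dot(u, u)) * (1.0 - np.dot(v, v)) + eps
        return np.arccosh(1.0 + 2.0 * diff2 / denom)

    root = np.array([0.0, 0.0])    # points near the origin act like roots of a hierarchy
    leaf = np.array([0.0, 0.9])    # points near the boundary act like leaves
    print(poincare_distance(root, leaf))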

Sentence, Sentence Embeddings

GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations

1 code implementation 14 Jun 2018 Zhilin Yang, Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan Salakhutdinov, Yann Lecun

We also show that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have not been trained (including GloVe embeddings, ELMo embeddings, and task-specific RNN hidden units), or embedding-free units such as image pixels.
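A rough picture of how such a transferred graph is consumed downstream: a row-normalized token-to-token affinity matrix, learned unsupervisedly elsewhere, mixes whatever embeddings the target task happens to use. All matrices below are random placeholders rather than learned GLoMo graphs.

    import numpy as np

    rng = np.random.default_rng(0)
    T, d = 5, 16                                     # toy sequence length and embedding size
    affinity = rng.random((T, T))
    affinity /= affinity.sum(axis=1, keepdims=True)  # relational graph, row-normalized
    embeddings = rng.normal(size=(T, d))             # e.g. GloVe or ELMo vectors, or pixels
    mixed = affinity @ embeddings                    # graph-weighted feature combination
    print(mixed.shape)                               # (5, 16)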

Image Classification, Natural Language Inference +4

Open Domain Question Answering Using Early Fusion of Knowledge Bases and Text

2 code implementations EMNLP 2018 Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Kathryn Mazaitis, Ruslan Salakhutdinov, William W. Cohen

In this paper we look at a more practical setting, namely QA over the combination of a KB and entity-linked text, which is appropriate when an incomplete KB is available with a large text corpus.
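The "early fusion" can be pictured as building a single graph over KB facts and entity-linked passages before any reasoning runs. The tiny example below uses networkx only to make that structure concrete; the nodes, relation names, and document ids are made up.

    import networkx as nx

    g = nx.Graph()
    # KB edge between two entities.
    g.add_edge("Barack Obama", "Hawaii", source="kb", relation="born_in")
    # Text edges linking entities to a passage that mentions them.
    g.add_edge("Barack Obama", "doc_12", source="text")
    g.add_edge("Hawaii", "doc_12", source="text")
    print(list(g.edges(data=True)))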

Graph Representation Learning, Open-Domain Question Answering

GLoMo: Unsupervised Learning of Transferable Relational Graphs

no code implementations NeurIPS 2018 Zhilin Yang, Jake Zhao, Bhuwan Dhingra, Kaiming He, William W. Cohen, Ruslan R. Salakhutdinov, Yann Lecun

We also show that the learned graphs are generic enough to be transferred to different embeddings on which the graphs have not been trained (including GloVe embeddings, ELMo embeddings, and task-specific RNN hidden units), or embedding-free units such as image pixels.

Image Classification, Natural Language Inference +4

Probing Biomedical Embeddings from Language Models

1 code implementation WS 2019 Qiao Jin, Bhuwan Dhingra, William W. Cohen, Xinghua Lu

For this we use the pre-trained LMs as fixed feature extractors and restrict the downstream task models to not have additional sequence modeling layers.
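In code, "fixed feature extractor" simply means freezing the pre-trained encoder and reading off its contextual vectors for a shallow downstream model. The sketch below uses the Hugging Face transformers API with a generic checkpoint name as a placeholder rather than the biomedical models probed in the paper.

    import torch
    from transformers import AutoModel, AutoTokenizer

    name = "bert-base-uncased"                 # placeholder; the paper probes biomedical LMs
    tokenizer = AutoTokenizer.from_pretrained(name)
    encoder = AutoModel.from_pretrained(name).eval()
    for p in encoder.parameters():
        p.requires_grad = False                # no fine-tuning: features only

    with torch.no_grad():
        batch = tokenizer(["EGFR mutations predict response to gefitinib."],
                          return_tensors="pt")
        features = encoder(**batch).last_hidden_state   # (1, seq_len, hidden)
    print(features.shape)                      # feed these into a shallow NER tagger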

NER, Word Embeddings

Combating Adversarial Misspellings with Robust Word Recognition

3 code implementations ACL 2019 Danish Pruthi, Bhuwan Dhingra, Zachary C. Lipton

To combat adversarial spelling mistakes, we propose placing a word recognition model in front of the downstream classifier.
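The pipeline shape, stripped of the paper's RNN-based semi-character recognizer, looks like this: misspelled tokens are first mapped back to vocabulary words, and only then does the downstream classifier see the sentence. Both the dictionary-based recognizer and the keyword classifier below are toy stand-ins.

    import difflib

    VOCAB = ["the", "movie", "was", "terrible", "great", "not"]

    def recognize(token):
        """Toy word recognizer: snap a token to the closest vocabulary word."""
        match = difflib.get_close_matches(token.lower(), VOCAB, n=1, cutoff=0.6)
        return match[0] if match else token

    def classify(tokens):
        """Hypothetical downstream sentiment classifier."""
        return "negative" if "terrible" in tokens else "positive"

    tokens = [recognize(t) for t in "the moive was terrrible".split()]
    print(tokens, "->", classify(tokens))      # adversarial misspellings are undone first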

Sentiment Analysis

Handling Divergent Reference Texts when Evaluating Table-to-Text Generation

1 code implementation ACL 2019 Bhuwan Dhingra, Manaal Faruqui, Ankur Parikh, Ming-Wei Chang, Dipanjan Das, William W. Cohen

Automatically constructed datasets for generating text from semi-structured data (tables), such as WikiBio, often contain reference texts that diverge from the information in the corresponding semi-structured data.
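The fix, boiled down, is to give credit for generated text that is supported by the table even when the reference diverges from it. The toy precision below checks each predicted token against the union of reference and table tokens; the actual PARENT metric works over n-grams with an entailment component, so this is only the core intuition.

    def table_grounded_precision(prediction, reference, table_values):
        """Count predicted tokens as correct if the reference OR the table supports them."""
        supported = set(reference.lower().split()) | {v.lower() for v in table_values}
        tokens = prediction.lower().split()
        return sum(t in supported for t in tokens) / len(tokens)

    pred = "john was born in 1953"
    ref = "john smith is an american politician"        # diverges from the table
    table = {"john", "smith", "born", "1953"}
    print(table_grounded_precision(pred, ref, table))    # 0.6 rather than 0.2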

Table-to-Text Generation

Learning to Deceive with Attention-Based Explanations

3 code implementations ACL 2020 Danish Pruthi, Mansi Gupta, Bhuwan Dhingra, Graham Neubig, Zachary C. Lipton

Attention mechanisms are ubiquitous components in neural architectures applied to natural language processing.

Fairness

Differentiable Reasoning over a Virtual Knowledge Base

1 code implementation ICLR 2020 Bhuwan Dhingra, Manzil Zaheer, Vidhisha Balachandran, Graham Neubig, Ruslan Salakhutdinov, William W. Cohen

In particular, we describe a neural module, DrKIT, that traverses textual data like a KB, softly following paths of relations between mentions of entities in the corpus.
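One hop of this soft traversal can be written as two matrix products: a distribution over entities is pushed onto their mentions in the corpus, mentions are reweighted by how well they match the question's relation, and the mass is pulled back onto entities. The tiny dense matrices below are stand-ins for the pre-computed sparse indices the paper uses.

    import numpy as np

    # E2M[i, j] = 1 if mention j is a mention of entity i.
    entity_to_mention = np.array([[1, 1, 0, 0],
                                  [0, 0, 1, 0],
                                  [0, 0, 0, 1]], dtype=float)
    # M2E[j, k] = 1 if mention j's sentence points to entity k as the object.
    mention_to_entity = np.array([[0, 1, 0],
                                  [0, 0, 1],
                                  [1, 0, 0],
                                  [0, 0, 1]], dtype=float)

    start_entities = np.array([1.0, 0.0, 0.0])            # soft set of starting entities
    relation_scores = np.array([0.9, 0.1, 0.5, 0.3])      # mention relevance to the question

    hop = (start_entities @ entity_to_mention) * relation_scores
    next_entities = hop @ mention_to_entity
    print(next_entities / next_entities.sum())            # new soft set after one hop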

Re-Ranking

ToTTo: A Controlled Table-To-Text Generation Dataset

1 code implementation EMNLP 2020 Ankur P. Parikh, Xuezhi Wang, Sebastian Gehrmann, Manaal Faruqui, Bhuwan Dhingra, Diyi Yang, Dipanjan Das

We present ToTTo, an open-domain English table-to-text dataset with over 120,000 training examples that proposes a controlled generation task: given a Wikipedia table and a set of highlighted table cells, produce a one-sentence description.
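An example in the spirit of the dataset makes the controlled-generation setup concrete: the model sees the table plus the highlighted cells and must produce the one-sentence target. The field names and values below are simplified for illustration and do not match the released JSON format.

    example = {
        "table": [["Year", "Team", "Goals"],
                  ["2012", "Arsenal", "11"],
                  ["2013", "Arsenal", "17"]],
        "highlighted_cells": [(2, 0), (2, 2)],   # (row, column) indices into the table
        "target": "In 2013, the player scored 17 goals.",
    }
    highlighted = [example["table"][r][c] for r, c in example["highlighted_cells"]]
    print(highlighted, "->", example["target"])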

Conditional Text Generation, Data-to-Text Generation +2

Differentiable Open-Ended Commonsense Reasoning

no code implementations NAACL 2021 Bill Yuchen Lin, Haitian Sun, Bhuwan Dhingra, Manzil Zaheer, Xiang Ren, William W. Cohen

As a step towards making commonsense reasoning research more realistic, we propose to study open-ended commonsense reasoning (OpenCSR) -- the task of answering a commonsense question without any pre-defined choices -- using as a resource only a corpus of commonsense facts written in natural language.

Multiple-choice

Weakly- and Semi-supervised Evidence Extraction

1 code implementation Findings of the Association for Computational Linguistics 2020 Danish Pruthi, Bhuwan Dhingra, Graham Neubig, Zachary C. Lipton

For many prediction tasks, stakeholders desire not only predictions but also supporting evidence that a human can use to verify its correctness.

Evaluating Explanations: How much do explanations from the teacher aid students?

1 code implementation 1 Dec 2020 Danish Pruthi, Rachit Bansal, Bhuwan Dhingra, Livio Baldini Soares, Michael Collins, Zachary C. Lipton, Graham Neubig, William W. Cohen

While many methods purport to explain predictions by highlighting salient features, what aims these explanations serve and how they ought to be evaluated often go unstated.

Question Answering, text-classification +1

Fool Me Twice: Entailment from Wikipedia Gamification

1 code implementation NAACL 2021 Julian Martin Eisenschlos, Bhuwan Dhingra, Jannis Bulian, Benjamin Börschinger, Jordan Boyd-Graber

We release FoolMeTwice (FM2 for short), a large dataset of challenging entailment pairs collected through a fun multi-player game.

Retrieval

Time-Aware Language Models as Temporal Knowledge Bases

no code implementations 29 Jun 2021 Bhuwan Dhingra, Jeremy R. Cole, Julian Martin Eisenschlos, Daniel Gillick, Jacob Eisenstein, William W. Cohen

We introduce a diagnostic dataset aimed at probing LMs for factual knowledge that changes over time and highlight problems with LMs at either end of the spectrum -- those trained on specific slices of temporal data, as well as those trained on a wide range of temporal data.
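The probes in such a diagnostic set pair the same query with different years, so the expected answer changes over time. The two examples below are illustrative rather than drawn from the released data.

    probes = [
        ("In 2008, the Prime Minister of the United Kingdom was ____.", "Gordon Brown"),
        ("In 2012, the Prime Minister of the United Kingdom was ____.", "David Cameron"),
    ]
    for query, answer in probes:
        print(query, "->", answer)   # a time-aware LM should change its answer with the year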

Memorization

ASQA: Factoid Questions Meet Long-Form Answers

no code implementations 12 Apr 2022 Ivan Stelmakh, Yi Luan, Bhuwan Dhingra, Ming-Wei Chang

In contrast to existing long-form QA tasks (such as ELI5), ASQA admits a clear notion of correctness: a user faced with a good summary should be able to answer different interpretations of the original ambiguous question.

Question Answering

Characterizing the Efficiency vs. Accuracy Trade-off for Long-Context NLP Models

1 code implementation nlppower (ACL) 2022 Phyllis Ang, Bhuwan Dhingra, Lisa Wu Wills

In this work, we perform a systematic study of this accuracy vs. efficiency trade-off on two widely used long-sequence models - Longformer-Encoder-Decoder (LED) and Big Bird - during fine-tuning and inference on four datasets from the SCROLLS benchmark.

Playing the Game of 2048, Question Answering

On the State of the Art in Authorship Attribution and Authorship Verification

1 code implementation 14 Sep 2022 Jacob Tyo, Bhuwan Dhingra, Zachary C. Lipton

Despite decades of research on authorship attribution (AA) and authorship verification (AV), inconsistent dataset splits/filtering and mismatched evaluation methods make it difficult to assess the state of the art.

Authorship Attribution, Authorship Verification

DIFFQG: Generating Questions to Summarize Factual Changes

no code implementations 1 Mar 2023 Jeremy R. Cole, Palak Jain, Julian Martin Eisenschlos, Michael J. Q. Zhang, Eunsol Choi, Bhuwan Dhingra

We propose representing factual changes between paired documents as question-answer pairs, where the answer to the same question differs between two versions.

Change Detection, Question Generation +1

Learning the Legibility of Visual Text Perturbations

1 code implementation 9 Mar 2023 Dev Seth, Rickard Stureborg, Danish Pruthi, Bhuwan Dhingra

In this work, we address this gap by learning models that predict the legibility of a perturbed string, and rank candidate perturbations based on their legibility.

Salient Span Masking for Temporal Understanding

no code implementations 22 Mar 2023 Jeremy R. Cole, Aditi Chaudhary, Bhuwan Dhingra, Partha Talukdar

First, we find that SSM alone improves the downstream performance on three temporal tasks by an avg.
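Salient span masking replaces whole entity or date spans, rather than random subwords, with mask tokens during continued pre-training. The regex below, which only catches four-digit years, is a deliberately crude stand-in for the NER-based span selection used in practice.

    import re

    def mask_salient_spans(text, mask_token="[MASK]"):
        """Mask date-like spans so the model must recover temporal facts."""
        return re.sub(r"\b(1[89]\d{2}|20\d{2})\b", mask_token, text)

    print(mask_salient_spans("The treaty was signed in 1992 and revised in 2007."))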

Avg, Language Modelling +1

Selectively Answering Ambiguous Questions

no code implementations 24 May 2023 Jeremy R. Cole, Michael J. Q. Zhang, Daniel Gillick, Julian Martin Eisenschlos, Bhuwan Dhingra, Jacob Eisenstein

We investigate question answering from this perspective, focusing on answering a subset of questions with a high degree of accuracy, from a set of questions in which many are inherently ambiguous.

Question Answering

Hierarchical Multi-Instance Multi-Label Learning for Detecting Propaganda Techniques

no code implementations 30 May 2023 Anni Chen, Bhuwan Dhingra

Since the introduction of the SemEval 2020 Task 11 (Martino et al., 2020a), several approaches have been proposed in the literature for classifying propaganda based on the rhetorical techniques used to influence readers.

Multi-Label Learning

Calibrating Long-form Generations from Large Language Models

no code implementations 9 Feb 2024 Yukun Huang, Yixin Liu, Raghuveer Thirukovalluru, Arman Cohan, Bhuwan Dhingra

Addressing this gap, we introduce a unified calibration framework, in which both the correctness of the LLMs' responses and their associated confidence levels are treated as distributions across a range of scores.
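A minimal way to treat both correctness and confidence as scores rather than binary labels is to bin stated confidences and compare each bin's average confidence with the average graded correctness of its answers. The numbers below are made up, and the paper's framework is considerably richer than this single statistic.

    import numpy as np

    def expected_calibration_error(confidence, correctness, n_bins=5):
        """Average gap between stated confidence and graded correctness, per bin."""
        confidence = np.asarray(confidence, dtype=float)
        correctness = np.asarray(correctness, dtype=float)
        bin_ids = np.minimum((confidence * n_bins).astype(int), n_bins - 1)
        ece = 0.0
        for b in range(n_bins):
            mask = bin_ids == b
            if mask.any():
                weight = mask.sum() / len(confidence)
                ece += weight * abs(confidence[mask].mean() - correctness[mask].mean())
        return ece

    # Confidences reported by the model and graded answer quality, both in [0, 1].
    print(expected_calibration_error([0.9, 0.8, 0.4, 0.2], [1.0, 0.6, 0.5, 0.1]))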

LLM-Resistant Math Word Problem Generation via Adversarial Attacks

1 code implementation 27 Feb 2024 Roy Xie, Chengxuan Huang, Junlin Wang, Bhuwan Dhingra

Large language models (LLMs) have significantly transformed the educational landscape.

Math

Extracting Polymer Nanocomposite Samples from Full-Length Documents

1 code implementation 1 Mar 2024 Ghazal Khalighinejad, Defne Circi, L. C. Brinson, Bhuwan Dhingra

This paper investigates the use of large language models (LLMs) for extracting sample lists of polymer nanocomposites (PNCs) from full-length materials science research papers.

Document-level Relation Extraction

IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations

no code implementations 1 Apr 2024 Deqing Fu, Ghazal Khalighinejad, Ollie Liu, Bhuwan Dhingra, Dani Yogatama, Robin Jia, Willie Neiswanger

Current foundation models exhibit impressive capabilities when prompted either with text only or with both image and text inputs.

Benchmarking, Math

ChatShop: Interactive Information Seeking with Language Agents

no code implementations 15 Apr 2024 Sanxing Chen, Sam Wiseman, Bhuwan Dhingra

The desire and ability to seek new information strategically are fundamental to human learning but often overlooked in current language agent development.

Retrieval

Investigating the Effect of Background Knowledge on Natural Questions

no code implementations NAACL (DeeLIO) 2021 Vidhisha Balachandran, Bhuwan Dhingra, Haitian Sun, Michael Collins, William Cohen

We create a subset of the NQ data, Factual Questions (FQ), where the questions have evidence in the KB in the form of paths that link question entities to answer entities but still must be answered using text, to facilitate further research into KB integration methods.

Natural Questions, Retrieval
