Search Results for author: Dawn Lawrie

Found 24 papers, 9 papers with code

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

2 code implementations • 22 Mar 2024 • Orion Weller, Benjamin Chang, Sean MacAvaney, Kyle Lo, Arman Cohan, Benjamin Van Durme, Dawn Lawrie, Luca Soldaini

We introduce our dataset FollowIR, which contains a rigorous instruction evaluation benchmark as well as a training set for helping IR models learn to better follow real-world instructions.

Information Retrieval Retrieval

16,643

Paper
Code

Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models

1 code implementation • 20 Jan 2022 • Suraj Nair, Eugene Yang, Dawn Lawrie, Kevin Duh, Paul McNamee, Kenton Murray, James Mayfield, Douglas W. Oard

These models have improved the effectiveness of retrieval systems well beyond that of lexical term matching models such as BM25.

Document Ranking Information Retrieval +3

Paper
Code

Neural Approaches to Multilingual Information Retrieval

1 code implementation • 3 Sep 2022 • Dawn Lawrie, Eugene Yang, Douglas W. Oard, James Mayfield

Providing access to information across languages has been a goal of Information Retrieval (IR) for decades.

Document Translation Information Retrieval +3

Paper
Code

Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation

1 code implementation • 9 Jan 2024 • Eugene Yang, Dawn Lawrie, James Mayfield, Douglas W. Oard, Scott Miller

Applying a similar knowledge distillation approach to training an efficient dual-encoder model for Cross-Language Information Retrieval (CLIR), where queries and documents are in different languages, is challenging due to the lack of a sufficiently large training collection when the query and document languages differ.

Information Retrieval Knowledge Distillation +2

Paper
Code

Patapasco: A Python Framework for Cross-Language Information Retrieval Experiments

1 code implementation • 24 Jan 2022 • Cash Costello, Eugene Yang, Dawn Lawrie, James Mayfield

While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR).

Information Retrieval Retrieval

Paper
Code

Improving Neural Named Entity Recognition with Gazetteers

1 code implementation • 6 Mar 2020 • Chan Hee Song, Dawn Lawrie, Tim Finin, James Mayfield

The goal of this work is to improve the performance of a neural named entity recognition system by adding input features that indicate a word is part of a name included in a gazetteer.

named-entity-recognition Named Entity Recognition +1

Paper
Code

Pretrained Models for Multilingual Federated Learning

1 code implementation • NAACL 2022 • Orion Weller, Marc Marone, Vladimir Braverman, Dawn Lawrie, Benjamin Van Durme

Since the advent of Federated Learning (FL), research has applied these methods to natural language processing (NLP) tasks.

Federated Learning Language Modelling +3

Paper
Code

HC4: A New Suite of Test Collections for Ad Hoc CLIR

1 code implementation • 24 Jan 2022 • Dawn Lawrie, James Mayfield, Douglas Oard, Eugene Yang

HC4 is a new suite of test collections for ad hoc Cross-Language Information Retrieval (CLIR), with Common Crawl News documents in Chinese, Persian, and Russian, topics in English and in the document languages, and graded relevance judgments.

Active Learning Information Retrieval +1

Paper
Code

NevIR: Negation in Neural Information Retrieval

1 code implementation • 12 May 2023 • Orion Weller, Dawn Lawrie, Benjamin Van Durme

Although the Information Retrieval (IR) community has adopted LMs as the backbone of modern IR architectures, there has been little to no research in understanding how negation impacts neural IR.

Information Retrieval Negation +1

Paper
Code

CADET: Computer Assisted Discovery Extraction and Translation

no code implementations • IJCNLP 2017 • Benjamin Van Durme, Tom Lippincott, Kevin Duh, Deana Burchfield, Adam Poliak, Cash Costello, Tim Finin, Scott Miller, James Mayfield, Philipp Koehn, Craig Harman, Dawn Lawrie, Ch May, ler, Max Thomas, Annabelle Carrell, Julianne Chaloux, Tongfei Chen, Alex Comerford, Mark Dredze, Benjamin Glass, Shudong Hao, Patrick Martin, Pushpendre Rastogi, Rashmi Sankepally, Travis Wolfe, Ying-Ying Tran, Ted Zhang

It combines a multitude of analytics together with a flexible environment for customizing the workflow for different users.

Active Learning Machine Translation +1

Paper
Add Code

KELVIN: a tool for automated knowledge base construction

no code implementations • NAACL 2013 • Paul McNamee, James Mayfield, Tim Finin, Tim Oates, Dawn Lawrie, Tan Xu, Douglas Oard

Knowledge Base Population Relation Extraction

Paper
Add Code

A Context-Aware Approach to Entity Linking

no code implementations • WS 2012 • Veselin Stoyanov, James Mayfield, Tan Xu, Douglas Oard, Dawn Lawrie, Tim Oates, Tim Finin

Entity Linking Knowledge Base Population

Paper
Add Code

Creating and Curating a Cross-Language Person-Entity Linking Collection

no code implementations • LREC 2012 • Dawn Lawrie, James Mayfield, Paul McNamee, Douglas Oard

To stimulate research in cross-language entity linking, we present a new test collection for evaluating the accuracy of cross-language entity linking in twenty-one languages.

Entity Linking Knowledge Base Population +1

Paper
Add Code

Building OCR/NER Test Collections

no code implementations • LREC 2020 • Dawn Lawrie, James Mayfield, David Etter

This means that named entities are annotated on the transcribed text.

named-entity-recognition Named Entity Recognition +3

Paper
Add Code

Parameter-efficient Zero-shot Transfer for Cross-Language Dense Retrieval with Adapters

no code implementations • 20 Dec 2022 • Eugene Yang, Suraj Nair, Dawn Lawrie, James Mayfield, Douglas W. Oard

By adding adapters pretrained on language tasks for a specific language with task-specific adapters, prior work has shown that the adapter-enhanced models perform better than fine-tuning the entire model when transferring across languages in various NLP tasks.

Information Retrieval Language Modelling +1

Paper
Add Code

When Do Decompositions Help for Machine Reading?

no code implementations • 20 Dec 2022 • Kangda Wei, Dawn Lawrie, Benjamin Van Durme, Yunmo Chen, Orion Weller

Answering complex questions often requires multi-step reasoning in order to obtain the final answer.

Reading Comprehension Retrieval

Paper
Add Code

Defending Against Disinformation Attacks in Open-Domain Question Answering

no code implementations • 20 Dec 2022 • Orion Weller, Aleem Khan, Nathaniel Weir, Dawn Lawrie, Benjamin Van Durme

Recent work in open-domain question answering (ODQA) has shown that adversarial poisoning of the search collection can cause large drops in accuracy for production systems.

Data Poisoning Misinformation +1

Paper
Add Code

Overview of the TREC 2022 NeuCLIR Track

no code implementations • 24 Apr 2023 • Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang

This is the first year of the TREC Neural CLIR (NeuCLIR) track, which aims to study the impact of neural approaches to cross-language information retrieval.

Information Retrieval Retrieval

Paper
Add Code

Synthetic Cross-language Information Retrieval Training Data

no code implementations • 29 Apr 2023 • James Mayfield, Eugene Yang, Dawn Lawrie, Samuel Barham, Orion Weller, Marc Mason, Suraj Nair, Scott Miller

By repeating this process, collections of arbitrary size can be created in the style of MS MARCO but using naturally-occurring documents in any desired genre and domain of discourse.

Information Retrieval Language Modelling +4

Paper
Add Code

"According to ...": Prompting Language Models Improves Quoting from Pre-Training Data

no code implementations • 22 May 2023 • Orion Weller, Marc Marone, Nathaniel Weir, Dawn Lawrie, Daniel Khashabi, Benjamin Van Durme

Large Language Models (LLMs) may hallucinate and generate fake information, despite pre-training on factual data.

Paper
Add Code

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets

no code implementations • 15 Sep 2023 • Orion Weller, Kyle Lo, David Wadden, Dawn Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini

Using large language models (LMs) for query or document expansion can improve generalization in information retrieval.

Information Retrieval Retrieval

Paper
Add Code

Dated Data: Tracing Knowledge Cutoffs in Large Language Models

no code implementations • 19 Mar 2024 • Jeffrey Cheng, Marc Marone, Orion Weller, Dawn Lawrie, Daniel Khashabi, Benjamin Van Durme

Using this analysis, we find that effective cutoffs often differ from reported cutoffs.

Paper
Add Code

HLTCOE at TREC 2023 NeuCLIR Track

no code implementations • 11 Apr 2024 • Eugene Yang, Dawn Lawrie, James Mayfield

TT trains a ColBERT model with English queries and passages automatically translated into the document language from the MS-MARCO v1 collection.

Document Translation

Paper
Add Code

Overview of the TREC 2023 NeuCLIR Track

no code implementations • 11 Apr 2024 • Dawn Lawrie, Sean MacAvaney, James Mayfield, Paul McNamee, Douglas W. Oard, Luca Soldaini, Eugene Yang

The principal tasks are ranked retrieval of news in one of the three languages, using English topics.

Information Retrieval Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.