Search Results for author: Neha Verma

Found 9 papers, 4 papers with code

Improving Low-Resource Cross-lingual Document Retrieval by Reranking with Deep Bilingual Representations

no code implementations • ACL 2019 • Rui Zhang, Caitlin Westerfield, Sungrok Shim, Garrett Bingham, Alexander Fabbri, Neha Verma, William Hu, Dragomir Radev

In this paper, we propose to boost low-resource cross-lingual document retrieval performance with deep bilingual query-document representations.

Cross-Lingual Information Retrieval Cross-Lingual Word Embeddings +3

Paper
Add Code

DART: Open-Domain Structured Data Record to Text Generation

2 code implementations • NAACL 2021 • Linyong Nan, Dragomir Radev, Rui Zhang, Amrit Rau, Abhinand Sivaprasad, Chiachun Hsieh, Xiangru Tang, Aadit Vyas, Neha Verma, Pranav Krishna, Yangxiaokang Liu, Nadia Irwanto, Jessica Pan, Faiaz Rahman, Ahmad Zaidi, Mutethia Mutuma, Yasin Tarabar, Ankit Gupta, Tao Yu, Yi Chern Tan, Xi Victoria Lin, Caiming Xiong, Richard Socher, Nazneen Fatema Rajani

Data-to-Text annotations can be a costly process, especially when dealing with tables which are the major source of structured data and contain nontrivial structures.

Domain Generalization Semantic Parsing +2

142

Paper
Code

FeTaQA: Free-form Table Question Answering

1 code implementation • 1 Apr 2021 • Linyong Nan, Chiachun Hsieh, Ziming Mao, Xi Victoria Lin, Neha Verma, Rui Zhang, Wojciech Kryściński, Nick Schoelkopf, Riley Kong, Xiangru Tang, Murori Mutuma, Ben Rosand, Isabel Trindade, Renusree Bandaru, Jacob Cunningham, Caiming Xiong, Dragomir Radev

Existing table question answering datasets contain abundant factual questions that primarily evaluate the query and schema comprehension capability of a system, but they fail to include questions that require complex reasoning and integration of information due to the constraint of the associated short-form answers.

Question Answering Retrieval +2

Paper
Code

Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review

no code implementations • 7 Jul 2021 • Irene Li, Jessica Pan, Jeremy Goldwasser, Neha Verma, Wai Pan Wong, Muhammed Yavuz Nuzumlali, Benjamin Rosand, Yixin Li, Matthew Zhang, David Chang, R. Andrew Taylor, Harlan M. Krumholz, Dragomir Radev

Electronic health records (EHRs), digital collections of patient healthcare events and observations, are ubiquitous in medicine and critical to healthcare delivery, operations, and research.

Knowledge Graphs Question Answering +1

Paper
Add Code

IsoVec: Controlling the Relative Isomorphism of Word Embedding Spaces

1 code implementation • 11 Oct 2022 • Kelly Marchisio, Neha Verma, Kevin Duh, Philipp Koehn

The ability to extract high-quality translation dictionaries from monolingual word embedding spaces depends critically on the geometric similarity of the spaces -- their degree of "isomorphism."

Bilingual Lexicon Induction Translation

Paper
Code

Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer

no code implementations • 23 May 2023 • Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post

We introduce and demonstrate how to effectively train multilingual machine translation models with pixel representations.

Cross-Lingual Transfer Machine Translation +1

Paper
Add Code

Exploring Representational Disparities Between Multilingual and Bilingual Translation Models

no code implementations • 23 May 2023 • Neha Verma, Kenton Murray, Kevin Duh

Multilingual machine translation has proven immensely useful for both parameter efficiency and overall performance across many language pairs via complete multilingual parameter sharing.

Machine Translation Translation

Paper
Add Code

Merging Text Transformer Models from Different Initializations

1 code implementation • 1 Mar 2024 • Neha Verma, Maha Elbayad

Recent work on one-shot permutation-based model merging has shown impressive low- or zero-barrier mode connectivity between models from completely different initializations.

Language Modelling Masked Language Modeling

Paper
Code

Strategies for Adapting Multilingual Pre-training for Domain-Specific Machine Translation

no code implementations • AMTA 2022 • Neha Verma, Kenton Murray, Kevin Duh

Therefore, in this work, we propose two major fine-tuning strategies: our language-first approach first learns the translation language pair via general bitext, followed by the domain via in-domain bitext, and our domain-first approach first learns the domain via multilingual in-domain bitext, followed by the language pair via language pair-specific in-domain bitext.

Domain Adaptation Machine Translation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.