Search Results for author: Adithya Pratapa

Found 15 papers, 8 papers with code

Comparing Grammatical Theories of Code-Mixing

no code implementations • WNUT (ACL) 2021 • Adithya Pratapa, Monojit Choudhury

Code-mixed text generation systems have found applications in many downstream tasks, including speech recognition, translation and dialogue.

Speech Recognition +2

Team JARS: DialDoc Subtask 1 - Improved Knowledge Identification with Supervised Out-of-Domain Pretraining

no code implementations • ACL (dialdoc) 2021 • Sopan Khosla, Justin Lovelace, Ritam Dutt, Adithya Pratapa

In this paper, we discuss our submission for DialDoc subtask 1.

Question Answering

Constrained Fact Verification for FEVER

no code implementations • EMNLP 2020 • Adithya Pratapa, Sai Muralidhar Jayanthi, Kavya Nerella

Fact-verification systems are well explored in the NLP literature with growing attention owing to shared tasks like FEVER.

Fact Verification

A Study of Morphological Robustness of Neural Machine Translation

1 code implementation • ACL (SIGMORPHON) 2021 • Sai Muralidhar Jayanthi, Adithya Pratapa

In this work, we analyze the robustness of neural machine translation systems towards grammatical perturbations in the source.

Grammatical Error Correction Machine Translation +2

What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions

1 code implementation • 22 May 2024 • Sang Keun Choe, Hwijeen Ahn, Juhan Bae, Kewen Zhao, Minsoo Kang, Youngseog Chung, Adithya Pratapa, Willie Neiswanger, Emma Strubell, Teruko Mitamura, Jeff Schneider, Eduard Hovy, Roger Grosse, Eric Xing

Large language models (LLMs) are trained on a vast amount of human-written data, but data providers often remain uncredited.

Data Valuation

Calibrated Seq2seq Models for Efficient and Generalizable Ultra-fine Entity Typing

1 code implementation • 1 Nov 2023 • Yanlin Feng, Adithya Pratapa, David R Mortensen

In this paper, we present CASENT, a seq2seq model designed for ultra-fine entity typing that predicts ultra-fine types with calibrated confidence scores.

Entity Typing

Background Summarization of Event Timelines

1 code implementation • 24 Oct 2023 • Adithya Pratapa, Kevin Small, Markus Dreyer

Generating concise summaries of news events is a challenging natural language processing task.

News Summarization Question Answering

Hierarchical Event Grounding

no code implementations • 8 Feb 2023 • Jiefu Ou, Adithya Pratapa, Rishubh Gupta, Teruko Mitamura

In this work, we present an extension to the event grounding task that requires tackling hierarchical event structures from the KB.

Retrieval

Multilingual Event Linking to Wikidata

1 code implementation • NAACL (MIA) 2022 • Adithya Pratapa, Rishubh Gupta, Teruko Mitamura

On the two proposed tasks, we compare multiple event linking systems including BM25+ (Lv and Zhai, 2011) and multilingual adaptations of the biencoder and crossencoder architectures from BLINK (Wu et al., 2020).

Domain Generalization

Cross-document Event Identity via Dense Annotation

1 code implementation • CoNLL (EMNLP) 2021 • Adithya Pratapa, Zhengzhong Liu, Kimihiro Hasegawa, Linwei Li, Yukari Yamakawa, Shikun Zhang, Teruko Mitamura

To this end, we design a new annotation workflow with careful quality control and an easy-to-use annotation interface.

Evaluating the Morphosyntactic Well-formedness of Generated Texts

1 code implementation • EMNLP 2021 • Adithya Pratapa, Antonios Anastasopoulos, Shruti Rijhwani, Aditi Chaudhary, David R. Mortensen, Graham Neubig, Yulia Tsvetkov

Text generation systems are ubiquitous in natural language processing applications.

Machine Translation Text Generation +1

Automatic Extraction of Rules Governing Morphological Agreement

1 code implementation • EMNLP 2020 • Aditi Chaudhary, Antonios Anastasopoulos, Adithya Pratapa, David R. Mortensen, Zaid Sheikh, Yulia Tsvetkov, Graham Neubig

Using cross-lingual transfer, even with no expert annotations in the language of interest, our framework extracts a grammatical specification which is nearly equivalent to those created with large amounts of gold-standard annotated data.

Cross-Lingual Transfer Descriptive

Word Embeddings for Code-Mixed Language Processing

no code implementations • EMNLP 2018 • Adithya Pratapa, Monojit Choudhury, Sunayana Sitaram

We compare three existing bilingual word embedding approaches, and a novel approach of training skip-grams on synthetic code-mixed text generated through linguistic models of code-mixing, on two tasks - sentiment analysis and POS tagging for code-mixed text.

Machine Translation POS +3

Language Modeling for Code-Mixing: The Role of Linguistic Theory based Synthetic Data

no code implementations • ACL 2018 • Adithya Pratapa, Gayatri Bhat, Monojit Choudhury, Sunayana Sitaram, Sandipan Dandapat, Kalika Bali

Training language models for Code-mixed (CM) language is known to be a difficult problem because of lack of data compounded by the increased confusability due to the presence of more than one language.

Automatic Speech Recognition (ASR) Language Identification +3

Quantitative Characterization of Code Switching Patterns in Complex Multi-Party Conversations: A Case Study on Hindi Movie Scripts

no code implementations • WS 2017 • Adithya Pratapa, Monojit Choudhury
