Search Results for author: Sunita Sarawagi

Found 38 papers, 24 papers with code

Parallel Iterative Edit Models for Local Sequence Transduction

1 code implementation • IJCNLP 2019 • Abhijeet Awasthi, Sunita Sarawagi, Rasna Goyal, Sabyasachi Ghosh, Vihari Piratla

We present a Parallel Iterative Edit (PIE) model for the problem of local sequence transduction arising in tasks like Grammatical error correction (GEC).

Ranked #13 on Grammatical Error Correction on CoNLL-2014 Shared Task

Grammatical Error Correction Optical Character Recognition (OCR)

226

Paper
Code

Learning from Rules Generalizing Labeled Exemplars

2 code implementations • ICLR 2020 • Abhijeet Awasthi, Sabyasachi Ghosh, Rasna Goyal, Sunita Sarawagi

Empirical evaluation on five different tasks shows that (1) our algorithm is more accurate than several existing methods of learning from a mix of clean and noisy supervision, and (2) the coupled rule-exemplar supervision is effective in denoising rules.

Denoising

Paper
Code

Efficient Domain Generalization via Common-Specific Low-Rank Decomposition

2 code implementations • ICML 2020 • Vihari Piratla, Praneeth Netrapalli, Sunita Sarawagi

The domain specific components are discarded after training and only the common component is retained.

Ranked #1 on Domain Generalization on LipitK

Data Augmentation Domain Generalization +2

Paper
Code

Missing Value Imputation on Multidimensional Time Series

1 code implementation • 2 Mar 2021 • Parikshit Bansal, Prathamesh Deshpande, Sunita Sarawagi

Missing values are commonplace in decision support platforms that aggregate data over long time stretches from disparate sources, and reliable data analytics calls for careful handling of missing data.

Imputation Time Series +1

Paper
Code

Generalizing Across Domains via Cross-Gradient Training

1 code implementation • ICLR 2018 • Shiv Shankar, Vihari Piratla, Soumen Chakrabarti, Siddhartha Chaudhuri, Preethi Jyothi, Sunita Sarawagi

We present CROSSGRAD, a method to use multi-domain training data to learn a classifier that generalizes to new domains.

Ranked #83 on Domain Generalization on PACS

Data Augmentation Domain Generalization

Paper
Code

Long Horizon Forecasting With Temporal Point Processes

1 code implementation • 8 Jan 2021 • Prathamesh Deshpande, Kamlesh Marathe, Abir De, Sunita Sarawagi

In recent years, marked temporal point processes (MTPPs) have emerged as a powerful modeling machinery to characterize asynchronous events in a wide variety of applications.

Point Processes

Paper
Code

Trainable Calibration Measures for Neural Networks from Kernel Mean Embeddings

1 code implementation • ICML 2018 • Aviral Kumar, Sunita Sarawagi, Ujjwal Jain

Modern neural networks have recently been found to be poorly calibrated, primarily in the direction of over-confidence.

Paper
Code

Coherent Probabilistic Aggregate Queries on Long-horizon Forecasts

1 code implementation • 5 Nov 2021 • Prathamesh Deshpande, Sunita Sarawagi

Long range forecasts are the starting point of many decision support systems that need to draw inference from high-level aggregate patterns on forecasted values.

Time Series Time Series Forecasting

Paper
Code

Deep Indexed Active Learning for Matching Heterogeneous Entity Representations

1 code implementation • 8 Apr 2021 • Arjit Jain, Sunita Sarawagi, Prithviraj Sen

We propose DIAL, a scalable active learning approach that jointly learns embeddings to maximize recall for blocking and accuracy for matching blocked pairs.

Active Learning Blocking +1

Paper
Code

Surprisingly Easy Hard-Attention for Sequence to Sequence Learning

1 code implementation • EMNLP 2018 • Shiv Shankar, Siddhant Garg, Sunita Sarawagi

In this paper we show that a simple beam approximation of the joint distribution between attention and output is an easy, accurate, and efficient attention mechanism for sequence to sequence learning.

Hard Attention Image Captioning +2

Paper
Code

Error-driven Fixed-Budget ASR Personalization for Accented Speakers

1 code implementation • 4 Mar 2021 • Abhijeet Awasthi, Aman Kansal, Sunita Sarawagi, Preethi Jyothi

We consider the task of personalizing ASR models while being constrained by a fixed budget on recording speaker-specific utterances.

Sentence

Paper
Code

Training for the Future: A Simple Gradient Interpolation Loss to Generalize Along Time

1 code implementation • NeurIPS 2021 • Anshul Nasery, Soumyadeep Thakur, Vihari Piratla, Abir De, Sunita Sarawagi

In several real world applications, machine learning models are deployed to make predictions on data whose distribution changes gradually along time, leading to a drift between the train and test distributions.

Paper
Code

Data Programming using Continuous and Quality-Guided Labeling Functions

2 code implementations • 22 Nov 2019 • Oishik Chatterjee, Ganesh Ramakrishnan, Sunita Sarawagi

Scarcity of labeled data is a bottleneck for supervised learning models.

Paper
Code

Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages

1 code implementation • ACL 2022 • Vaidehi Patil, Partha Talukdar, Sunita Sarawagi

This results in improved zero-shot transfer from related HRLs to LRLs without reducing HRL representation and accuracy.

XLM-R Zero-Shot Cross-Lingual Transfer

Paper
Code

CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL

1 code implementation • 2 Nov 2023 • Mayank Kothyari, Dhruva Dhingra, Sunita Sarawagi, Soumen Chakrabarti

Standard dense retrieval techniques are inadequate for schema subsetting of a large structured database, where the correct semantics of retrieval demands that we rank sets of schema elements rather than individual elements.

Hallucination Retrieval +1

Paper
Code

Focus on the Common Good: Group Distributional Robustness Follows

1 code implementation • ICLR 2022 • Vihari Piratla, Praneeth Netrapalli, Sunita Sarawagi

We consider the problem of training a classification model with group annotated training data.

Domain Generalization

Paper
Code

Benchmarking and Improving Text-to-SQL Generation under Ambiguity

1 code implementation • 20 Oct 2023 • Adithya Bhaskar, Tushar Tomar, Ashutosh Sathe, Sunita Sarawagi

Research in Text-to-SQL conversion has been largely benchmarked against datasets where each text query corresponds to one correct SQL.

Benchmarking Natural Language Queries +2

Paper
Code

Black-box Adaptation of ASR for Accented Speech

1 code implementation • 24 Jun 2020 • Kartik Khandelwal, Preethi Jyothi, Abhijeet Awasthi, Sunita Sarawagi

Accordingly, we propose a novel coupling of an open-source accent-tuned local model with the black-box service where the output from the service guides frame-level inference in the local model.

Paper
Code

Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study

1 code implementation • ACL 2021 • Yash Khemchandani, Sarvesh Mehtani, Vaidehi Patil, Abhijeet Awasthi, Partha Talukdar, Sunita Sarawagi

RelateLM uses transliteration to convert the unseen script of limited LRL text into the script of a Related Prominent Language (RPL) (Hindi in our case).

Data Augmentation Language Modelling +2

Paper
Code

Active Assessment of Prediction Services as Accuracy Surface Over Attribute Combinations

1 code implementation • NeurIPS 2021 • Vihari Piratla, Soumen Chakrabarty, Sunita Sarawagi

Our goal is to evaluate the accuracy of a black-box classification model, not as a single aggregate on a given test data distribution, but as a surface over a large number of combinations of attributes characterizing multiple test data distributions.

Attribute

Paper
Code

Topic Sensitive Attention on Generic Corpora Corrects Sense Bias in Pretrained Embeddings

1 code implementation • ACL 2019 • Vihari Piratla, Sunita Sarawagi, Soumen Chakrabarti

Given a small corpus $\mathcal D_T$ pertaining to a limited set of focused topics, our goal is to train embeddings that accurately capture the sense of words in the topic in spite of the limited size of $\mathcal D_T$.

Paper
Code

Streaming Adaptation of Deep Forecasting Models using Adaptive Recurrent Units

1 code implementation • 24 Jun 2019 • Prathamesh Deshpande, Sunita Sarawagi

We present ARU, an Adaptive Recurrent Unit for streaming adaptation of deep globally trained time-series forecasting models.

Time Series Time Series Forecasting

Paper
Code

Training Data Augmentation for Code-Mixed Translation

1 code implementation • NAACL 2021 • Abhirut Gupta, Aditya Vavre, Sunita Sarawagi

Machine translation of user-generated code-mixed inputs to English is of crucial importance in applications like web search and targeted advertising.

Data Augmentation Machine Translation +1

Paper
Code

ARMDN: Associative and Recurrent Mixture Density Networks for eRetail Demand Forecasting

no code implementations • 10 Mar 2018 • Srayanta Mukherjee, Devashish Shankar, Atin Ghosh, Nilam Tathawadekar, Pramod Kompalli, Sunita Sarawagi, Krishnendu Chaudhury

Accurate demand forecasts can help on-line retail organizations better plan their supply-chain processes.

Time Series Time Series Analysis

Paper
Add Code

Labeled Memory Networks for Online Model Adaptation

no code implementations • 5 Jul 2017 • Shiv Shankar, Sunita Sarawagi

In this paper, we establish their potential in online adapting a batch trained neural network to domain-relevant labeled data at deployment time.

Few-Shot Learning

Paper
Add Code

Length bias in Encoder Decoder Models and a Case for Global Conditioning

no code implementations • EMNLP 2016 • Pavel Sountsov, Sunita Sarawagi

Encoder-decoder networks are popular for modeling sequences probabilistically in many applications.

Paper
Add Code

Occurrence Statistics of Entities, Relations and Types on the Web

no code implementations • 14 May 2016 • Aman Madaan, Sunita Sarawagi

This is owing to the severe mismatch in the distributions of such entities on the web and in the relatively diminutive training data.

Entity Disambiguation

Paper
Add Code

Posterior Attention Models for Sequence to Sequence Learning

no code implementations • ICLR 2019 • Shiv Shankar, Sunita Sarawagi

Modern neural architectures critically rely on attention for mapping structured inputs to sequences.

Morphological Inflection Position +1

Paper
Add Code

Calibration of Encoder Decoder Models for Neural Machine Translation

no code implementations • 3 Mar 2019 • Aviral Kumar, Sunita Sarawagi

We study the calibration of several state of the art neural machine translation(NMT) systems built on attention-based encoder-decoder models.

Machine Translation NMT +1

Paper
Add Code

What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?

no code implementations • WS 2020 • Sriram Balasubramanian, Naman jain, Gaurav Jindal, Abhijeet Awasthi, Sunita Sarawagi

We evaluate named entity representations of BERT-based NLP models by investigating their robustness to replacements from the same typed class in the input.

Paper
Add Code

NLP Service APIs and Models for Efficient Registration of New Clients

no code implementations • Findings of the Association for Computational Linguistics 2020 • Sahil Shah, Vihari Piratla, Soumen Chakrabarti, Sunita Sarawagi

Each client uses an unsupervised, corpus-based sketch to register to the service.

Language Modelling NER

Paper
Add Code

Adaptive Discounting of Implicit Language Models in RNN-Transducers

no code implementations • 21 Feb 2022 • Vinit Unni, Shreya Khare, Ashish Mittal, Preethi Jyothi, Sunita Sarawagi, Samarth Bharadwaj

RNN-Transducer (RNN-T) models have become synonymous with streaming end-to-end ASR systems.

Language Modelling

Paper
Add Code

Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding

no code implementations • ACL 2022 • Soumya Chatterjee, Sunita Sarawagi, Preethi Jyothi

Online alignment in machine translation refers to the task of aligning a target word to a source word when the target sequence has only been partially decoded.

Machine Translation Translation

Paper
Add Code

Bootstrapping Multilingual Semantic Parsers using Large Language Models

no code implementations • 13 Oct 2022 • Abhijeet Awasthi, Nitish Gupta, Bidisha Samanta, Shachi Dave, Sunita Sarawagi, Partha Talukdar

Despite cross-lingual generalization demonstrated by pre-trained multilingual models, the translate-train paradigm of transferring English datasets across multiple languages remains to be a key mechanism for training task-specific multilingual models.

Semantic Parsing Translation

Paper
Add Code

Diverse Parallel Data Synthesis for Cross-Database Adaptation of Text-to-SQL Parsers

no code implementations • 29 Oct 2022 • Abhijeet Awasthi, Ashutosh Sathe, Sunita Sarawagi

Text-to-SQL parsers typically struggle with databases unseen during the train time.

Data Augmentation Natural Language Queries +3

Paper
Add Code

Structured Case-based Reasoning for Inference-time Adaptation of Text-to-SQL parsers

no code implementations • 10 Jan 2023 • Abhijeet Awasthi, Soumen Chakrabarti, Sunita Sarawagi

To the best of our knowledge, we are the first to attempt inference-time adaptation of Text-to-SQL models, and harness trainable structured similarity between subqueries.

Semantic Parsing Text-To-SQL

Paper
Add Code

Improving RNN-Transducers with Acoustic LookAhead

no code implementations • 11 Jul 2023 • Vinit S. Unni, Ashish Mittal, Preethi Jyothi, Sunita Sarawagi

RNN-Transducers (RNN-Ts) have gained widespread acceptance as an end-to-end model for speech to text conversion because of their high accuracy and streaming capabilities.

Hallucination

Paper
Add Code

Continuous Treatment Effect Estimation Using Gradient Interpolation and Kernel Smoothing

1 code implementation • 27 Jan 2024 • Lokesh Nagalapatti, Akshay Iyer, Abir De, Sunita Sarawagi

The main challenge in this estimation task is the potential confounding of treatment assignment with an individual's covariates in the training data, whereas during inference ICTE requires prediction on independently sampled treatments.

counterfactual

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.