Search Results for author: Ganesh Ramakrishnan

Found 71 papers, 26 papers with code

WARM: A Weakly (+Semi) Supervised Math Word Problem Solver

1 code implementation COLING 2022 Oishik Chatterjee, Isha Pandey, Aashish Waikar, Vishwajeet Kumar, Ganesh Ramakrishnan

In order to address this challenge of equation annotation, we propose a weakly supervised model for solving MWPs by requiring only the final answer as supervision.

A Benchmark and Dataset for Post-OCR text correction in Sanskrit

1 code implementation15 Nov 2022 Ayush Maheshwari, Nikhil Singh, Amrith Krishna, Ganesh Ramakrishnan

Keeping this in mind, we release a multi-domain dataset, from areas as diverse as astronomy, medicine and mathematics, with some of them as old as 18 centuries.

Astronomy Optical Character Recognition (OCR)

Speeding up NAS with Adaptive Subset Selection

no code implementations2 Nov 2022 Vishak Prasad C, Colin White, Paarth Jain, Sibasis Nayak, Ganesh Ramakrishnan

A majority of recent developments in neural architecture search (NAS) have been aimed at decreasing the computational cost of various techniques without affecting their final performance.

Neural Architecture Search

Partitioned Gradient Matching-based Data Subset Selection for Compute-Efficient Robust ASR Training

no code implementations30 Oct 2022 Ashish Mittal, Durga Sivasubramanian, Rishabh Iyer, Preethi Jyothi, Ganesh Ramakrishnan

Training state-of-the-art ASR systems such as RNN-T often has a high associated financial and environmental cost.

DICTDIS: Dictionary Constrained Disambiguation for Improved NMT

no code implementations13 Oct 2022 Ayush Maheshwari, Piyush Sharma, Preethi Jyothi, Ganesh Ramakrishnan

In this work we present DICTDIS, a lexically constrained NMT system that disambiguates between multiple candidate translations derived from dictionaries.

Machine Translation NMT

AutoML for Climate Change: A Call to Action

1 code implementation7 Oct 2022 Renbo Tu, Nicholas Roberts, Vishak Prasad, Sibasis Nayak, Paarth Jain, Frederic Sala, Ganesh Ramakrishnan, Ameet Talwalkar, Willie Neiswanger, Colin White

The challenge that climate change poses to humanity has spurred a rapidly developing field of artificial intelligence research focused on climate change applications.


CLINICAL: Targeted Active Learning for Imbalanced Medical Image Classification

no code implementations4 Oct 2022 Suraj Kothawade, Atharv Savarkar, Venkat Iyer, Lakshman Tamil, Ganesh Ramakrishnan, Rishabh Iyer

It is often the case that a suboptimal performance is obtained on some classes due to the natural class imbalance issue that comes with medical data.

Active Learning Image Classification +1

DIAGNOSE: Avoiding Out-of-distribution Data using Submodular Information Measures

no code implementations4 Oct 2022 Suraj Kothawade, Akshit Srivastava, Venkat Iyer, Ganesh Ramakrishnan, Rishabh Iyer

Avoiding out-of-distribution (OOD) data is critical for training supervised machine learning models in the medical imaging domain.

Active Learning

Investigating Modality Bias in Audio Visual Video Parsing

no code implementations31 Mar 2022 Piyush Singh Pasi, Shubham Nemani, Preethi Jyothi, Ganesh Ramakrishnan

We focus on the audio-visual video parsing (AVVP) problem that involves detecting audio and visual event labels with temporal boundaries.

AUTOMATA: Gradient Based Data Subset Selection for Compute-Efficient Hyper-parameter Tuning

1 code implementation15 Mar 2022 KrishnaTeja Killamsetty, Guttu Sai Abhishek, Aakriti, Alexandre V. Evfimievski, Lucian Popa, Ganesh Ramakrishnan, Rishabh Iyer

Our central insight is that using an informative subset of the dataset for model training runs involved in hyper-parameter optimization, allows us to find the optimal hyper-parameter configuration significantly faster.

BASIL: Balanced Active Semi-supervised Learning for Class Imbalanced Datasets

no code implementations10 Mar 2022 Suraj Kothawade, Pavan Kumar Reddy, Ganesh Ramakrishnan, Rishabh Iyer

This issue is further pronounced in SSL methods, as they would use this biased model to obtain psuedo-labels (on the unlabeled data) during training.

Active Learning

UDAAN: Machine Learning based Post-Editing tool for Document Translation

1 code implementation3 Mar 2022 Ayush Maheshwari, Ajay Ravindran, Venkatapathy Subramanian, Ganesh Ramakrishnan

UDAAN has an end-to-end Machine Translation (MT) plus post-editing pipeline wherein users can upload a document to obtain raw MT output.

BIG-bench Machine Learning Document Translation +2

Submodlib: A Submodular Optimization Library

1 code implementation22 Feb 2022 Vishal Kaushal, Ganesh Ramakrishnan, Rishabh Iyer

A recent work has also leveraged submodular functions to propose submodular information measures which have been found to be very useful in solving the problems of guided subset selection and guided summarization.

Data Summarization

Adaptive Mixing of Auxiliary Losses in Supervised Learning

1 code implementation7 Feb 2022 Durga Sivasubramanian, Ayush Maheshwari, Pradeep Shenoy, Prathosh AP, Ganesh Ramakrishnan

In several supervised learning scenarios, auxiliary losses are used in order to introduce additional information or constraints into the supervised learning objective.

Denoising Knowledge Distillation +1

Error Correction in ASR using Sequence-to-Sequence Models

no code implementations2 Feb 2022 Samrat Dutta, Shreyansh Jain, Ayush Maheshwari, Souvik Pal, Ganesh Ramakrishnan, Preethi Jyothi

Post-editing in Automatic Speech Recognition (ASR) entails automatically correcting common and systematic errors produced by the ASR system.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Targeted Subset Selection for Limited-data ASR Accent Adaptation

no code implementations10 Oct 2021 Mayank Kothyari, Anmol Reddy Mekala, Rishabh Iyer, Ganesh Ramakrishnan, Preethi Jyothi

We propose a subset selection approach using the recently proposed submodular mutual information functions, in which we identify a diverse set of utterances that match the target accent.

Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting

1 code implementation19 Aug 2021 Sravya Vardhani Shivapuja, Mansi Pradeep Khamkar, Divij Bajaj, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla

We analyze the performance of representative crowd counting approaches across standard datasets at per strata level and in aggregate.

Crowd Counting

SPEAR : Semi-supervised Data Programming in Python

1 code implementation1 Aug 2021 Guttu Sai Abhishek, Harshad Ingole, Parth Laturia, Vineeth Dorna, Ayush Maheshwari, Rishabh Iyer, Ganesh Ramakrishnan

SPEAR facilitates weak supervision in the form of heuristics (or rules) and association of noisy labels to the training dataset.

Association text-classification +1

Training Data Subset Selection for Regression with Controlled Generalization Error

1 code implementation23 Jun 2021 Durga Sivasubramanian, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

First, we represent this problem with simplified constraints using the dual of the original training problem and show that the objective of this new representation is a monotone and alpha-submodular function, for a wide variety of modeling choices.


Effective Evaluation of Deep Active Learning on Image Classification Tasks

no code implementations16 Jun 2021 Nathan Beck, Durga Sivasubramanian, Apurva Dani, Ganesh Ramakrishnan, Rishabh Iyer

Issues in the current literature include sometimes contradictory observations on the performance of different AL algorithms, unintended exclusion of important generalization approaches such as data augmentation and SGD for optimization, a lack of study of evaluation facets like the labeling efficiency of AL, and little or no clarity on the scenarios in which AL outperforms random sampling (RS).

Active Learning Benchmarking +3

Submodular Mutual Information for Targeted Data Subset Selection

no code implementations30 Apr 2021 Suraj Kothawade, Vishal Kaushal, Ganesh Ramakrishnan, Jeff Bilmes, Rishabh Iyer

With the rapid growth of data, it is becoming increasingly difficult to train or improve deep learning models with the right subset of data.

Active Learning Image Classification

A Weakly Supervised Model for Solving Math word Problems

no code implementations14 Apr 2021 Oishik Chatterjee, Aashish Waikar, Vishwajeet Kumar, Ganesh Ramakrishnan, Kavi Arya

In order to address this challenge of equation annotation, we propose a weakly supervised model for solving math word problems by requiring only the final answer as supervision.

Cross-Modal learning for Audio-Visual Video Parsing

1 code implementation3 Apr 2021 Jatin Lamba, abhishek, Jayaprakash Akula, Rishabh Dabral, Preethi Jyothi, Ganesh Ramakrishnan

In this paper, we present a novel approach to the audio-visual video parsing (AVVP) task that demarcates events from a video separately for audio and visual modalities.

Event Detection Multiple Instance Learning +1

Rudder: A Cross Lingual Video and Text Retrieval Dataset

1 code implementation9 Mar 2021 Jayaprakash A, abhishek, Rishabh Dabral, Ganesh Ramakrishnan, Preethi Jyothi

Video retrieval using natural language queries requires learning semantically meaningful joint embeddings between the text and the audio-visual input.

Natural Language Queries Retrieval +2

GRAD-MATCH: Gradient Matching based Data Subset Selection for Efficient Deep Model Training

2 code implementations27 Feb 2021 KrishnaTeja Killamsetty, Durga Sivasubramanian, Ganesh Ramakrishnan, Abir De, Rishabh Iyer

We show rigorous theoretical and convergence guarantees of the proposed algorithm and, through our extensive experiments on real-world datasets, show the effectiveness of our proposed framework.

PRISM: A Rich Class of Parameterized Submodular Information Measures for Guided Subset Selection

1 code implementation27 Feb 2021 Suraj Kothawade, Vishal Kaushal, Ganesh Ramakrishnan, Jeff Bilmes, Rishabh Iyer

Examples of such problems include: i)targeted learning, where the goal is to find subsets with rare classes or rare attributes on which the model is underperforming, and ii)guided summarization, where data (e. g., image collection, text, document or video) is summarized for quicker human consumption with specific additional user intent.

Image Classification

Joint Learning of Hyperbolic Label Embeddings for Hierarchical Multi-label Classification

1 code implementation EACL 2021 Soumya Chatterjee, Ayush Maheshwari, Ganesh Ramakrishnan, Saketha Nath Jagaralpudi

Such a joint learning is expected to provide a twofold advantage: i) the classifier generalizes better as it leverages the prior knowledge of existence of a hierarchy over the labels, and ii) in addition to the label co-occurrence information, the label-embedding may benefit from the manifold structure of the input datapoints, leading to embeddings that are more faithful to the label hierarchy.

General Classification Hierarchical Multi-label Classification +1

GLISTER: Generalization based Data Subset Selection for Efficient and Robust Learning

1 code implementation19 Dec 2020 KrishnaTeja Killamsetty, Durga Sivasubramanian, Ganesh Ramakrishnan, Rishabh Iyer

Finally, we propose Glister-Active, an extension to batch active learning, and we empirically demonstrate the performance of Glister on a wide range of tasks including, (a) data selection to reduce training time, (b) robust learning under label noise and imbalance settings, and (c) batch-active learning with several deep and shallow models.

Active Learning

LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos

1 code implementation17 Dec 2020 Sai Praneeth Reddy Sunkesula, Rishabh Dabral, Ganesh Ramakrishnan

Analyzing the interactions between humans and objects from a video includes identification of the relationships between humans and the objects present in the video.

Human-Object Interaction Detection Visual Relationship Detection

Semi-Supervised Data Programming with Subset Selection

1 code implementation Findings (ACL) 2021 Ayush Maheshwari, Oishik Chatterjee, KrishnaTeja Killamsetty, Ganesh Ramakrishnan, Rishabh Iyer

The first contribution of this work is an introduction of a framework, \model which is a semi-supervised data programming paradigm that learns a \emph{joint model} that effectively uses the rules/labelling functions along with semi-supervised loss functions on the feature space.

text-classification Text Classification

Realistic Video Summarization through VISIOCITY: A New Benchmark and Evaluation Framework

no code implementations29 Jul 2020 Vishal Kaushal, Suraj Kothawade, Rishabh Iyer, Ganesh Ramakrishnan

Thirdly, we demonstrate that in the presence of multiple ground truth summaries (due to the highly subjective nature of the task), learning from a single combined ground truth summary using a single loss function is not a good idea.

Benchmarking Video Summarization

Question Generation from Paragraphs: A Tale of Two Hierarchical Models

no code implementations8 Nov 2019 Vishwajeet Kumar, Raktim Chaki, Sai Teja Talluri, Ganesh Ramakrishnan, Yuan-Fang Li, Gholamreza Haffari

Specifically, we propose (a) a novel hierarchical BiLSTM model with selective attention and (b) a novel hierarchical Transformer architecture, both of which learn hierarchical representations of paragraphs.

Question Generation Question-Generation

Putting the Horse before the Cart: A Generator-Evaluator Framework for Question Generation from Text

no code implementations CONLL 2019 Vishwajeet Kumar, Ganesh Ramakrishnan, Yuan-Fang Li

The \textit{generator} is a sequence-to-sequence model that incorporates the \textit{structure} and \textit{semantics} of the question being generated.

Question Generation Question-Generation

Tale of tails using rule augmented sequence labeling for event extraction

no code implementations19 Aug 2019 Ayush Maheshwari, Hrishikesh Patel, Nandan Rathod, Ritesh Kumar, Ganesh Ramakrishnan, Pushpak Bhattacharyya

The problem of event extraction is a relatively difficult task for low resource languages due to the non-availability of sufficient annotated data.

Event Extraction

Putting the Horse Before the Cart:A Generator-Evaluator Framework for Question Generation from Text

no code implementations15 Aug 2018 Vishwajeet Kumar, Ganesh Ramakrishnan, Yuan-Fang Li

The {\it generator} is a sequence-to-sequence model that incorporates the {\it structure} and {\it semantics} of the question being generated.

Question Generation Question-Generation

Entity Resolution and Location Disambiguation in the Ancient Hindu Temples Domain using Web Data

no code implementations NAACL 2018 Ayush Maheshwari, Vishwajeet Kumar, Ganesh Ramakrishnan, J. Saketha Nath

We present a system for resolving entities and disambiguating locations based on publicly available web data in the domain of ancient Hindu Temples.

Entity Resolution

Learning From Less Data: Diversified Subset Selection and Active Learning in Image Classification Tasks

no code implementations28 May 2018 Vishal Kaushal, Anurag Sahoo, Khoshrav Doctor, Narasimha Raju, Suyash Shetty, Pankaj Singh, Rishabh Iyer, Ganesh Ramakrishnan

Supervised machine learning based state-of-the-art computer vision techniques are in general data hungry and pose the challenges of not having adequate computing resources and of high costs involved in human labeling efforts.

Active Learning BIG-bench Machine Learning +4

Learning Discriminative Relational Features for Sequence Labeling

no code implementations7 May 2017 Naveen Nair, Ajay Nagesh, Ganesh Ramakrishnan

For learning features derived from inputs at a particular sequence position, we propose a Hierarchical Kernels-based approach (referred to as Hierarchical Kernel Learning for Structured Output Spaces - StructHKL).

A Unified Multi-Faceted Video Summarization System

no code implementations4 Apr 2017 Anurag Sahoo, Vishal Kaushal, Khoshrav Doctor, Suyash Shetty, Rishabh Iyer, Ganesh Ramakrishnan

Most importantly, we also show that we can summarize hours of video data in a few seconds, and our system allows the user to generate summaries of various lengths and types interactively on the fly.

Extractive Summarization Video Summarization

Efficient Reuse of Structured and Unstructured Resources for Ontology Population

no code implementations LREC 2014 Chetana Gavankar, Ashish Kulkarni, Ganesh Ramakrishnan

A domain ontology for an organization, often consists of classes whose instances are either specific to, or independent of the organization.

Information Retrieval Retrieval +1

Cannot find the paper you are looking for? You can Submit a new open access paper.