Search Results for author: Mitesh M Khapra

Found 6 papers, 2 papers with code

RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations

no code implementations24 May 2025 Ashwin Sankar, Yoach Lacombe, Sherry Thomas, Praveen Srinivasa Varadhan, Sanchit Gandhi, Mitesh M Khapra

We introduce RASMALAI, a large-scale speech dataset with rich text descriptions, designed to advance controllable and expressive text-to-speech (TTS) synthesis for 23 Indian languages and English.

Expressive Speech Synthesis text-to-speech +1

BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages

1 code implementation7 Nov 2024 Sparsh Jain, Ashwin Sankar, Devilal Choudhary, Dhairya Suman, Nikhil Narasimhan, Mohammed Safi Ur Rahman Khan, Anoop Kunchukuttan, Mitesh M Khapra, Raj Dabre

To this end, we introduce BhasaAnuvaad, the largest publicly available dataset for AST involving 13 out of 22 scheduled Indian languages and English spanning over 44, 400 hours and 17M text segments.

automatic-speech-translation Synthetic Data Generation +1

How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?

1 code implementation6 Jun 2024 Anushka Singh, Ananya B. Sai, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Mitesh M Khapra

While machine translation evaluation has been studied primarily for high-resource languages, there has been a recent interest in evaluation for low-resource languages due to the increasing availability of data and models.

Machine Translation

Input-specific Attention Subnetworks for Adversarial Detection

no code implementations Findings (ACL) 2022 Emil Biju, Anirudh Sriram, Pratyush Kumar, Mitesh M Khapra

We also demonstrate that our method (a) is more accurate for larger models which are likely to have more spurious correlations and thus vulnerable to adversarial attack, and (b) performs well even with modest training sets of adversarial examples.

Adversarial Attack

A Joint Training Framework for Open-World Knowledge Graph Embeddings

no code implementations AKBC 2021 Karthik V, Beethika Tripathi, Mitesh M Khapra, Balaraman Ravindran

However, we find that existing approaches suffer from one or more of four drawbacks – 1) They are not modular with respect to the choice of the KG embedding model 2) They ignore best practices for aligning two embedding spaces 3) They do not account for differences in training strategy needed when presented with datasets with different description sizes and 4) They do not produce entity embeddings for use by downstream tasks.

Dialogue Generation Entity Embeddings +4

Cannot find the paper you are looking for? You can Submit a new open access paper.