Search Results for author: Kalika Bali

Found 30 papers, 3 papers with code

Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi

no code implementations26 Jun 2022 Ritesh Kumar, Siddharth Singh, Shyam Ratan, Mohit Raj, Sonal Sinha, Bornini Lahiri, Vivek Seshadri, Kalika Bali, Atul Kr. Ojha

In this paper we discuss an in-progress work on the development of a speech corpus for four low-resource Indo-Aryan languages -- Awadhi, Bhojpuri, Braj and Magahi using the field methods of linguistic data collection.

Automatic Speech Recognition speech-recognition

Predicting the Performance of Multilingual NLP Models

no code implementations17 Oct 2021 Anirudh Srinivasan, Sunayana Sitaram, Tanuja Ganu, Sandipan Dandapat, Kalika Bali, Monojit Choudhury

Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages.

Multilingual NLP

Designing Language Technologies for Social Good: The Road not Taken

no code implementations14 Oct 2021 Namrata Mukhija, Monojit Choudhury, Kalika Bali

Development of speech and language technology for social good (LT4SG), especially those targeted at the welfare of marginalized communities and speakers of low-resource and under-served languages, has been a prominent theme of research within NLP, Speech, and the AI communities.

Understanding Script-Mixing: A Case Study of Hindi-English Bilingual Twitter Users

no code implementations LREC 2020 Abhishek Srivastava, Kalika Bali, Monojit Choudhury

Our analysis shows that both intra-sentential and inter-sentential script-mixing are present on Twitter and show different behavior in different contexts.

INMT: Interactive Neural Machine Translation Prediction

1 code implementation IJCNLP 2019 Sebastin Santy, D, S apat, ipan, Monojit Choudhury, Kalika Bali

In this paper, we demonstrate an Interactive Machine Translation interface, that assists human translators with on-the-fly hints and suggestions.

Machine Translation Translation

Phone Merging For Code-Switched Speech Recognition

no code implementations WS 2018 Sunit Sivasankaran, Brij Mohan Lal Srivastava, Sunayana Sitaram, Kalika Bali, Monojit Choudhury

Though the best performance gain of 1. 2{\%} WER was observed with manually merged phones, we show experimentally that the manual phone merge is not optimal.

Automatic Speech Recognition speech-recognition

Accommodation of Conversational Code-Choice

no code implementations WS 2018 Anshul Bawa, Monojit Choudhury, Kalika Bali

We find that the saliency or markedness of a language in context directly affects the degree of accommodation observed.

Information Retrieval

Language Modeling for Code-Mixing: The Role of Linguistic Theory based Synthetic Data

no code implementations ACL 2018 Adithya Pratapa, Gayatri Bhat, Monojit Choudhury, Sunayana Sitaram, D, S apat, ipan, Kalika Bali

Training language models for Code-mixed (CM) language is known to be a difficult problem because of lack of data compounded by the increased confusability due to the presence of more than one language.

Automatic Speech Recognition Language Identification +1

Estimating Code-Switching on Twitter with a Novel Generalized Word-Level Language Detection Technique

no code implementations ACL 2017 Shruti Rijhwani, Royal Sequiera, Monojit Choudhury, Kalika Bali, Ch Maddila, ra Shekhar

Word-level language detection is necessary for analyzing code-switched text, where multiple languages could be mixed within a sentence.

Grammatical Constraints on Intra-sentential Code-Switching: From Theories to Working Models

1 code implementation14 Dec 2016 Gayatri Bhat, Monojit Choudhury, Kalika Bali

We make one of the first attempts to build working models for intra-sentential code-switching based on the Equivalence-Constraint (Poplack 1980) and Matrix-Language (Myers-Scotton 1993) theories.

Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments

no code implementations LREC 2016 Rafiya Begum, Kalika Bali, Monojit Choudhury, Koustav Rudra, Niloy Ganguly

Code-Switching (CS) between two languages is extremely common in communities with societal multilingualism where speakers switch between two or more languages when interacting with each other.

Natural Language Processing

Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics

no code implementations LREC 2012 Kanika Gupta, Monojit Choudhury, Kalika Bali

This paper describes a method to mine Hindi-English transliteration pairs from online Hindi song lyrics.

Transliteration

Cannot find the paper you are looking for? You can Submit a new open access paper.