1 code implementation • EMNLP (WNUT) 2020 • Sharanya Chakravarthy, Anjana Umapathy, Alan W Black
The presence of large-scale corpora for Natural Language Inference (NLI) has spurred deep learning research in this area, though much of this research has focused solely on monolingual data.
no code implementations • LREC 2022 • Xinjian Li, Florian Metze, David R. Mortensen, Alan W Black, Shinji Watanabe
Identifying phone inventories is a crucial component in language documentation and the preservation of endangered languages.
no code implementations • EACL (DravidianLangTech) 2021 • Akshat Gupta, Sai Krishna Rallabandi, Alan W Black
Sentiment analysis in Code-Mixed languages has garnered a lot of attention in recent years.
1 code implementation • 16 Oct 2023 • Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W Black, Gopala K. Anumanchipalli
Data-driven unit discovery in self-supervised learning (SSL) of speech has embarked on a new era of spoken language processing.
no code implementations • 16 Oct 2023 • Cheol Jun Cho, Abdelrahman Mohamed, Alan W Black, Gopala K. Anumanchipalli
Self-Supervised Learning (SSL) based models of speech have shown remarkable performance on a range of downstream tasks.
1 code implementation • 5 Jul 2023 • Peter Wu, Tingle Li, Yijing Lu, Yubin Zhang, Jiachen Lian, Alan W Black, Louis Goldstein, Shinji Watanabe, Gopala K. Anumanchipalli
Finally, through a series of ablations, we show that the proposed MRI representation is more comprehensive than EMA and identify the most suitable MRI feature subset for articulatory synthesis.
1 code implementation • 14 Feb 2023 • Peter Wu, Li-Wei Chen, Cheol Jun Cho, Shinji Watanabe, Louis Goldstein, Alan W Black, Gopala K. Anumanchipalli
To build speech processing methods that can handle speech as naturally as humans, researchers have explored multiple ways of building an invertible mapping from speech to an interpretable space.
no code implementations • 29 Oct 2022 • Jiachen Lian, Alan W Black, Yijing Lu, Louis Goldstein, Shinji Watanabe, Gopala K. Anumanchipalli
In this work, we propose a novel articulatory representation decomposition algorithm that takes advantage of guided factor analysis to derive the articulatory-specific factors and factor scores.
1 code implementation • 27 Oct 2022 • Siddhant Arora, Siddharth Dalmia, Brian Yan, Florian Metze, Alan W Black, Shinji Watanabe
End-to-end spoken language understanding (SLU) systems are gaining popularity over cascaded approaches due to their simplicity and ability to avoid error propagation.
no code implementations • 27 Oct 2022 • Yisi Liu, Peter Wu, Alan W Black, Gopala K. Anumanchipalli
Estimation of fundamental frequency (F0) in voiced segments of speech signals, also known as pitch tracking, plays a crucial role in pitch synchronous speech analysis, speech synthesis, and speech manipulation.
no code implementations • 11 Oct 2022 • Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W Black, Shinji Watanabe
Connectionist Temporal Classification (CTC) is a widely used approach for automatic speech recognition (ASR) that performs conditionally independent monotonic alignment.
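As a rough illustration of the conditionally independent, monotonic alignment mentioned above, the sketch below shows greedy CTC decoding: each frame's label is chosen on its own, consecutive repeats are merged, and blanks are dropped. The blank symbol `_`, the toy vocabulary, and the function names are assumptions made for this example and are not taken from the paper.

```python
from itertools import groupby

BLANK = "_"  # hypothetical blank symbol chosen for this sketch


def ctc_collapse(frame_labels, blank=BLANK):
    """Map a frame-level label sequence to an output sequence.

    Consecutive repeated labels are merged first, then blanks are removed.
    """
    merged = [label for label, _ in groupby(frame_labels)]
    return [label for label in merged if label != blank]


def greedy_decode(frame_posteriors, vocab, blank=BLANK):
    """Pick the most likely label per frame independently, then collapse.

    frame_posteriors: list of per-frame probability lists, one entry per vocab item.
    vocab: list of labels aligned with the probability entries.
    """
    best = [
        vocab[max(range(len(probs)), key=probs.__getitem__)]
        for probs in frame_posteriors
    ]
    return ctc_collapse(best, blank)


if __name__ == "__main__":
    # A blank between the two "l" runs keeps them as separate output labels.
    print("".join(ctc_collapse(list("hh_e_ll_lo"))))
```

Because each frame is decoded without looking at its neighbours, the output can only follow the input order, which is what makes the alignment monotonic.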
1 code implementation • 13 Sep 2022 • Peter Wu, Shinji Watanabe, Louis Goldstein, Alan W Black, Gopala K. Anumanchipalli
In the articulatory synthesis task, speech is synthesized from input features containing information about the physical behavior of the human vocal tract.
1 code implementation • 6 Sep 2022 • Xinjian Li, Florian Metze, David R. Mortensen, Alan W Black, Shinji Watanabe
We achieve 50% CER and 74% WER on the Wilderness dataset with Crubadan statistics only and improve them to 45% CER and 69% WER when using 10000 raw text utterances.
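For readers unfamiliar with the two metrics quoted above, here is a minimal sketch of how character error rate (CER) and word error rate (WER) are commonly computed: the Levenshtein distance between reference and hypothesis, divided by the reference length. The function names and example strings are assumptions for illustration; the paper's own scoring setup is not described in this listing and may differ in normalization details.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitute, insert, delete)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance over reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance over reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


if __name__ == "__main__":
    ref, hyp = "the cat sat", "the cat sit"
    print(f"WER = {wer(ref, hyp):.3f}, CER = {cer(ref, hyp):.3f}")
```

A single wrong character can make a whole word count as an error, which is why WER is typically higher than CER on the same output, as in the figures reported above.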
1 code implementation • 1 Jul 2022 • Perez Ogayo, Graham Neubig, Alan W Black
This paper focuses on speech synthesis for low-resourced African languages, from corpus creation to sharing and deploying the Text-to-Speech (TTS) systems.
no code implementations • 24 May 2022 • Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W Black, Ana Marasović
Combining the visual modality with pretrained language models has been surprisingly effective for simple descriptive tasks such as image captioning.
1 code implementation • 1 Apr 2022 • Jiachen Lian, Alan W Black, Louis Goldstein, Gopala Krishna Anumanchipalli
Most of the research on data-driven speech representation learning has focused on raw audio in an end-to-end manner, paying little attention to its internal phonological or gestural structure.
2 code implementations • 29 Nov 2021 • Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W Black, Shinji Watanabe
However, there are few open source toolkits that can be used to generate reproducible results on different Spoken Language Understanding (SLU) benchmarks.
1 code implementation • 2 Nov 2021 • Peter Wu, Jiatong Shi, Yifan Zhong, Shinji Watanabe, Alan W Black
We demonstrate the effectiveness of our approach in language family classification, speech recognition, and speech synthesis tasks.
no code implementations • Findings (EMNLP) 2021 • Parul Chopra, Sai Krishna Rallabandi, Alan W Black, Khyathi Raghavi Chandu
Code-switching (CS), a ubiquitous phenomenon due to the ease of communication it offers in multilingual communities, remains an understudied problem in language processing.
no code implementations • 31 Oct 2021 • Anurag Katakkar, Alan W Black
For the English language, these LMs treat words as atomic units, which presents inherent challenges to language modelling in the speech domain.
no code implementations • 18 Oct 2021 • Hemant Yadav, Akshat Gupta, Sai Krishna Rallabandi, Alan W Black, Rajiv Ratn Shah
We perform experiments across three different languages (English, Sinhala, and Tamil), each with a different data size, to simulate high-, medium-, and low-resource scenarios.
no code implementations • 12 Oct 2021 • Roshan Sharma, Shruti Palaskar, Alan W Black, Florian Metze
End-to-end modeling of speech summarization models is challenging due to memory and compute constraints arising from long input audio sequences.
no code implementations • ACL 2021 • Abhilasha Ravichander, Alan W Black, Thomas Norton, Shomir Wilson, Norman Sadeh
Privacy plays a crucial role in preserving democratic ideals and personal autonomy.
no code implementations • 29 Jun 2021 • Siddhant Arora, Alissa Ostapenko, Vijay Viswanathan, Siddharth Dalmia, Florian Metze, Shinji Watanabe, Alan W Black
Our splits identify performance gaps up to 10% between end-to-end systems that were within 1% of each other on the original test sets.
1 code implementation • NAACL (CALCS) 2021 • Sai Muralidhar Jayanthi, Kavya Nerella, Khyathi Raghavi Chandu, Alan W Black
The NLP community has witnessed steep progress in a variety of tasks across the realms of monolingual and multilingual language processing recently.
no code implementations • 4 Jun 2021 • Khyathi Raghavi Chandu, Yonatan Bisk, Alan W Black
And finally, (3) How to advance our current definition to bridge the gap with Cognitive Science?
1 code implementation • NAACL 2021 • Shrimai Prabhumoye, Kazuma Hashimoto, Yingbo Zhou, Alan W Black, Ruslan Salakhutdinov
Document-grounded generation is the task of using the information provided in a document to improve text generation.
no code implementations • 3 Apr 2021 • Akshat Gupta, Olivia Deng, Akruti Kushwaha, Saloni Mittal, William Zeng, Sai Krishna Rallabandi, Alan W Black
We build a word-free natural language understanding module that performs intent recognition and slot identification from these phonetic transcriptions.
no code implementations • NAACL (CALCS) 2021 • Akshat Gupta, Sargam Menghani, Sai Krishna Rallabandi, Alan W Black
We propose a general framework called Unsupervised Self-Training and show its applications for the specific use case of sentiment analysis of code-switched data.
2 code implementations • EACL 2021 • Abhilasha Ravichander, Siddharth Dalmia, Maria Ryskina, Florian Metze, Eduard Hovy, Alan W Black
When Question-Answering (QA) systems are deployed in the real world, users query them through a variety of interfaces, such as speaking to voice assistants, typing questions into a search engine, or even translating questions to languages supported by the QA system.
1 code implementation • 1 Dec 2020 • Peter Wu, Yifan Zhong, Alan W Black
Existing multilingual speech NLP works focus on a relatively small subset of languages, and thus current linguistic understanding of languages predominantly stems from classical approaches.
no code implementations • 7 Nov 2020 • Akshat Gupta, Xinjian Li, Sai Krishna Rallabandi, Alan W Black
With the aim of aiding the development of spoken dialog systems in low-resourced languages, we propose a novel acoustics-based intent recognition system that uses discovered phonetic units for intent classification.
1 code implementation • CONLL 2020 • Tanmay Parekh, Emily Ahn, Yulia Tsvetkov, Alan W Black
Code-switching is a ubiquitous phenomenon in multilingual communities.
no code implementations • 20 Oct 2020 • Yiyuan Li, Antonios Anastasopoulos, Alan W Black
In this work, we design a knowledge-base and prediction model embedded system for spelling correction in low-resource languages.
no code implementations • 14 Oct 2020 • Khyathi Raghavi Chandu, Alan W Black
Neural text generation has metamorphosed into several critical natural language applications, ranging from text completion to free-form narrative generation.
no code implementations • NAACL 2021 • Shrimai Prabhumoye, Brendon Boldt, Ruslan Salakhutdinov, Alan W Black
Recent work in natural language processing (NLP) has focused on ethical challenges such as understanding and mitigating bias in data and algorithms; identifying objectionable content like hate speech, stereotypes and offensive language; and building frameworks for better system design and data handling practices.
no code implementations • 9 Oct 2020 • Akshat Gupta, Sai Krishna Rallabandi, Alan W Black
Tremendous progress in speech and language processing has brought language technologies closer to daily human life.
no code implementations • 27 Sep 2018 • Xinjian Li, Siddharth Dalmia, David R. Mortensen, Florian Metze, Alan W Black
Our model is able to recognize unseen phonemes in the target language, if only a small text corpus is available.