Search Results for author: Alan W Black

Found 40 papers, 16 papers with code

Detecting Entailment in Code-Mixed Hindi-English Conversations

1 code implementation • EMNLP (WNUT) 2020 • Sharanya Chakravarthy, Anjana Umapathy, Alan W Black

The presence of large-scale corpora for Natural Language Inference (NLI) has spurred deep learning research in this area, though much of this research has focused solely on monolingual data.

Data Augmentation Language Modelling +3

Paper
Code

Grounding ‘Grounding’ in NLP

no code implementations • Findings (ACL) 2021 • Khyathi Raghavi Chandu, Yonatan Bisk, Alan W Black

Paper
Add Code

What Code-Switching Strategies are Effective in Dialog Systems?

no code implementations • SCiL 2020 • Emily Ahn, Cecilia Jimenez, Yulia Tsvetkov, Alan W Black

Paper
Add Code

Phone Inventories and Recognition for Every Language

no code implementations • LREC 2022 • Xinjian Li, Florian Metze, David R. Mortensen, Alan W Black, Shinji Watanabe

Identifying phone inventories is a crucial component in language documentation and the preservation of endangered languages.

Paper
Add Code

Task-Specific Pre-Training and Cross Lingual Transfer for Sentiment Analysis in Dravidian Code-Switched Languages

no code implementations • EACL (DravidianLangTech) 2021 • Akshat Gupta, Sai Krishna Rallabandi, Alan W Black

Sentiment analysis in Code-Mixed languages has garnered a lot of attention in recent years.

Cross-Lingual Transfer Sentiment Analysis +1

Paper
Add Code

SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT

1 code implementation • 16 Oct 2023 • Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W Black, Gopala K. Anumanchipalli

Data-driven unit discovery in self-supervised learning (SSL) of speech has embarked on a new era of spoken language processing.

Language Modelling Self-Supervised Learning +1

Paper
Code

Self-Supervised Models of Speech Infer Universal Articulatory Kinematics

no code implementations • 16 Oct 2023 • Cheol Jun Cho, Abdelrahman Mohamed, Alan W Black, Gopala K. Anumanchipalli

Self-Supervised Learning (SSL) based models of speech have shown remarkable performance on a range of downstream tasks.

Self-Supervised Learning

Paper
Add Code

Deep Speech Synthesis from MRI-Based Articulatory Representations

1 code implementation • 5 Jul 2023 • Peter Wu, Tingle Li, Yijing Lu, Yubin Zhang, Jiachen Lian, Alan W Black, Louis Goldstein, Shinji Watanabe, Gopala K. Anumanchipalli

Finally, through a series of ablations, we show that the proposed MRI representation is more comprehensive than EMA and identify the most suitable MRI feature subset for articulatory synthesis.

Computational Efficiency Denoising +1

Paper
Code

Speaker-Independent Acoustic-to-Articulatory Speech Inversion

1 code implementation • 14 Feb 2023 • Peter Wu, Li-Wei Chen, Cheol Jun Cho, Shinji Watanabe, Louis Goldstein, Alan W Black, Gopala K. Anumanchipalli

To build speech processing methods that can handle speech as naturally as humans, researchers have explored multiple ways of building an invertible mapping from speech to an interpretable space.

Resynthesis

Paper
Code

Articulatory Representation Learning Via Joint Factor Analysis and Neural Matrix Factorization

no code implementations • 29 Oct 2022 • Jiachen Lian, Alan W Black, Yijing Lu, Louis Goldstein, Shinji Watanabe, Gopala K. Anumanchipalli

In this work, we propose a novel articulatory representation decomposition algorithm that takes the advantage of guided factor analysis to derive the articulatory-specific factors and factor scores.

Representation Learning

Paper
Add Code

Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models

1 code implementation • 27 Oct 2022 • Siddhant Arora, Siddharth Dalmia, Brian Yan, Florian Metze, Alan W Black, Shinji Watanabe

End-to-end spoken language understanding (SLU) systems are gaining popularity over cascaded approaches due to their simplicity and ability to avoid error propagation.

named-entity-recognition Named Entity Recognition +2

7,871

Paper
Code

A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution

no code implementations • 27 Oct 2022 • Yisi Liu, Peter Wu, Alan W Black, Gopala K. Anumanchipalli

Estimation of fundamental frequency (F0) in voiced segments of speech signals, also known as pitch tracking, plays a crucial role in pitch synchronous speech analysis, speech synthesis, and speech manipulation.

Speech Synthesis

Paper
Add Code

CTC Alignments Improve Autoregressive Translation

no code implementations • 11 Oct 2022 • Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W Black, Shinji Watanabe

Connectionist Temporal Classification (CTC) is a widely used approach for automatic speech recognition (ASR) that performs conditionally independent monotonic alignment.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Deep Speech Synthesis from Articulatory Representations

1 code implementation • 13 Sep 2022 • Peter Wu, Shinji Watanabe, Louis Goldstein, Alan W Black, Gopala K. Anumanchipalli

In the articulatory synthesis task, speech is synthesized from input features containing information about the physical behavior of the human vocal tract.

Speech Synthesis

Paper
Code

ASR2K: Speech Recognition for Around 2000 Languages without Audio

1 code implementation • 6 Sep 2022 • Xinjian Li, Florian Metze, David R Mortensen, Alan W Black, Shinji Watanabe

We achieve 50% CER and 74% WER on the Wilderness dataset with Crubadan statistics only and improve them to 45% CER and 69% WER when using 10000 raw text utterances.

Language Modelling Speech Recognition

Paper
Code

Building African Voices

1 code implementation • 1 Jul 2022 • Perez Ogayo, Graham Neubig, Alan W Black

This paper focuses on speech synthesis for low-resourced African languages, from corpus creation to sharing and deploying the Text-to-Speech (TTS) systems.

Speech Synthesis

Paper
Code

On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization

no code implementations • 24 May 2022 • Shruti Palaskar, Akshita Bhagia, Yonatan Bisk, Florian Metze, Alan W Black, Ana Marasović

Combining the visual modality with pretrained language models has been surprisingly effective for simple descriptive tasks such as image captioning.

Descriptive Image Captioning +5

Paper
Add Code

Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition

1 code implementation • 1 Apr 2022 • Jiachen Lian, Alan W Black, Louis Goldstein, Gopala Krishna Anumanchipalli

Most of the research on data-driven speech representation learning has focused on raw audios in an end-to-end manner, paying little attention to their internal phonological or gestural structure.

Representation Learning

Paper
Code

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet

2 code implementations • 29 Nov 2021 • Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W Black, Shinji Watanabe

However, there are few open source toolkits that can be used to generate reproducible results on different Spoken Language Understanding (SLU) benchmarks.

Spoken Language Understanding

7,871

Paper
Code

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

1 code implementation • 2 Nov 2021 • Peter Wu, Jiatong Shi, Yifan Zhong, Shinji Watanabe, Alan W Black

We demonstrate the effectiveness of our approach in language family classification, speech recognition, and speech synthesis tasks.

Cross-Lingual Transfer speech-recognition +2

Paper
Code

Switch Point biased Self-Training: Re-purposing Pretrained Models for Code-Switching

no code implementations • Findings (EMNLP) 2021 • Parul Chopra, Sai Krishna Rallabandi, Alan W Black, Khyathi Raghavi Chandu

Code-switching (CS), a ubiquitous phenomenon due to the ease of communication it offers in multilingual communities still remains an understudied problem in language processing.

NER POS +1

Paper
Add Code

Towards Language Modelling in the Speech Domain Using Sub-word Linguistic Units

no code implementations • 31 Oct 2021 • Anurag Katakkar, Alan W Black

For the English language, these LMs treat words as atomic units, which presents inherent challenges to language modelling in the speech domain.

Language Modelling Text Generation

Paper
Add Code

Intent Classification Using Pre-trained Language Agnostic Embeddings For Low Resource Languages

no code implementations • 18 Oct 2021 • Hemant Yadav, Akshat Gupta, Sai Krishna Rallabandi, Alan W Black, Rajiv Ratn Shah

We perform experiments across three different languages: English, Sinhala, and Tamil each with different data sizes to simulate high, medium, and low resource scenarios.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Paper
Add Code

Speech Summarization using Restricted Self-Attention

no code implementations • 12 Oct 2021 • Roshan Sharma, Shruti Palaskar, Alan W Black, Florian Metze

End-to-end modeling of speech summarization models is challenging due to memory and compute constraints arising from long input audio sequences.

Document Summarization speech-recognition +2

Paper
Add Code

Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy?

no code implementations • ACL 2021 • Abhilasha Ravichander, Alan W Black, Thomas Norton, Shomir Wilson, Norman Sadeh

Privacy plays a crucial role in preserving democratic ideals and personal autonomy.

Paper
Add Code

Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding

no code implementations • 29 Jun 2021 • Siddhant Arora, Alissa Ostapenko, Vijay Viswanathan, Siddharth Dalmia, Florian Metze, Shinji Watanabe, Alan W Black

Our splits identify performance gaps up to 10% between end-to-end systems that were within 1% of each other on the original test sets.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Mixing

1 code implementation • NAACL (CALCS) 2021 • Sai Muralidhar Jayanthi, Kavya Nerella, Khyathi Raghavi Chandu, Alan W Black

The NLP community has witnessed steep progress in a variety of tasks across the realms of monolingual and multilingual language processing recently.

Paper
Code

Grounding 'Grounding' in NLP

no code implementations • 4 Jun 2021 • Khyathi Raghavi Chandu, Yonatan Bisk, Alan W Black

And finally, (3) How to advance our current definition to bridge the gap with Cognitive Science?

Paper
Add Code

Focused Attention Improves Document-Grounded Generation

1 code implementation • NAACL 2021 • Shrimai Prabhumoye, Kazuma Hashimoto, Yingbo Zhou, Alan W Black, Ruslan Salakhutdinov

Document grounded generation is the task of using the information provided in a document to improve text generation.

Response Generation Text Generation

Paper
Code

Intent Recognition and Unsupervised Slot Identification for Low Resourced Spoken Dialog Systems

no code implementations • 3 Apr 2021 • Akshat Gupta, Olivia Deng, Akruti Kushwaha, Saloni Mittal, William Zeng, Sai Krishna Rallabandi, Alan W Black

We build a word-free natural language understanding module that does intent recognition and slot identification from these phonetic transcription.

Data Augmentation General Classification +5

Paper
Add Code

Unsupervised Self-Training for Sentiment Analysis of Code-Switched Data

no code implementations • NAACL (CALCS) 2021 • Akshat Gupta, Sargam Menghani, Sai Krishna Rallabandi, Alan W Black

We propose a general framework called Unsupervised Self-Training and show its applications for the specific use case of sentiment analysis of code-switched data.

Sentiment Analysis

Paper
Add Code

NoiseQA: Challenge Set Evaluation for User-Centric Question Answering

2 code implementations • EACL 2021 • Abhilasha Ravichander, Siddharth Dalmia, Maria Ryskina, Florian Metze, Eduard Hovy, Alan W Black

When Question-Answering (QA) systems are deployed in the real world, users query them through a variety of interfaces, such as speaking to voice assistants, typing questions into a search engine, or even translating questions to languages supported by the QA system.

Question Answering

Paper
Code

Automatically Identifying Language Family from Acoustic Examples in Low Resource Scenarios

1 code implementation • 1 Dec 2020 • Peter Wu, Yifan Zhong, Alan W Black

Existing multilingual speech NLP works focus on a relatively small subset of languages, and thus current linguistic understanding of languages predominantly stems from classical approaches.

Data Augmentation

Paper
Code

Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages

no code implementations • 7 Nov 2020 • Akshat Gupta, Xinjian Li, Sai Krishna Rallabandi, Alan W Black

With the aim of aiding development of spoken dialog systems in low resourced languages, we propose a novel acoustics based intent recognition system that uses discovered phonetic units for intent classification.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

Understanding Linguistic Accommodation in Code-Switched Human-Machine Dialogues

1 code implementation • CONLL 2020 • Tanmay Parekh, Emily Ahn, Yulia Tsvetkov, Alan W Black

Code-switching is a ubiquitous phenomenon in multilingual communities.

Paper
Code

Comparison of Interactive Knowledge Base Spelling Correction Models for Low-Resource Languages

no code implementations • 20 Oct 2020 • Yiyuan Li, Antonios Anastasopoulos, Alan W Black

In this work, we design a knowledge-base and prediction model embedded system for spelling correction in low-resource languages.

Spelling Correction

Paper
Add Code

Positioning yourself in the maze of Neural Text Generation: A Task-Agnostic Survey

no code implementations • 14 Oct 2020 • Khyathi Raghavi Chandu, Alan W Black

Neural text generation metamorphosed into several critical natural language applications ranging from text completion to free form narrative generation.

Image Captioning Machine Translation +3

Paper
Add Code

Case Study: Deontological Ethics in NLP

no code implementations • NAACL 2021 • Shrimai Prabhumoye, Brendon Boldt, Ruslan Salakhutdinov, Alan W Black

Recent work in natural language processing (NLP) has focused on ethical challenges such as understanding and mitigating bias in data and algorithms; identifying objectionable content like hate speech, stereotypes and offensive language; and building frameworks for better system design and data handling practices.

Ethics

Paper
Add Code

Mere account mein kitna balance hai? -- On building voice enabled Banking Services for Multilingual Communities

no code implementations • 9 Oct 2020 • Akshat Gupta, Sai Krishna Rallabandi, Alan W Black

Tremendous progress in speech and language processing has brought language technologies closer to daily human life.

Intent Recognition

Paper
Add Code

Zero-shot Learning for Speech Recognition with Universal Phonetic Model

no code implementations • 27 Sep 2018 • Xinjian Li, Siddharth Dalmia, David R. Mortensen, Florian Metze, Alan W Black

Our model is able to recognize unseen phonemes in the target language, if only a small text corpus is available.

speech-recognition Speech Recognition +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.