Search Results for author: Daniel Whitenack

Found 6 papers, 4 papers with code

Phone-ing it in: Towards Flexible Multi-Modal Language Model Training by Phonetic Representations of Data

1 code implementation ACL 2022 Colin Leong, Daniel Whitenack

However, many advances in language model pre-training are focused on text, a fact that only increases systematic inequalities in the performance of NLP tasks across the world’s languages.

Language Modelling named-entity-recognition +2

Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks

no code implementations26 Oct 2022 Colin Leong, Joshua Nemecek, Jacob Mansdorfer, Anna Filighera, Abraham Owodunni, Daniel Whitenack

We present Bloom Library, a linguistically diverse set of multimodal and multilingual datasets for language modeling, image captioning, visual storytelling, and speech synthesis/recognition.

Image Captioning Language Modelling +2

Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification

no code implementations4 Aug 2021 Sangeeta Ghangam, Daniel Whitenack, Joshua Nemecek

Running automatic speech recognition (ASR) on edge devices is non-trivial due to resource constraints, especially in scenarios that require supporting multiple languages.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.