no code implementations • 15 Nov 2023 • Mohammad Amaan Sayeed, Hanan Aldarmaki
In addition, previous works relied on the simplifying assumptions of perfect word segmentation and clustering by word type.
1 code implementation • 15 Nov 2023 • Sara Shatnawi, Sawsan Alqahtani, Hanan Aldarmaki
Automatic text-based diacritic restoration models generally have high diacritic error rates when applied to speech transcripts as a result of domain and style shifts in spoken language.
1 code implementation • 25 Oct 2023 • Hawau Olamide Toyin, Amirbek Djanibekov, Ajinkya Kulkarni, Hanan Aldarmaki
We present ArTST, a pre-trained Arabic text and speech transformer for supporting open-source speech technologies for the Arabic language.
Automatic Speech Recognition (ASR) +4
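As a rough illustration of how a pre-trained checkpoint like this is typically consumed, here is a minimal sketch using the HuggingFace `transformers` ASR pipeline; the model identifier is a hypothetical placeholder, not the actual ArTST repository name.

```python
# Hedged sketch: decoding Arabic speech with a pre-trained checkpoint via
# the HuggingFace pipeline API. "org/artst-asr" is a HYPOTHETICAL model ID;
# substitute the checkpoint actually released by the ArTST authors.
from transformers import pipeline

asr = pipeline("automatic-speech-recognition", model="org/artst-asr")
result = asr("arabic_utterance.wav")  # expects 16 kHz mono audio
print(result["text"])
```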
no code implementations • 20 Oct 2023 • Ajinkya Kulkarni, Hanan Aldarmaki
We explore two architectural variations: ResNet and ECAPA-TDNN, coupled with two types of acoustic features: MFCCs and features extracted from the pre-trained self-supervised model UniSpeech-SAT Large, as well as a fusion of all four variants.
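For context, here is a minimal sketch of the MFCC front end mentioned above, using `torchaudio`; the parameter values (40 coefficients, 25 ms windows at 16 kHz) are illustrative assumptions, not the paper's exact configuration.

```python
# Minimal MFCC extraction sketch with torchaudio; parameters are
# illustrative, not the configuration used in the paper.
import torchaudio

waveform, sample_rate = torchaudio.load("utterance.wav")  # (channels, samples)

mfcc = torchaudio.transforms.MFCC(
    sample_rate=sample_rate,
    n_mfcc=40,
    melkwargs={"n_fft": 400, "hop_length": 160, "n_mels": 64},
)
features = mfcc(waveform)  # shape: (channels, n_mfcc, frames)
print(features.shape)
```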
1 code implementation • 11 Oct 2023 • Atharva Kulkarni, Ajinkya Kulkarni, Miguel Couceiro, Hanan Aldarmaki
Recently, large pre-trained multilingual speech models have shown potential in scaling Automatic Speech Recognition (ASR) to many low-resource languages.
Automatic Speech Recognition (ASR) +1
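To make the setting concrete, the sketch below loads one such multilingual pre-trained model family (wav2vec 2.0 / XLS-R) and decodes an utterance with greedy CTC; the checkpoint name is a placeholder for a fine-tuned model, and this is a generic illustration rather than the paper's pipeline.

```python
# Illustrative only: greedy CTC decoding with a multilingual wav2vec 2.0
# model via HuggingFace transformers. "org/xlsr-finetuned-asr" is a
# PLACEHOLDER; a checkpoint fine-tuned on the target language is required.
import torch
import soundfile as sf
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

processor = Wav2Vec2Processor.from_pretrained("org/xlsr-finetuned-asr")
model = Wav2Vec2ForCTC.from_pretrained("org/xlsr-finetuned-asr")

audio, sr = sf.read("utterance.wav")  # 1-D float array, ideally 16 kHz
inputs = processor(audio, sampling_rate=sr, return_tensors="pt")
with torch.no_grad():
    logits = model(inputs.input_values).logits
pred_ids = torch.argmax(logits, dim=-1)
print(processor.batch_decode(pred_ids)[0])
```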
no code implementations • 23 May 2023 • Maha Tufail Agro, Hanan Aldarmaki
Label noise refers to errors in training labels caused by cheap data annotation methods, such as web scraping or crowd-sourcing, which can be detrimental to the performance of supervised classifiers.
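A minimal sketch of symmetric (uniform) label noise, the standard way such corruption is simulated when studying classifier robustness; the noise rate and class count below are arbitrary for illustration.

```python
# Symmetric label-noise injection: flip a fraction of labels to a
# different, uniformly chosen class. Values here are illustrative.
import numpy as np

def inject_label_noise(labels, num_classes, noise_rate, seed=0):
    """Flip a `noise_rate` fraction of labels to a different random class."""
    rng = np.random.default_rng(seed)
    labels = labels.copy()
    flip = rng.random(len(labels)) < noise_rate
    # Offsets in [1, num_classes-1] guarantee the flipped label changes.
    offsets = rng.integers(1, num_classes, size=flip.sum())
    labels[flip] = (labels[flip] + offsets) % num_classes
    return labels

clean = np.array([0, 1, 2, 1, 0, 2, 1, 0])
noisy = inject_label_noise(clean, num_classes=3, noise_rate=0.25)
print(clean, noisy, sep="\n")
```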
no code implementations • 28 Feb 2023 • Ajinkya Kulkarni, Atharva Kulkarni, Sara Abedalmonem Mohammad Shatnawi, Hanan Aldarmaki
To help fill this gap in resources, we present a speech corpus for Classical Arabic Text-to-Speech (ClArTTS) to support the development of end-to-end TTS systems for Arabic.
no code implementations • 27 Feb 2023 • Hanan Aldarmaki, Ahmad Ghannam
We present an analysis of diacritic recognition performance in Arabic Automatic Speech Recognition (ASR) systems.
Automatic Speech Recognition (ASR) +1
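A small utility that this kind of error analysis relies on is separating diacritics from base letters: Arabic diacritics occupy a contiguous block of Unicode combining marks (U+064B to U+0652), so diacritic errors can be scored independently of base-letter errors. A minimal sketch:

```python
# Strip Arabic diacritics (Unicode combining marks U+064B-U+0652) so that
# diacritic and base-letter errors can be evaluated separately.
import re

DIACRITICS = re.compile(r"[\u064B-\u0652]")

def strip_diacritics(text: str) -> str:
    return DIACRITICS.sub("", text)

hypothesis = "كَتَبَ"   # fully diacritized ASR output
print(strip_diacritics(hypothesis))  # -> "كتب" (base letters only)
```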
1 code implementation • 3 Jan 2023 • Sreepratha Ram, Hanan Aldarmaki
In speech recognition, it is essential to model the phonetic content of the input signal while discarding irrelevant factors such as speaker variations and noise, which is challenging in low-resource settings.
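As a loose illustration of factoring out speaker variation (a classical normalization trick, not the method proposed in this paper), per-utterance cepstral mean and variance normalization (CMVN) standardizes each feature dimension:

```python
# Illustrative only, not the paper's method: per-utterance CMVN, a
# classical way to reduce speaker and channel variation in acoustic
# features before modeling phonetic content.
import numpy as np

def cmvn(features: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    """Normalize a (frames, dims) feature matrix to zero mean, unit variance."""
    mean = features.mean(axis=0, keepdims=True)
    std = features.std(axis=0, keepdims=True)
    return (features - mean) / (std + eps)

feats = np.random.randn(200, 40) * 3.0 + 1.5  # stand-in for 40-dim MFCCs
print(cmvn(feats).mean(axis=0).round(6))       # ~0 per dimension
```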
no code implementations • 9 Jun 2021 • Hanan Aldarmaki, Asad Ullah, Nazar Zaki
Automatic Speech Recognition (ASR) systems can be trained to achieve remarkable performance given large amounts of manually transcribed speech, but large labeled data sets can be difficult or expensive to acquire for all languages of interest.
Automatic Speech Recognition (ASR) +1
no code implementations • WS 2019 • Sawsan Alqahtani, Hanan Aldarmaki, Mona Diab
Diacritic restoration could theoretically help disambiguate these words, but in practice, the increase in overall sparsity leads to performance degradation in NLP applications.
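The sparsity effect is easy to see concretely: several diacritized readings collapse onto one undiacritized skeleton, so restoring diacritics multiplies the number of surface types. A toy sketch (the three illustrative forms share the skeleton كتب):

```python
# Toy demonstration of the sparsity increase caused by diacritic
# restoration: distinct diacritized readings map to one bare skeleton.
import re

DIACRITICS = re.compile(r"[\u064B-\u0652]")

diacritized = ["كَتَبَ", "كُتُبٌ", "كَتْب"]  # three readings of one skeleton
undiacritized = [DIACRITICS.sub("", w) for w in diacritized]

print(len(set(diacritized)))    # 3 types after restoration
print(len(set(undiacritized)))  # 1 type without diacritics
```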
1 code implementation • IJCNLP 2019 • Nada Almarwani, Hanan Aldarmaki, Mona Diab
Vector averaging remains one of the most popular sentence embedding methods in spite of its obvious disregard for syntactic structure.
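For reference, the baseline in question is essentially a one-liner: a sentence embedding is the unweighted mean of its word vectors, which indeed discards all word order. A minimal sketch with toy 4-dimensional vectors standing in for pre-trained embeddings:

```python
# Vector-averaging sentence embedding: the mean of the word vectors.
# Toy 4-dimensional vectors stand in for real pre-trained embeddings.
import numpy as np

word_vectors = {
    "syntax":  np.array([0.1, 0.3, -0.2, 0.5]),
    "matters": np.array([0.4, -0.1, 0.0, 0.2]),
    "little":  np.array([-0.3, 0.2, 0.1, 0.0]),
}

def average_embedding(tokens, vectors):
    """Mean of the known word vectors; ignores out-of-vocabulary tokens."""
    known = [vectors[t] for t in tokens if t in vectors]
    return np.mean(known, axis=0)

print(average_embedding("syntax matters little".split(), word_vectors))
```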
no code implementations • SEMEVAL 2019 • Hanan Aldarmaki, Mona Diab
We develop and investigate several cross-lingual alignment approaches for neural sentence embedding models, such as the supervised inference classifier, InferSent, and sequential encoder-decoder models.
1 code implementation • NAACL 2019 • Hanan Aldarmaki, Mona Diab
Cross-lingual word vectors are typically obtained by fitting an orthogonal matrix that maps the entries of a bilingual dictionary from a source to a target vector space.
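The closed-form solution this refers to is the orthogonal Procrustes problem: given dictionary-aligned matrices X (source) and Y (target), the orthogonal W minimizing ||XW - Y||_F is W = UV^T, where U S V^T is the SVD of X^T Y. A sketch with random placeholder matrices standing in for real dictionary entries:

```python
# Orthogonal Procrustes fit for cross-lingual word vector alignment.
# X and Y are random placeholders for dictionary-aligned embeddings.
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((1000, 300))  # source-language vectors for dictionary pairs
Y = rng.standard_normal((1000, 300))  # corresponding target-language vectors

U, _, Vt = np.linalg.svd(X.T @ Y)
W = U @ Vt  # orthogonal map: source space -> target space

mapped = X @ W
assert np.allclose(W @ W.T, np.eye(300), atol=1e-6)  # W is orthogonal
```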
1 code implementation • COLING 2018 • Hanan Aldarmaki, Mona Diab
We evaluated various compositional models, from bag-of-words representations to compositional RNN-based models, on several extrinsic supervised and unsupervised evaluation benchmarks.
no code implementations • TACL 2018 • Hanan Aldarmaki, Mahesh Mohan, Mona Diab
We show empirically that the performance of bilingual correspondents learned using our proposed unsupervised method is comparable to that of using supervised bilingual correspondents from a seed dictionary.