Search Results for author: Thomas Niesler

Found 27 papers, 3 papers with code

A Hybrid CNN-BiLSTM Voice Activity Detector

1 code implementation • 5 Mar 2021 • Nicholas Wilkinson, Thomas Niesler

We find that significantly smaller models with near optimal parameters perform on par with larger models trained with optimal parameters.

Ranked #1 on Activity Detection on AVA-Speech (ROC-AUC metric)

Action Detection Activity Detection +1

Paper
Code

Fine-Tuned Self-Supervised Speech Representations for Language Diarization in Multilingual Code-Switched Speech

1 code implementation • 15 Dec 2023 • Geoffrey Frost, Emily Morris, Joshua Jansen van Vüren, Thomas Niesler

Annotating a multilingual code-switched corpus is a painstaking process requiring specialist linguistic expertise.

Paper
Code

Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

no code implementations • 25 Jun 2018 • Raghav Menon, Herman Kamper, John Quinn, Thomas Niesler

While the resulting CNN keyword spotter cannot match the performance of the DTW-based system, it substantially outperforms a CNN classifier trained only on the keywords, improving the area under the ROC curve from 0. 54 to 0. 64.

Dynamic Time Warping Humanitarian +2

Paper
Add Code

Automatic Speech Recognition for Humanitarian Applications in Somali

no code implementations • 23 Jul 2018 • Raghav Menon, Astik Biswas, Armin Saeb, John Quinn, Thomas Niesler

We present our first efforts in building an automatic speech recognition system for Somali, an under-resourced language, using 1. 57 hrs of annotated speech for acoustic model training.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

ASR-free CNN-DTW keyword spotting using multilingual bottleneck features for almost zero-resource languages

no code implementations • 23 Jul 2018 • Raghav Menon, Herman Kamper, Emre Yilmaz, John Quinn, Thomas Niesler

We consider multilingual bottleneck features (BNFs) for nearly zero-resource keyword spotting.

Dynamic Time Warping Humanitarian +2

Paper
Add Code

Building a Unified Code-Switching ASR System for South African Languages

no code implementations • 28 Jul 2018 • Emre Yilmaz, Astik Biswas, Ewald van der Westhuizen, Febe De Wet, Thomas Niesler

We present our first efforts towards building a single multilingual automatic speech recognition (ASR) system that can process code-switching (CS) speech in five languages spoken within the same population.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Cluster Size Management in Multi-Stage Agglomerative Hierarchical Clustering of Acoustic Speech Segments

no code implementations • 30 Oct 2018 • Lerato Lerato, Thomas Niesler

Agglomerative hierarchical clustering (AHC) requires only the similarity between objects to be known.

Clustering Management

Paper
Add Code

Feature Trajectory Dynamic Time Warping for Clustering of Speech Segments

no code implementations • 30 Oct 2018 • Lerato Lerato, Thomas Niesler

Dynamic time warping (DTW) can be used to compute the similarity between two sequences of generally differing length.

Clustering Dynamic Time Warping

Paper
Add Code

A First South African Corpus of Multilingual Code-switched Soap Opera Speech

no code implementations • LREC 2018 • Ewald van der Westhuizen, Thomas Niesler

Language Modelling

Paper
Add Code

Semi-supervised acoustic model training for five-lingual code-switched ASR

no code implementations • 20 Jun 2019 • Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

Furthermore, because English is common to all language pairs in our data, it dominates when training a unified language model, leading to improved English ASR performance at the expense of the other languages.

Acoustic Modelling Language Modelling

Paper
Add Code

Improved low-resource Somali speech recognition by semi-supervised acoustic and language model training

no code implementations • 6 Jul 2019 • Astik Biswas, Raghav Menon, Ewald van der Westhuizen, Thomas Niesler

The automatic transcriptions from the best performing pass were used for language model augmentation.

Acoustic Modelling Automatic Speech Recognition +5

Paper
Add Code

Feature exploration for almost zero-resource ASR-free keyword spotting using a multilingual bottleneck extractor and correspondence autoencoders

no code implementations • 14 Nov 2018 • Raghav Menon, Herman Kamper, Ewald van der Westhuizen, John Quinn, Thomas Niesler

We compare features for dynamic time warping (DTW) when used to bootstrap keyword spotting (KWS) in an almost zero-resource setting.

Dynamic Time Warping Humanitarian +1

Paper
Add Code

Semi-supervised Development of ASR Systems for Multilingual Code-switched Speech in Under-resourced Languages

no code implementations • LREC 2020 • Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

This paper reports on the semi-supervised development of acoustic and language models for under-resourced, code-switched speech in five South African languages.

Paper
Add Code

Semi-supervised acoustic and language model training for English-isiZulu code-switched speech recognition

no code implementations • LREC 2020 • Astik Biswas, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

We present an analysis of semi-supervised acoustic and language model training for English-isiZulu code-switched (CS) ASR using soap opera speech.

Language Modelling speech-recognition +1

Paper
Add Code

Semi-supervised Acoustic Modelling for Five-lingual Code-switched ASR using Automatically-segmented Soap Opera Speech

no code implementations • LREC 2020 • Nick Wilkinson, Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

Automatic segmentation was applied in combination with automaticspeaker diarization.

Acoustic Modelling Action Detection +4

Paper
Add Code

Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages

1 code implementation • 31 Oct 2020 • Trideba Padhi, Astik Biswas, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

In this work, we explore the benefits of using multilingual bottleneck features (mBNF) in acoustic modelling for the automatic speech recognition of code-switched (CS) speech in African languages.

Acoustic Modelling Automatic Speech Recognition +2

Paper
Code

COVID-19 Cough Classification using Machine Learning and Global Smartphone Recordings

no code implementations • 2 Dec 2020 • Madhurananda Pahar, Marisa Klopper, Robin Warren, Thomas Niesler

We present a machine learning based COVID-19 cough classifier which can discriminate COVID-19 positive coughs from both COVID-19 negative and healthy coughs recorded on a smartphone.

Audio Classification BIG-bench Machine Learning +1

Paper
Add Code

Deep Neural Network based Cough Detection using Bed-mounted Accelerometer Measurements

no code implementations • 9 Feb 2021 • Madhurananda Pahar, Igor Miranda, Andreas Diacon, Thomas Niesler

Since the need to gather audio is avoided and therefore privacy is inherently protected, and since the accelerometer is attached to the bed and not worn, this form of monitoring may represent a more convenient and readily accepted method of long-term patient cough monitoring.

Paper
Add Code

Automatic Cough Classification for Tuberculosis Screening in a Real-World Environment

no code implementations • 23 Mar 2021 • Madhurananda Pahar, Marisa Klopper, Byron Reeve, Grant Theron, Rob Warren, Thomas Niesler

Objective: The automatic discrimination between the coughing sounds produced by patients with tuberculosis (TB) and those produced by patients with other lung ailments.

feature selection General Classification +1

Paper
Add Code

COVID-19 Detection in Cough, Breath and Speech using Deep Transfer Learning and Bottleneck Features

no code implementations • 2 Apr 2021 • Madhurananda Pahar, Marisa Klopper, Robin Warren, Thomas Niesler

We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech.

Audio Classification BIG-bench Machine Learning +1

Paper
Add Code

Feature learning for efficient ASR-free keyword spotting in low-resource languages

no code implementations • 13 Aug 2021 • Ewald van der Westhuizen, Herman Kamper, Raghav Menon, John Quinn, Thomas Niesler

We show that, using these features, the CNN-DTW keyword spotter performs almost as well as the DTW keyword spotter while outperforming a baseline CNN trained only on the keyword templates.

Dynamic Time Warping Humanitarian +1

Paper
Add Code

Multilingual training set selection for ASR in under-resourced Malian languages

no code implementations • 13 Aug 2021 • Ewald van der Westhuizen, Trideba Padhi, Thomas Niesler

We find that, although maximising the training pool by including all six additional languages provides improved speech recognition in both target languages, substantially better performance can be achieved by a more judicious choice.

Humanitarian speech-recognition +1

Paper
Add Code

Automatic non-invasive Cough Detection based on Accelerometer and Audio Signals

no code implementations • 31 Aug 2021 • Madhurananda Pahar, Igor Miranda, Andreas Diacon, Thomas Niesler

We present an automatic non-invasive way of detecting cough events based on both accelerometer and audio signals.

Paper
Add Code

Wake-Cough: cough spotting and cougher identification for personalised long-term cough monitoring

no code implementations • 7 Oct 2021 • Madhurananda Pahar, Marisa Klopper, Byron Reeve, Rob Warren, Grant Theron, Andreas Diacon, Thomas Niesler

We present `wake-cough', an application of wake-word spotting to coughs using a Resnet50 and the identification of coughers using i-vectors, for the purpose of a long-term, personalised cough monitoring system.

Paper
Add Code

Accelerometer-based Bed Occupancy Detection for Automatic, Non-invasive Long-term Cough Monitoring

no code implementations • 8 Feb 2022 • Madhurananda Pahar, Igor Miranda, Andreas Diacon, Thomas Niesler

When integrated into a complete cough monitoring system, the daily cough rate of a patient undergoing TB treatment was determined over a period of 14 days.

Paper
Add Code

Automatic Tuberculosis and COVID-19 cough classification using deep learning

no code implementations • 11 May 2022 • Madhurananda Pahar, Marisa Klopper, Byron Reeve, Rob Warren, Grant Theron, Andreas Diacon, Thomas Niesler

This cough data include 1. 68 hours of TB coughs, 18. 54 minutes of COVID-19 coughs and 1. 69 hours of healthy coughs from 47 TB patients, 229 COVID-19 patients and 1498 healthy patients and were used to train and evaluate a CNN, LSTM and Resnet50.

Audio Classification Transfer Learning

Paper
Add Code

TB or not TB? Acoustic cough analysis for tuberculosis classification

no code implementations • 2 Sep 2022 • Geoffrey Frost, Grant Theron, Thomas Niesler

In this work, we explore recurrent neural network architectures for tuberculosis (TB) cough classification.

feature selection Style Transfer

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.