Search Results for author: Thomas Niesler

Found 27 papers, 3 papers with code

A Hybrid CNN-BiLSTM Voice Activity Detector

1 code implementation5 Mar 2021 Nicholas Wilkinson, Thomas Niesler

We find that significantly smaller models with near optimal parameters perform on par with larger models trained with optimal parameters.

 Ranked #1 on Activity Detection on AVA-Speech (ROC-AUC metric)

Action Detection Activity Detection +1

Fine-Tuned Self-Supervised Speech Representations for Language Diarization in Multilingual Code-Switched Speech

1 code implementation15 Dec 2023 Geoffrey Frost, Emily Morris, Joshua Jansen van Vüren, Thomas Niesler

Annotating a multilingual code-switched corpus is a painstaking process requiring specialist linguistic expertise.

Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

no code implementations25 Jun 2018 Raghav Menon, Herman Kamper, John Quinn, Thomas Niesler

While the resulting CNN keyword spotter cannot match the performance of the DTW-based system, it substantially outperforms a CNN classifier trained only on the keywords, improving the area under the ROC curve from 0. 54 to 0. 64.

Dynamic Time Warping Humanitarian +2

Automatic Speech Recognition for Humanitarian Applications in Somali

no code implementations23 Jul 2018 Raghav Menon, Astik Biswas, Armin Saeb, John Quinn, Thomas Niesler

We present our first efforts in building an automatic speech recognition system for Somali, an under-resourced language, using 1. 57 hrs of annotated speech for acoustic model training.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Building a Unified Code-Switching ASR System for South African Languages

no code implementations28 Jul 2018 Emre Yilmaz, Astik Biswas, Ewald van der Westhuizen, Febe De Wet, Thomas Niesler

We present our first efforts towards building a single multilingual automatic speech recognition (ASR) system that can process code-switching (CS) speech in five languages spoken within the same population.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Feature Trajectory Dynamic Time Warping for Clustering of Speech Segments

no code implementations30 Oct 2018 Lerato Lerato, Thomas Niesler

Dynamic time warping (DTW) can be used to compute the similarity between two sequences of generally differing length.

Clustering Dynamic Time Warping

Semi-supervised acoustic model training for five-lingual code-switched ASR

no code implementations20 Jun 2019 Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

Furthermore, because English is common to all language pairs in our data, it dominates when training a unified language model, leading to improved English ASR performance at the expense of the other languages.

Acoustic Modelling Language Modelling

Semi-supervised Development of ASR Systems for Multilingual Code-switched Speech in Under-resourced Languages

no code implementations LREC 2020 Astik Biswas, Emre Yilmaz, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

This paper reports on the semi-supervised development of acoustic and language models for under-resourced, code-switched speech in five South African languages.

Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages

1 code implementation31 Oct 2020 Trideba Padhi, Astik Biswas, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

In this work, we explore the benefits of using multilingual bottleneck features (mBNF) in acoustic modelling for the automatic speech recognition of code-switched (CS) speech in African languages.

Acoustic Modelling Automatic Speech Recognition +2

COVID-19 Cough Classification using Machine Learning and Global Smartphone Recordings

no code implementations2 Dec 2020 Madhurananda Pahar, Marisa Klopper, Robin Warren, Thomas Niesler

We present a machine learning based COVID-19 cough classifier which can discriminate COVID-19 positive coughs from both COVID-19 negative and healthy coughs recorded on a smartphone.

Audio Classification BIG-bench Machine Learning +1

Deep Neural Network based Cough Detection using Bed-mounted Accelerometer Measurements

no code implementations9 Feb 2021 Madhurananda Pahar, Igor Miranda, Andreas Diacon, Thomas Niesler

Since the need to gather audio is avoided and therefore privacy is inherently protected, and since the accelerometer is attached to the bed and not worn, this form of monitoring may represent a more convenient and readily accepted method of long-term patient cough monitoring.

Automatic Cough Classification for Tuberculosis Screening in a Real-World Environment

no code implementations23 Mar 2021 Madhurananda Pahar, Marisa Klopper, Byron Reeve, Grant Theron, Rob Warren, Thomas Niesler

Objective: The automatic discrimination between the coughing sounds produced by patients with tuberculosis (TB) and those produced by patients with other lung ailments.

feature selection General Classification +1

COVID-19 Detection in Cough, Breath and Speech using Deep Transfer Learning and Bottleneck Features

no code implementations2 Apr 2021 Madhurananda Pahar, Marisa Klopper, Robin Warren, Thomas Niesler

We present an experimental investigation into the effectiveness of transfer learning and bottleneck feature extraction in detecting COVID-19 from audio recordings of cough, breath and speech.

Audio Classification BIG-bench Machine Learning +1

Feature learning for efficient ASR-free keyword spotting in low-resource languages

no code implementations13 Aug 2021 Ewald van der Westhuizen, Herman Kamper, Raghav Menon, John Quinn, Thomas Niesler

We show that, using these features, the CNN-DTW keyword spotter performs almost as well as the DTW keyword spotter while outperforming a baseline CNN trained only on the keyword templates.

Dynamic Time Warping Humanitarian +1

Multilingual training set selection for ASR in under-resourced Malian languages

no code implementations13 Aug 2021 Ewald van der Westhuizen, Trideba Padhi, Thomas Niesler

We find that, although maximising the training pool by including all six additional languages provides improved speech recognition in both target languages, substantially better performance can be achieved by a more judicious choice.

Humanitarian speech-recognition +1

Automatic non-invasive Cough Detection based on Accelerometer and Audio Signals

no code implementations31 Aug 2021 Madhurananda Pahar, Igor Miranda, Andreas Diacon, Thomas Niesler

We present an automatic non-invasive way of detecting cough events based on both accelerometer and audio signals.

Wake-Cough: cough spotting and cougher identification for personalised long-term cough monitoring

no code implementations7 Oct 2021 Madhurananda Pahar, Marisa Klopper, Byron Reeve, Rob Warren, Grant Theron, Andreas Diacon, Thomas Niesler

We present `wake-cough', an application of wake-word spotting to coughs using a Resnet50 and the identification of coughers using i-vectors, for the purpose of a long-term, personalised cough monitoring system.

Accelerometer-based Bed Occupancy Detection for Automatic, Non-invasive Long-term Cough Monitoring

no code implementations8 Feb 2022 Madhurananda Pahar, Igor Miranda, Andreas Diacon, Thomas Niesler

When integrated into a complete cough monitoring system, the daily cough rate of a patient undergoing TB treatment was determined over a period of 14 days.

Automatic Tuberculosis and COVID-19 cough classification using deep learning

no code implementations11 May 2022 Madhurananda Pahar, Marisa Klopper, Byron Reeve, Rob Warren, Grant Theron, Andreas Diacon, Thomas Niesler

This cough data include 1. 68 hours of TB coughs, 18. 54 minutes of COVID-19 coughs and 1. 69 hours of healthy coughs from 47 TB patients, 229 COVID-19 patients and 1498 healthy patients and were used to train and evaluate a CNN, LSTM and Resnet50.

Audio Classification Transfer Learning

TB or not TB? Acoustic cough analysis for tuberculosis classification

no code implementations2 Sep 2022 Geoffrey Frost, Grant Theron, Thomas Niesler

In this work, we explore recurrent neural network architectures for tuberculosis (TB) cough classification.

feature selection Style Transfer

Cannot find the paper you are looking for? You can Submit a new open access paper.