Keyword Spotting

95 papers with code • 10 benchmarks • 8 datasets

In speech processing, keyword spotting deals with the identification of keywords in utterances.

( Image credit: Simon Grest )

Libraries

Use these libraries to find Keyword Spotting models and implementations

Latest papers with no code

TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer

no code yet • 20 Mar 2024

Designing an efficient keyword spotting (KWS) system that delivers exceptional performance on resource-constrained edge devices has long been a subject of significant attention.

More than words: Advancements and challenges in speech recognition for singing

no code yet • 14 Mar 2024

This paper addresses the challenges and advancements in speech recognition for singing, a domain distinctly different from standard speech recognition.

Boosting keyword spotting through on-device learnable user speech characteristics

no code yet • 12 Mar 2024

Keyword spotting systems for always-on TinyML-constrained applications require on-site tuning to boost the accuracy of offline trained classifiers when deployed in unseen inference conditions.

On-Device Domain Learning for Keyword Spotting on Low-Power Extreme Edge Embedded Systems

no code yet • 12 Mar 2024

Keyword spotting accuracy degrades when neural networks are exposed to noisy environments.

A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement

no code yet • 3 Mar 2024

Self-supervised learned models have been found to be very effective for certain speech tasks such as automatic speech recognition, speaker identification, keyword spotting and others.

Multilingual acoustic word embeddings for zero-resource languages

no code yet • 19 Jan 2024

This research addresses the challenge of developing speech applications for zero-resource languages that lack labelled data.

Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech

no code yet • 12 Jan 2024

Furthermore, experiments on the continuous speech dataset LibriSpeech demonstrate that, by incorporating audio discrimination, CLAD achieves significant performance gain over CL without audio discrimination.

Maximum-Entropy Adversarial Audio Augmentation for Keyword Spotting

no code yet • 12 Jan 2024

Data augmentation is a key tool for improving the performance of deep networks, particularly when there is limited labeled data.

U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

no code yet • 15 Dec 2023

Open-vocabulary keyword spotting (KWS), which allows users to customize keywords, has attracted increasingly more interest.

Keyword spotting -- Detecting commands in speech using deep learning

no code yet • 9 Dec 2023

Speech recognition has become an important task in the development of machine learning and artificial intelligence.