Training Keyword Spotters with Limited and Synthesized Speech Data

31 Jan 2020  ·  James Lin, Kevin Kilgour, Dominik Roblek, Matthew Sharifi ·

With the rise of low-power speech-enabled devices, there is a growing demand to quickly produce models for recognizing arbitrary sets of keywords. As with many machine learning tasks, one of the most challenging parts of the model creation process is obtaining a sufficient amount of training data. In this paper, we explore the effectiveness of synthesized speech data for training small spoken-term detection models of around 400k parameters. Instead of training such models directly on audio or low-level features such as MFCCs, we use a pre-trained speech embedding model trained to extract features useful for keyword spotting. Using this speech embedding, we show that a model which detects 10 keywords, when trained only on synthetic speech, performs on par with a model trained on over 500 real examples. We also show that a model without our speech embeddings would need to be trained on over 4000 real examples to reach the same accuracy.
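The core idea — fitting a small classification "head" on top of fixed pre-trained speech embeddings — can be sketched with a toy softmax head trained by gradient descent. This is not the paper's implementation: the embedding dimension (96), the synthetic Gaussian clusters standing in for real embedding vectors, and all hyperparameters below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
num_keywords, emb_dim, per_class = 10, 96, 50  # assumed toy sizes

# Stand-in for pre-trained speech embeddings: one Gaussian cluster
# of feature vectors per keyword (real embeddings would come from
# a frozen embedding network applied to audio).
centers = rng.normal(size=(num_keywords, emb_dim))
X = np.concatenate(
    [c + 0.3 * rng.normal(size=(per_class, emb_dim)) for c in centers]
)
y = np.repeat(np.arange(num_keywords), per_class)

# Small softmax "head": a single linear layer trained with
# full-batch gradient descent on the cross-entropy loss.
W = np.zeros((emb_dim, num_keywords))
b = np.zeros(num_keywords)
lr = 0.5
for _ in range(200):
    logits = X @ W + b
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(logits)
    p /= p.sum(axis=1, keepdims=True)
    p[np.arange(len(y)), y] -= 1.0  # gradient of cross-entropy w.r.t. logits
    W -= lr * (X.T @ p) / len(y)
    b -= lr * p.mean(axis=0)

acc = (np.argmax(X @ W + b, axis=1) == y).mean()
print(f"training accuracy: {acc:.2f}")
```

Because the embedding network is frozen, only the head's parameters are learned, which is why far fewer labeled (or synthesized) examples suffice than when training from raw audio.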


Results from the Paper


Ranked #10 on Keyword Spotting on Google Speech Commands (Google Speech Commands V2 12 metric)

| Task             | Dataset                | Model                  | Metric Name                  | Metric Value | Global Rank |
|------------------|------------------------|------------------------|------------------------------|--------------|-------------|
| Keyword Spotting | Google Speech Commands | Embedding + Head       | Google Speech Commands V2 12 | 97.7         | #10         |
| Keyword Spotting | Google Speech Commands | Head without Embedding | Google Speech Commands V2 12 | 97.4         | #12         |
