TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Keyword Spotting	Google Speech Commands	MHAtt-RNN	Google Speech Commands V1 12	97.2	# 10
Keyword Spotting	Google Speech Commands	MHAtt-RNN	Google Speech Commands V2 12	98	# 9

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/streaming-keyword-spotting-on-mobile-devices/keyword-spotting-on-google-speech-commands)](https://paperswithcode.com/sota/keyword-spotting-on-google-speech-commands?p=streaming-keyword-spotting-on-mobile-devices)`

Streaming keyword spotting on mobile devices

14 May 2020 · Oleg Rybakov, Natasha Kononenko, Niranjan Subrahmanya, Mirko Visontai, Stella Laurenzo ·

In this work we explore the latency and accuracy of keyword spotting (KWS) models in streaming and non-streaming modes on mobile phones. NN model conversion from non-streaming mode (model receives the whole input sequence and then returns the classification result) to streaming mode (model receives portion of the input sequence and classifies it incrementally) may require manual model rewriting. We address this by designing a Tensorflow/Keras based library which allows automatic conversion of non-streaming models to streaming ones with minimum effort. With this library we benchmark multiple KWS models in both streaming and non-streaming modes on mobile phones and demonstrate different tradeoffs between latency and accuracy. We also explore novel KWS models with multi-head attention which reduce the classification error over the state-of-art by 10% on Google speech commands data sets V2. The streaming library with all experiments is open-sourced.

PDF Abstract

Code

Add Remove Mark official

google-research/google-research official

32,806

qute012/Pytorch-MHAtt-RNN-KWS

Arizona-Voice/blossom

Tasks

Add Remove

Datasets

Speech Commands

Results from the Paper

Add Remove

Ranked #10 on Keyword Spotting on Google Speech Commands

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Keyword Spotting	Google Speech Commands	MHAtt-RNN	Google Speech Commands V1 12	97.2	# 10	Compare
Keyword Spotting	Google Speech Commands	MHAtt-RNN	Google Speech Commands V2 12	98	# 9	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Streaming keyword spotting on mobile devices

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove