Neural Architecture Search For Keyword Spotting

1 Sep 2020  ·  Tong Mo, Yakun Yu, Mohammad Salameh, Di Niu, Shangling Jui ·

Deep neural networks have recently become a popular solution to keyword spotting systems, which enable the control of smart devices via voice. In this paper, we apply neural architecture search to search for convolutional neural network models that can help boost the performance of keyword spotting based on features extracted from acoustic signals while maintaining an acceptable memory footprint. Specifically, we use differentiable architecture search techniques to search for operators and their connections in a predefined cell search space. The found cells are then scaled up in both depth and width to achieve competitive performance. We evaluated the proposed method on Google's Speech Commands Dataset and achieved a state-of-the-art accuracy of over 97% on the setting of 12-class utterance classification commonly reported in the literature.

PDF Abstract

Datasets


Results from the Paper


 Ranked #1 on Keyword Spotting on Google Speech Commands (Google Speech Commands V1 6 metric)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Keyword Spotting Google Speech Commands NAS2 Google Speech Commands V1 6 97.22 # 1
Keyword Spotting Google Speech Commands NAS1 Google Speech Commands V1 12 97.06 # 11

Methods


No methods listed for this paper. Add relevant methods here