Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

被引:1
作者
Peter, David [1 ]
Roth, Wolfgang [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria
来源
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年
基金
奥地利科学基金会;
关键词
keyword spotting; neural architecture search; weight quantization;
D O I
10.1109/ICPR48806.2021.9413191
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces neural architecture search (NAS) for the automatic discovery of small models for keyword spotting (KWS) in limited resource environments. We employ a differentiable NAS approach to optimize the structure of convolutional neural networks (CNNs) to maximize the classification accuracy while minimizing the number of operations per inference. Using NAS only, we were able to obtain a highly efficient model with 95.4% accuracy on the Google speech commands dataset with 494.8 kB of memory usage and 19.6 million operations. Additionally, weight quantization is used to reduce the memory consumption even further. We show that weight quantization to low bit-widths (e.g. 1 bit) can be used without substantial loss in accuracy. By increasing the number of input features from 10 MFCC to 20 MFCC we were able to increase the accuracy to 963% at 340.1 kB of memory usage and 27.1 million operations.
引用
收藏
页码:9273 / 9279
页数:7
相关论文
共 32 条
[1]   Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting [J].
Arik, Sercan O. ;
Kliegl, Markus ;
Child, Rewon ;
Hestness, Joel ;
Gibiansky, Andrew ;
Fougner, Chris ;
Prenger, Ryan ;
Coates, Adam .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :1606-1610
[2]  
Bengio Y., 2013, CoRR abs/1308.3432
[3]  
Cai H., 2018, PR MACH LEARN RES
[4]  
Cai Han, 2019, INT C LEARN REPR ICL
[5]  
Chen G, 2014, CHIN CONTR CONF, P1087, DOI 10.1109/ChiCC.2014.6896779
[6]  
Hinton G., 2012, COURS VID LECT
[7]  
Hubara Itay, 2016, P ADV NEUR INF PROC, P4107
[8]  
Ioffe S, 2015, PR MACH LEARN RES, V37, P448
[9]  
Lai L, 2017, ARXIV PREPRINT ARXIV
[10]   Progressive Neural Architecture Search [J].
Liu, Chenxi ;
Zoph, Barret ;
Neumann, Maxim ;
Shlens, Jonathon ;
Hua, Wei ;
Li, Li-Jia ;
Li Fei-Fei ;
Yuille, Alan ;
Huang, Jonathan ;
Murphy, Kevin .
COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 :19-35