Neural Architecture Search For Keyword Spotting

被引:16
|
作者
Mo, Tong [1 ]
Yu, Yakun [1 ]
Salameh, Mohammad [2 ]
Niu, Di [1 ]
Jui, Shangling [2 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Huawei Technol, Shenzhen, Peoples R China
来源
INTERSPEECH 2020 | 2020年
关键词
Keyword Spotting; Neural Architecture Search;
D O I
10.21437/Interspeech.2020-3132
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Deep neural networks have recently become a popular solution to keyword spotting systems, which enable the control of smart devices via voice. In this paper, we apply neural architecture search to search for convolutional neural network models that can help boost the performance of keyword spotting based on features extracted from acoustic signals while maintaining an acceptable memory footprint. Specifically, we use differentiable architecture search techniques to search for operators and their connections in a predefined cell search space. The found cells are then scaled up in both depth and width to achieve competitive performance. We evaluated the proposed method on Google's Speech Commands Dataset and achieved a state-of-the-art accuracy of over 97% on the setting of 12-class utterance classification commonly reported in the literature.
引用
收藏
页码:1982 / 1986
页数:5
相关论文
共 50 条
  • [21] Efficient keyword spotting using time delay neural networks
    Myer, Samuel
    Tomar, Vikrant Singh
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1264 - 1268
  • [22] BIFOCAL NEURAL ASR: EXPLOITING KEYWORD SPOTTING FOR INFERENCE OPTIMIZATION
    Macoskey, Jon
    Strimel, Grant P.
    Rastrow, Ariya
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5999 - 6003
  • [23] A keyword spotting method
    Guo, R
    Zhu, XY
    PROCEEDINGS OF THE 4TH ASIA-PACIFIC CONFERENCE ON CONTROL & MEASUREMENT, 2000, : 301 - 304
  • [24] New search algorithm for spotting keyword embedded in unconstrained spontaneous speech
    Dai, Lirong
    Wang, Renhua
    1997, (10):
  • [25] Discriminative keyword spotting
    Keshet, Joseph
    Grangier, David
    Bengio, Samy
    SPEECH COMMUNICATION, 2009, 51 (04) : 317 - 329
  • [26] Contextual Keyword Spotting in Lecture Video With Deep Convolutional Neural Network
    Andra, Muhammad Bagus
    Usagawa, Tsuyoshi
    2017 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS), 2017, : 198 - 203
  • [27] Handwritten keyword spotting using deep neural networks and certainty prediction
    Daraee, Fatemeh
    Mozaffari, Saeed
    Razavi, Seyyed Mohammad
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 92
  • [28] SMALL-FOOTPRINT KEYWORD SPOTTING USING DEEP NEURAL NETWORKS
    Chen, Guoguo
    Parada, Carolina
    Heigold, Georg
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [29] A depthwise separable convolutional neural network for keyword spotting on an embedded system
    Peter Mølgaard Sørensen
    Bastian Epp
    Tobias May
    EURASIP Journal on Audio, Speech, and Music Processing, 2020
  • [30] A depthwise separable convolutional neural network for keyword spotting on an embedded system
    Sorensen, Peter Molgaard
    Epp, Bastian
    May, Tobias
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2020, 2020 (01)