STOCHASTIC ADAPTIVE NEURAL ARCHITECTURE SEARCH FOR KEYWORD SPOTTING

被引:0
|
作者
Veniat, Tom [1 ]
Schwander, Olivier [1 ]
Denoyer, Ludovic [1 ,2 ]
机构
[1] Sorbonne Univ, LIP6, F-75005 Paris, France
[2] Facebook AI Res, Menlo Pk, CA USA
来源
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2019年
关键词
Neural Architecture Search; Keyword Spotting; Deep Learning; Budgeted Learning;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The problem of keyword spotting i.e. identifying keywords in a real-time audio stream is mainly solved by applying a neural network over successive sliding windows. Due to the difficulty of the task, baseline models are usually large, resulting in a high computational cost and energy consumption level. We propose a new method called SANAS (Stochastic Adaptive Neural Architecture Search) which is able to adapt the architecture of the neural network on-the-fly at inference time such that small architectures will be used when the stream is easy to process (silence, low noise, ... ) and bigger networks will be used when the task becomes more difficult. We show that this adaptive model can be learned end-to-end by optimizing a trade-off between the prediction performance and the average computational cost per unit of time. Experiments on the Speech Commands dataset [1] show that this approach leads to a high recognition level while being much faster (and/or energy saving) than classical approaches where the network architecture is static.
引用
收藏
页码:2842 / 2846
页数:5
相关论文
共 50 条
  • [1] Neural Architecture Search For Keyword Spotting
    Mo, Tong
    Yu, Yakun
    Salameh, Mohammad
    Niu, Di
    Jui, Shangling
    INTERSPEECH 2020, 2020, : 1982 - 1986
  • [2] Target-Aware Neural Architecture Search and Deployment for Keyword Spotting
    Busia, Paola
    Deriu, Gianfranco
    Rinelli, Luca
    Chesta, Cristina
    Raffo, Luigi
    Meloni, Paolo
    IEEE ACCESS, 2022, 10 : 40687 - 40700
  • [3] AUTOKWS: KEYWORD SPOTTING WITH DIFFERENTIABLE ARCHITECTURE SEARCH
    Zhang, Bo
    Li, Wenfeng
    Li, Qingyuan
    Zhuang, Weiji
    Chu, Xiangxiang
    Wang, Yujun
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2830 - 2834
  • [4] Improving Keyword Spotting and Language Identification via Neural Architecture Search at Scale
    Mazzawi, Hanna
    Gonzalvo, Xavi
    Kracun, Aleks
    Sridhar, Prashant
    Subrahmanya, Niranjan
    Moreno, Ignacio Lopez
    Park, Hyun Jin
    Violette, Patrick
    INTERSPEECH 2019, 2019, : 1278 - 1282
  • [5] END-TO-END KEYWORD SPOTTING USING NEURAL ARCHITECTURE SEARCH AND QUANTIZATION
    Peter, David
    Roth, Wolfgang
    Pernkopf, Franz
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3423 - 3427
  • [6] Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization
    Peter, David
    Roth, Wolfgang
    Pernkopf, Franz
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9273 - 9279
  • [7] Channel Shuffle Neural Architecture Search for Key Word Spotting
    Lee, Bokyeung
    Kim, Donghyeon
    Kim, Gwantae
    Ko, Hanseok
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 443 - 447
  • [8] TEXT ADAPTIVE DETECTION FOR CUSTOMIZABLE KEYWORD SPOTTING
    Xi, Yu
    Tan, Tian
    Zhang, Wangyou
    Yang, Baochen
    Yu, Kai
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6652 - 6656
  • [9] Keyword spotting based on recurrent neural network
    Zhou, JL
    Liu, J
    Song, YT
    Yu, TC
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 710 - 713
  • [10] Keyword spotting based on recurrent neural network
    Chinese Acad of Science, Beijing, China
    Int Conf Signal Process Proc, (710-713):