A depthwise separable convolutional neural network for keyword spotting on an embedded system

被引:16
|
作者
Sorensen, Peter Molgaard [1 ]
Epp, Bastian [1 ]
May, Tobias [1 ]
机构
[1] Tech Univ Denmark, Ctr Appl Hearing Res, Lyngby, Denmark
关键词
Keyword spotting; Speech recognition; Embedded software; Deep learning; Convolutional neural networks; Quantization; SPEECH; RECOGNITION;
D O I
10.1186/s13636-020-00176-2
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A keyword spotting algorithm implemented on an embedded system using a depthwise separable convolutional neural network classifier is reported. The proposed system was derived from a high-complexity system with the goal to reduce complexity and to increase efficiency. In order to meet the requirements set by hardware resource constraints, a limited hyper-parameter grid search was performed, which showed that network complexity could be drastically reduced with little effect on classification accuracy. It was furthermore found that quantization of pre-trained networks using mixed and dynamic fixed point principles could reduce the memory footprint and computational requirements without lowering classification accuracy. Data augmentation techniques were used to increase network robustness in unseen acoustic conditions by mixing training data with realistic noise recordings. Finally, the system's ability to detect keywords in a continuous audio stream was successfully demonstrated.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A depthwise separable convolutional neural network for keyword spotting on an embedded system
    Peter Mølgaard Sørensen
    Bastian Epp
    Tobias May
    EURASIP Journal on Audio, Speech, and Music Processing, 2020
  • [2] FPGA Implementation of Keyword Spotting System Using Depthwise Separable Binarized and Ternarized Neural Networks
    Bae, Seongwoo
    Kim, Haechan
    Lee, Seongjoo
    Jung, Yunho
    SENSORS, 2023, 23 (12)
  • [3] Depthwise-Separable Residual Capsule for Robust Keyword Spotting
    Huang, Xianghong
    Yang, Qun
    Liu, Shaohan
    MULTIMEDIA MODELING, MMM 2022, PT II, 2022, 13142 : 194 - 204
  • [4] Depthwise Separable Convolutional ResNet with Squeeze-and-Excitation Blocks for Small-footprint Keyword Spotting
    Xu, Menglong
    Zhang, Xiao-Lei
    INTERSPEECH 2020, 2020, : 2547 - 2551
  • [5] Depthwise Separable Convolutional Neural Network for Skin Lesion Classification
    Kassani, Sara Hosseinzadeh
    Kassani, Peyman Hosseinzadeh
    Wesolowski, Michal J.
    Schneider, Kevin A.
    Deters, Ralph
    2019 IEEE 19TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2019), 2019,
  • [6] Depthwise Separable Convolutional Neural Network for Confidential Information Analysis
    Lu, Yue
    Jiang, Jianguo
    Yu, Min
    Liu, Chao
    Liu, Chaochao
    Huang, Weiqing
    Lv, Zhiqiang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 450 - 461
  • [7] Keyword spotting in continuous speech using convolutional neural network
    Rostami, Amir Mohammad
    Karimi, Ali
    Akhaee, Mohammad Ali
    SPEECH COMMUNICATION, 2022, 142 : 15 - 21
  • [8] Keyword spotting in continuous speech using convolutional neural network
    Rostami, Amir Mohammad
    Karimi, Ali
    Akhaee, Mohammad Ali
    Speech Communication, 2022, 142 : 15 - 21
  • [9] FPGA Implementation for Odor Identification with Depthwise Separable Convolutional Neural Network
    Mo, Zhuofeng
    Luo, Dehan
    Wen, Tengteng
    Cheng, Yu
    Li, Xin
    SENSORS, 2021, 21 (03) : 1 - 19
  • [10] Designing efficient accelerator of depthwise separable convolutional neural network on FPGA
    Ding, Wei
    Huang, Zeyu
    Huang, Zunkai
    Tian, Li
    Wang, Hui
    Feng, Songlin
    JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 97 : 278 - 286