A depthwise separable convolutional neural network for keyword spotting on an embedded system

被引:16
|
作者
Sorensen, Peter Molgaard [1 ]
Epp, Bastian [1 ]
May, Tobias [1 ]
机构
[1] Tech Univ Denmark, Ctr Appl Hearing Res, Lyngby, Denmark
关键词
Keyword spotting; Speech recognition; Embedded software; Deep learning; Convolutional neural networks; Quantization; SPEECH; RECOGNITION;
D O I
10.1186/s13636-020-00176-2
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A keyword spotting algorithm implemented on an embedded system using a depthwise separable convolutional neural network classifier is reported. The proposed system was derived from a high-complexity system with the goal to reduce complexity and to increase efficiency. In order to meet the requirements set by hardware resource constraints, a limited hyper-parameter grid search was performed, which showed that network complexity could be drastically reduced with little effect on classification accuracy. It was furthermore found that quantization of pre-trained networks using mixed and dynamic fixed point principles could reduce the memory footprint and computational requirements without lowering classification accuracy. Data augmentation techniques were used to increase network robustness in unseen acoustic conditions by mixing training data with realistic noise recordings. Finally, the system's ability to detect keywords in a continuous audio stream was successfully demonstrated.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] A double-channel multiscale depthwise separable convolutional neural network for abnormal gait recognition
    Liu, Xiaoguang
    Wu, Yubo
    Chen, Meng
    Liang, Tie
    Han, Fei
    Liu, Xiuling
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (05) : 8049 - 8067
  • [32] Keyword spotting based on recurrent neural network
    Zhou, JL
    Liu, J
    Song, YT
    Yu, TC
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 710 - 713
  • [33] Keyword spotting based on recurrent neural network
    Chinese Acad of Science, Beijing, China
    Int Conf Signal Process Proc, (710-713):
  • [34] SDCN: Synchronized Depthwise Separable Convolutional Neural Network for Single Image Super-Resolution
    Muhammad, Wazir
    Bhutto, Zuhaibuddin
    Shah, Syed Ali Raza
    Shah, Jalal
    Shaikh, Murtaza Hussain
    Hussain, Ayaz
    Masrour, Salman
    Thaheem, Imdadullah
    Ali, Shamshad
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (11): : 17 - 22
  • [35] Fault Line Selection Method Based on Transfer Learning Depthwise Separable Convolutional Neural Network
    Zhang H.
    Cheng W.
    Zhang, Haixia (zhanghaixia@hngm.edu.cn), 1600, Hindawi Limited (2021):
  • [36] Age Estimation via Fusion of Depthwise Separable Convolutional Neural Networks
    Liu, Kuan-Hsien
    Liu, Hsin-Hua
    Chan, Pak Ki
    Liu, Tsung-Jung
    Pei, Soo-Chang
    2018 10TH IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2018,
  • [37] Convolutional Neural Networks for Small-footprint Keyword Spotting
    Sainath, Tara N.
    Parada, Carolina
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1478 - 1482
  • [38] Facial expression recognition based on improved depthwise separable convolutional network
    Huo, Hua
    Yu, YaLi
    Liu, ZhongHua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 18635 - 18652
  • [39] Facial expression recognition based on improved depthwise separable convolutional network
    Hua Huo
    YaLi Yu
    ZhongHua Liu
    Multimedia Tools and Applications, 2023, 82 : 18635 - 18652
  • [40] MDSCN: multiscale depthwise separable convolutional network for underwater graphics restoration
    Li, Shiyu
    Liu, Zehao
    Gao, Meijing
    Bai, Yang
    Yin, Haozheng
    VISUAL COMPUTER, 2025, 41 (03): : 1999 - 2010