A LIGHTWEIGHT DYNAMIC FILTER FOR KEYWORD SPOTTING

被引:2
|
作者
Kim, Donghyeon [1 ]
Ko, Kyungdeuk [1 ]
Kwak, Jeonggi [1 ]
Han, David K. [2 ]
Ko, Hanseok [1 ]
机构
[1] Korea Univ, Seoul, South Korea
[2] Drexel Univ, Philadelphia, PA USA
来源
2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW | 2023年
关键词
keyword spotting; dynamic filter; dynamic weight; computational cost;
D O I
10.1109/ICASSPW59220.2023.10193258
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Keyword Spotting (KWS) from speech signals is widely applied to perform fully hands-free speech recognition. The KWS network is designed as a small-footprint model so it can continuously be active. Recent efforts have explored dynamic filter-based models in deep learning frameworks to enhance the system's robustness or accuracy. However, as a dynamic filter framework requires high computational costs, the implementation is limited to the computational condition of the device. In this paper, we propose a lightweight dynamic filter to improve the performance of KWS. Our proposed model divides the dynamic filter into two branches to reduce computational complexity: pixel level and instance level. The proposed lightweight dynamic filter is applied to the front end of KWS to enhance the separability of the input data. The experimental results show that our model is robustly working on unseen noise and small training data environments by using a small computational resource.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] LETR: A LIGHTWEIGHT AND EFFICIENT TRANSFORMER FOR KEYWORD SPOTTING
    Ding, Kevin
    Zong, Martin
    Li, Jiakui
    Li, Baoxiang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7987 - 7991
  • [2] Variational Dynamic Background Model for Keyword Spotting in Handwritten Documents
    Kumar, Gaurav
    Wshah, Safwan
    Govindaraju, Venu
    DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
  • [3] Discriminative keyword spotting
    Keshet, Joseph
    Grangier, David
    Bengio, Samy
    SPEECH COMMUNICATION, 2009, 51 (04) : 317 - 329
  • [4] Segmentation Free Keyword Spotting Framework using Dynamic Background Model
    Kumar, Gaurav
    Wshah, Safwan
    Govindaraju, Venu
    Ramachandrula, Sitaram
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [5] METRIC LEARNING FOR KEYWORD SPOTTING
    Huh, Jaesung
    Lee, Minjae
    Heo, Heesoo
    Mun, Seongkyu
    Chung, Joon Son
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 133 - 140
  • [6] Keyword Spotting using Dynamic Time Warping and Convolutional Recurrent Networks
    Albert, Erika-Timea
    Lemnaru, Camelia
    Dinsoreanu, Mihaela
    Potolea, Rodica
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2019), 2019, : 53 - 60
  • [7] A New Keyword Spotting Approach
    Bahi, Halima
    Benati, Nadia
    2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 77 - +
  • [8] Latency Control for Keyword Spotting
    Jose, Christin
    Wang, Joseph
    Strimel, Grant P.
    Khursheed, Mohammad Omar
    Mishchenko, Yuriy
    Kulis, Brian
    INTERSPEECH 2022, 2022, : 1891 - 1895
  • [9] FEDERATED LEARNING FOR KEYWORD SPOTTING
    Leroy, David
    Coucke, Alice
    Lavril, Thibaut
    Gisselbrecht, Thibault
    Dureau, Joseph
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6341 - 6345
  • [10] Development and Optimization of an Ultra-lightweight Deep Spoken Keyword Spotting Model for FPGA Acceleration
    Dembeck, Trysten
    Parikh, Chirag
    COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, CAINE 2024, 2025, 2242 : 3 - 20