Custom Mandarin Keyword Spotting with Extended Long Short-Term Memory

被引:0
|
作者
Cao, Haitao [1 ]
Liu, Xi [1 ]
Tan, Zhiguo [1 ]
Yang, Zhenlun [1 ]
Qin, Xin [2 ]
机构
[1] School of Information Engineering, Guangzhou Panyu Polytechnic, Guangzhou,511483, China
[2] Institute of Big Data and Internet Innovation, Hunan University of Technology and Business, Changsha,410205, China
关键词
Deep neural networks;
D O I
暂无
中图分类号
学科分类号
摘要
In real-world scenarios, Deep Neural Network (DNN)-powered Keyword Spotting (KWS) systems are typically engineered as lightweight architectures, optimizing for superior performance and low computational complexity in resource-limited devices. However, such lightweight designs often encounter limitations in generalization, particularly when it comes to customizing keywords. This paper presents a twostage method to customize a Mandarin KWS system rapidly. First, we propose an embedding model to learn the embedding representations of general Mandarin keywords. Subsequently, we facilitate keyword customization with the generalization capability of embedding models through few-shot transfer learning. To improve performance further, in the embedding model, we introduce two scale blocks to fuse acoustic features and employ an Enhanced Extended Long Short-Term Memory (ExLSTM) as the backbone. Experimental results on both English and Mandarin keyword datasets highlight the advantages of the proposed embedding model. In addition, we conduct keyword customization on a self-recorded dataset containing 10 Mandarin keywords. The impressive average accuracy of 97.45% with merely five target samples demonstrates the effectiveness of our method. © (2024), (International Association of Engineers). All rights reserved.
引用
收藏
页码:1933 / 1942
相关论文
共 35 条
  • [21] Examining the influence of sampling frequency on state-of-charge estimation accuracy using long short-term memory models
    Arabaci, Hayri
    Ucar, Kursad
    Cimen, Halil
    ELECTRICAL ENGINEERING, 2024, 106 (05) : 6449 - 6462
  • [22] Punjabi news multi-classification using language generation-based optimized long short-term memory networks
    Varun Gupta
    Ekta Gupta
    Evolving Systems, 2023, 14 : 49 - 58
  • [23] Punjabi news multi-classification using language generation-based optimized long short-term memory networks
    Gupta, Varun
    Gupta, Ekta
    EVOLVING SYSTEMS, 2023, 14 (01) : 49 - 58
  • [24] Enhancing Legal Sentiment Analysis: A Convolutional Neural Network-Long Short-Term Memory Document-Level Model
    Abimbola, Bolanle
    de la Cal Marin, Enrique
    Tan, Qing
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (02): : 877 - 897
  • [25] Handwritten Bangla Numeral Recognition Using Deep Long Short Term Memory
    Ahmed, Mahtab
    Akhand, M. A. H.
    Rahman, M. M. Hafizur
    2016 6TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR THE MUSLIM WORLD (ICT4M), 2016, : 310 - 315
  • [26] Integrated adversarial long short-term memory deep networks for reheater tube temperature forecasting of ultra-supercritical turbo-generators
    Yin L.
    Wei X.
    Applied Soft Computing, 2023, 142
  • [27] Convolutional long short term memory deep neural networks for image sequence prediction
    Balderas, David
    Ponce, Pedro
    Molina, Arturo
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 122 : 152 - 162
  • [28] A Novel Locomotion Rule Rmbedding Long Short-Term Memory Network with Attention for Human Locomotor Intent Classification Using Multi-Sensors Signals
    Shen, Jiajie
    Wang, Yan
    Zhang, Dongxu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (03): : 4349 - 4370
  • [29] Developing a novel framework for forecasting groundwater level fluctuations using Bi-directional Long Short-Term Memory (BiLSTM) deep neural network
    Ghasemlounia, Redvan
    Gharehbaghi, Amin
    Ahmadi, Farshad
    Saadatnejadgharahassanlou, Hamid
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 191
  • [30] Modeling Long- and Short-Term User Behaviors for Sequential Recommendation with Deep Neural Networks
    Yan, Cairong
    Wang, Yiwei
    Zhang, Yanting
    Wang, Zijian
    Wang, Pengwei
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,