DYNAMIC CURRICULUM LEARNING VIA DATA PARAMETERS FOR NOISE ROBUST KEYWORD SPOTTING

被引:6
|
作者
Higuchi, Takuya [1 ]
Saxena, Shreyas [1 ]
Souden, Mehrez [1 ]
Tien Dung Tran [1 ]
Delfarah, Masood [1 ]
Dhir, Chandra [1 ]
机构
[1] Apple, Cupertino, CA 95014 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
Noise robustness; acoustic modeling; keyword spotting; curriculum learning; NEURAL-NETWORKS;
D O I
10.1109/ICASSP39728.2021.9414501
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose dynamic curriculum learning via data parameters for noise robust keyword spotting. Data parameter learning has recently been introduced for image processing, where weight parameters, so-called data parameters, for target classes and instances are introduced and optimized along with model parameters. The data parameters scale logits and control importance over classes and instances during training, which enables automatic curriculum learning without additional annotations for training data. Similarly, in this paper, we propose using this curriculum learning approach for acoustic modeling, and train an acoustic model on clean and noisy utterances with the data parameters. The proposed approach automatically learns the difficulty of the classes and instances, e.g. due to low speech to noise ratio (SNR), in the gradient descent optimization and performs curriculum learning. This curriculum learning leads to overall improvement of the accuracy of the acoustic model. We evaluate the effectiveness of the proposed approach on a keyword spotting task. Experimental results show 7.7% relative reduction in false reject ratio with the data parameters compared to a baseline model which is simply trained on the multiconditioned dataset.
引用
收藏
页码:6848 / 6852
页数:5
相关论文
共 34 条
  • [1] Discriminatory and Orthogonal Feature Learning for Noise Robust Keyword Spotting
    Kim, Donghyeon
    Ko, Kyungdeuk
    Han, David K.
    Ko, Hanseok
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1913 - 1917
  • [2] Joint Framework of Curriculum Learning and Knowledge Distillation for Noise-Robust and Small-Footprint Keyword Spotting
    Lim, Jaebong
    Baek, Yunju
    IEEE ACCESS, 2023, 11 : 100540 - 100553
  • [3] Prototypical Knowledge Distillation for Noise Robust Keyword Spotting
    Kim, Donghyeon
    Kim, Gwantae
    Lee, Bokyeung
    Ko, Hanseok
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2298 - 2302
  • [4] A Novel Loss Function and Training Strategy for Noise-Robust Keyword Spotting
    Lopez-Espejo, Ivan
    Tan, Zheng-Hua
    Jensen, Jesper
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2254 - 2266
  • [5] NOISE ROBUST KEYWORD SPOTTING FOR USER GENERATED VIDEO BLOGS
    Barakat, M. S.
    Ritz, C. H.
    Stirling, D. A.
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [6] Robust Keyword Spotting via Recycle-Pooling for Mobile Game
    An, Shounan
    Kim, Youngsoo
    Xu, Hu
    Lee, Jinwoo
    Lee, Myungwoo
    Oh, Insoo
    INTERSPEECH 2019, 2019, : 3661 - 3662
  • [7] Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting
    Woellmer, Martin
    Marchi, Erik
    Squartini, Stefano
    Schuller, Bjoern
    COGNITIVE NEURODYNAMICS, 2011, 5 (03) : 253 - 264
  • [8] Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting
    Martin Wöllmer
    Erik Marchi
    Stefano Squartini
    Björn Schuller
    Cognitive Neurodynamics, 2011, 5 : 253 - 264
  • [9] A Pitch and Noise Robust Keyword Spotting System Using SMAC Features with Prosody Modification
    Maity, Karabi
    Pradhan, Gayadhar
    Singh, Jyoti Prakash
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (04) : 1892 - 1904
  • [10] A Pitch and Noise Robust Keyword Spotting System Using SMAC Features with Prosody Modification
    Karabi Maity
    Gayadhar Pradhan
    Jyoti Prakash Singh
    Circuits, Systems, and Signal Processing, 2021, 40 : 1892 - 1904