DYNAMIC CURRICULUM LEARNING VIA DATA PARAMETERS FOR NOISE ROBUST KEYWORD SPOTTING

被引:6
作者
Higuchi, Takuya [1 ]
Saxena, Shreyas [1 ]
Souden, Mehrez [1 ]
Tien Dung Tran [1 ]
Delfarah, Masood [1 ]
Dhir, Chandra [1 ]
机构
[1] Apple, Cupertino, CA 95014 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
Noise robustness; acoustic modeling; keyword spotting; curriculum learning; NEURAL-NETWORKS;
D O I
10.1109/ICASSP39728.2021.9414501
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose dynamic curriculum learning via data parameters for noise robust keyword spotting. Data parameter learning has recently been introduced for image processing, where weight parameters, so-called data parameters, for target classes and instances are introduced and optimized along with model parameters. The data parameters scale logits and control importance over classes and instances during training, which enables automatic curriculum learning without additional annotations for training data. Similarly, in this paper, we propose using this curriculum learning approach for acoustic modeling, and train an acoustic model on clean and noisy utterances with the data parameters. The proposed approach automatically learns the difficulty of the classes and instances, e.g. due to low speech to noise ratio (SNR), in the gradient descent optimization and performs curriculum learning. This curriculum learning leads to overall improvement of the accuracy of the acoustic model. We evaluate the effectiveness of the proposed approach on a keyword spotting task. Experimental results show 7.7% relative reduction in false reject ratio with the data parameters compared to a baseline model which is simply trained on the multiconditioned dataset.
引用
收藏
页码:6848 / 6852
页数:5
相关论文
共 34 条
  • [21] Dynamic Data Distribution-based Curriculum Learning
    Chaudhry, Shonal
    Sharma, Anuraganand
    INFORMATION SCIENCES, 2025, 702
  • [22] Curriculum Learning based Probabilistic Linear Discriminant Analysis for Noise Robust Speaker Recognition
    Ranjan, Shivesh
    Misra, Abhinav
    Hansen, John H. L.
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3717 - 3721
  • [23] Curriculum learning based approach for noise robust language identification using DNN with attention
    Vuddagiri, Ravi Kumar
    Vydana, Hari Krishna
    Vuppala, Anil Kumar
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 110 : 290 - 297
  • [24] Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention
    Jung, Myunghun
    Jung, Youngmoon
    Goo, Jahyun
    Kim, Hoirin
    INTERSPEECH 2020, 2020, : 931 - 935
  • [25] Improving Data Augmentation for Robust Visual Question Answering with Effective Curriculum Learning
    Zheng, Yuhang
    Wang, Zhen
    Chen, Long
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 1084 - 1088
  • [26] Dynamic Curriculum Learning via In-Domain Uncertainty for Medical Image Classification
    Li, Chaoyi
    Li, Meng
    Peng, Can
    Lovell, Brian C.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT V, 2023, 14224 : 747 - 757
  • [27] Reinforcement Learning based Data Augmentation for Noise Robust Speech Emotion Recognition
    Ranjan, Sumit
    Chakraborty, Rupayan
    Kopparapu, Sunil Kumar
    INTERSPEECH 2024, 2024, : 1040 - 1044
  • [28] IMPROVING NOISE ROBUSTNESS OF AUTOMATIC SPEECH RECOGNITION VIA PARALLEL DATA AND TEACHER-STUDENT LEARNING
    Mosner, Ladislav
    Wu, Minhua
    Raju, Anirudh
    Parthasarathi, Sree Hari Krishnan
    Kumatani, Kenichi
    Sundaram, Shiva
    Maas, Roland
    Hoffmeister, Bjorn
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6475 - 6479
  • [29] Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction
    Zhu, Zhangchi
    Wang, Lu
    Zhao, Pu
    Du, Chao
    Zhang, Wei
    Dong, Hang
    Qiao, Bo
    Lin, Qingwei
    Rajmohan, Saravan
    Zhang, Dongmei
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3663 - 3673
  • [30] Robust Learning From Noisy Web Images Via Data Purification for Fine-Grained Recognition
    Zhang, Chuanyi
    Wang, Qiong
    Xie, Guosen
    Wu, Qi
    Shen, Fumin
    Tang, Zhenmin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1198 - 1209