Training Keyword Spotting Models on Non-IID Data with Federated Learning

被引:19
|
作者
Hard, Andrew [1 ]
Partridge, Kurt [1 ]
Nguyen, Cameron [1 ]
Subrahmanya, Niranjan [1 ]
Shah, Aishanee [1 ]
Zhu, Pai [1 ]
Moreno, Ignacio Lopez [1 ]
Mathews, Rajiv [1 ]
机构
[1] Google LLC, Mountain View, CA 94043 USA
来源
关键词
federated learning; on-device learning; keyword spotting; wake word detection; non-iid data; data augmentation;
D O I
10.21437/Interspeech.2020-3023
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We demonstrate that a production-quality keyword-spotting model can be trained on-device using federated learning and achieve comparable false accept and false reject rates to a centrally-trained model. To overcome the algorithmic constraints associated with fitting on-device data (which are inherently non-independent and identically distributed), we conduct thorough empirical studies of optimization algorithms and hyper parameter configurations using large-scale federated simulations. To overcome resource constraints, we replace memory intensive MTR data augmentation with SpecAugment, which reduces the false reject rate by 56%. Finally, to label examples (given the zero visibility into on-device data), we explore teacher-student training.
引用
收藏
页码:4343 / 4347
页数:5
相关论文
共 50 条
  • [1] Federated learning on non-IID data: A survey
    Zhu, Hangyu
    Xu, Jinjin
    Liu, Shiqing
    Jin, Yaochu
    NEUROCOMPUTING, 2021, 465 : 371 - 390
  • [2] Adaptive Federated Learning With Non-IID Data
    Zeng, Yan
    Mu, Yuankai
    Yuan, Junfeng
    Teng, Siyuan
    Zhang, Jilin
    Wan, Jian
    Ren, Yongjian
    Zhang, Yunquan
    COMPUTER JOURNAL, 2023, 66 (11): : 2758 - 2772
  • [3] Federated Learning With Taskonomy for Non-IID Data
    Jamali-Rad, Hadi
    Abdizadeh, Mohammad
    Singh, Anuj
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8719 - 8730
  • [4] Federated Learning With Non-IID Data: A Survey
    Lu, Zili
    Pan, Heng
    Dai, Yueyue
    Si, Xueming
    Zhang, Yan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (11): : 19188 - 19209
  • [5] A Survey of Federated Learning on Non-IID Data
    HAN Xuming
    GAO Minghan
    WANG Limin
    HE Zaobo
    WANG Yanze
    ZTECommunications, 2022, 20 (03) : 17 - 26
  • [6] Non-IID Federated Learning
    Cao, Longbing
    IEEE INTELLIGENT SYSTEMS, 2022, 37 (02) : 14 - 15
  • [7] Differentially private federated learning with non-IID data
    Cheng, Shuyan
    Li, Peng
    Wang, Ruchuan
    Xu, He
    COMPUTING, 2024, 106 (07) : 2459 - 2488
  • [8] Fast converging Federated Learning with Non-IID Data
    Naas, Si -Ahmed
    Sigg, Stephan
    2023 IEEE 97TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-SPRING, 2023,
  • [9] Adaptive Federated Deep Learning With Non-IID Data
    Zhang, Ze-Hui
    Li, Qing-Dan
    Fu, Yao
    He, Ning-Xin
    Gao, Tie-Gang
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (12): : 2493 - 2506
  • [10] Federated Dictionary Learning from Non-IID Data
    Gkillas, Alexandros
    Ampeliotis, Dimitris
    Berberidis, Kostas
    2022 IEEE 14TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2022,