Training Keyword Spotting Models on Non-IID Data with Federated Learning

Cited by: 19
Authors
Hard, Andrew [1]
Partridge, Kurt [1]
Nguyen, Cameron [1]
Subrahmanya, Niranjan [1]
Shah, Aishanee [1]
Zhu, Pai [1]
Moreno, Ignacio Lopez [1]
Mathews, Rajiv [1]
Affiliation
[1] Google LLC, Mountain View, CA 94043 USA
Source
Keywords
federated learning; on-device learning; keyword spotting; wake word detection; non-IID data; data augmentation
DOI
10.21437/Interspeech.2020-3023
Chinese Library Classification
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification codes
100104; 100213
Abstract
We demonstrate that a production-quality keyword-spotting model can be trained on-device using federated learning and achieve false accept and false reject rates comparable to those of a centrally trained model. To overcome the algorithmic constraints associated with fitting on-device data (which are inherently not independent and identically distributed, i.e. non-IID), we conduct thorough empirical studies of optimization algorithms and hyperparameter configurations using large-scale federated simulations. To overcome resource constraints, we replace memory-intensive MTR data augmentation with SpecAugment, which reduces the false reject rate by 56%. Finally, to label examples (given the zero visibility into on-device data), we explore teacher-student training.
Pages: 4343 - 4347
Number of pages: 5
Related papers
50 in total
  • [31] FedCML: Federated Clustering Mutual Learning with non-IID Data
    Chen, Zekai
    Wang, Fuyi
    Yu, Shengxing
    Liu, Ximeng
    Zheng, Zhiwei
    EURO-PAR 2023: PARALLEL PROCESSING, 2023, 14100 : 623 - 636
  • [32] Heterogeneous Federated Learning for Non-IID Smartwatch Data Classification
    Syu, Jia-Hao
    Lin, Jerry Chun-Wei
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (18) : 29811 - 29818
  • [33] Ensemble Federated Learning With Non-IID Data in Wireless Networks
    Zhao, Zhongyuan
    Wang, Jingyi
    Hong, Wei
    Quek, Tony Q. S.
    Ding, Zhiguo
    Peng, Mugen
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (04) : 3557 - 3571
  • [34] Advanced Optimization Techniques for Federated Learning on Non-IID Data
    Efthymiadis, Filippos
    Karras, Aristeidis
    Karras, Christos
    Sioutas, Spyros
    FUTURE INTERNET, 2024, 16 (10)
  • [35] Feature Matching Data Synthesis for Non-IID Federated Learning
    Li, Zijian
    Sun, Yuchang
    Shao, Jiawei
    Mao, Yuyi
    Wang, Jessie Hui
    Zhang, Jun
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (10) : 9352 - 9367
  • [36] FedKT: Federated learning with knowledge transfer for non-IID data
    Mao, Wenjie
    Yu, Bin
    Zhang, Chen
    Qin, A. K.
    Xie, Yu
    PATTERN RECOGNITION, 2025, 159
  • [37] Is Non-IID Data a Threat in Federated Online Learning to Rank?
    Wang, Shuyi
    Zuccon, Guido
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022 : 2801 - 2813
  • [38] FedRL: Improving the Performance of Federated Learning with Non-IID Data
    Kang, Yufei
    Li, Baochun
    Zeyl, Timothy
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022 : 3023 - 3028
  • [39] FedAP: Adaptive Personalization in Federated Learning for Non-IID Data
    Yeganeh, Yousef
    Farshad, Azade
    Boschmann, Johann
    Gaus, Richard
    Frantzen, Maximilian
    Navab, Nassir
    DISTRIBUTED, COLLABORATIVE, AND FEDERATED LEARNING, AND AFFORDABLE AI AND HEALTHCARE FOR RESOURCE DIVERSE GLOBAL HEALTH, DECAF 2022, FAIR 2022, 2022, 13573 : 17 - 27
  • [40] A Comprehensive Study on Personalized Federated Learning with Non-IID Data
    Yu, Menghang
    Zheng, Zhenzhe
    Li, Qinya
    Wu, Fan
    Zheng, Jiaqi
    2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022 : 40 - 49