Training Keyword Spotting Models on Non-IID Data with Federated Learning

被引:19
|
作者
Hard, Andrew [1 ]
Partridge, Kurt [1 ]
Nguyen, Cameron [1 ]
Subrahmanya, Niranjan [1 ]
Shah, Aishanee [1 ]
Zhu, Pai [1 ]
Moreno, Ignacio Lopez [1 ]
Mathews, Rajiv [1 ]
机构
[1] Google LLC, Mountain View, CA 94043 USA
来源
关键词
federated learning; on-device learning; keyword spotting; wake word detection; non-iid data; data augmentation;
D O I
10.21437/Interspeech.2020-3023
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
We demonstrate that a production-quality keyword-spotting model can be trained on-device using federated learning and achieve comparable false accept and false reject rates to a centrally-trained model. To overcome the algorithmic constraints associated with fitting on-device data (which are inherently non-independent and identically distributed), we conduct thorough empirical studies of optimization algorithms and hyper parameter configurations using large-scale federated simulations. To overcome resource constraints, we replace memory intensive MTR data augmentation with SpecAugment, which reduces the false reject rate by 56%. Finally, to label examples (given the zero visibility into on-device data), we explore teacher-student training.
引用
收藏
页码:4343 / 4347
页数:5
相关论文
共 50 条
  • [41] Hierarchical Federated Learning with Adaptive Clustering on Non-IID Data
    Tian, Yuqing
    Zhang, Zhaoyang
    Yang, Zhaohui
    Jin, Richeng
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 627 - 632
  • [42] Privacy-Enhanced Federated Learning for Non-IID Data
    Tan, Qingjie
    Wu, Shuhui
    Tao, Yuanhong
    MATHEMATICS, 2023, 11 (19)
  • [43] Adaptive Federated Learning on Non-IID Data With Resource Constraint
    Zhang, Jie
    Guo, Song
    Qu, Zhihao
    Zeng, Deze
    Zhan, Yufeng
    Liu, Qifeng
    Akerkar, Rajendra
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (07) : 1655 - 1667
  • [44] A Study of Enhancing Federated Learning on Non-IID Data with Server Learning
    Mai V.S.
    La R.J.
    Zhang T.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 1 - 15
  • [45] Federated Learning for Sentiment Analysis in Presence of Non-IID Data: Sensitivity of Deep Learning Models
    Gholamiangonabadi, Davoud
    Grolinger, Katarina
    IEEE ACCESS, 2024, 12 : 128049 - 128060
  • [46] Federated Learning on Non-IID Data Silos: An Experimental Study
    Li, Qinbin
    Diao, Yiqun
    Chen, Quan
    He, Bing Sheng
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 965 - 978
  • [47] FedRL: Federated Learning with Non-IID Data via Review Learning
    Wang, Jinbo
    Wang, Ruijin
    Pei, Xikai
    2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024, 2024, : 115 - 120
  • [48] FedOVA: One-vs-All Training Method for Federated Learning with Non-IID Data
    Zhu, Yuanshao
    Markos, Christos
    Zhao, Ruihui
    Zheng, Yefeng
    Yu, James J. Q.
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [49] Gradient Calibration for Non-IID Federated Learning
    Li, Jiachen
    Zhang, Yuchao
    Li, Yiping
    Gong, Xiangyang
    Wang, Wendong
    PROCEEDINGS OF THE 2023 THE 2ND ACM WORKSHOP ON DATA PRIVACY AND FEDERATED LEARNING TECHNOLOGIES FOR MOBILE EDGE NETWORK, FEDEDGE 2023, 2023, : 119 - 124
  • [50] MultiConfederated Learning: Inclusive Non-IID Data handling with Decentralized Federated Learning
    Duchesne, Michael
    Zhang, Kaiwen
    Talhi, Chamseddine
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 1587 - 1595