ENABLING ON-DEVICE TRAINING OF SPEECH RECOGNITION MODELS WITH FEDERATED DROPOUT

被引:2
|
作者
Guliani, Dhruv [1 ]
Zhou, Lillian [1 ]
Ryu, Changwan [1 ]
Yang, Tien-Ju [1 ]
Zhang, Harry [1 ]
Xiao, Yonghui [1 ]
Beaufays, Francoise [1 ]
Motta, Giovanni [1 ]
机构
[1] Google LLC, Mountain View, CA 94043 USA
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
federated learning; speech recognition; federated dropout;
D O I
10.1109/ICASSP43922.2022.9746226
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default. This presents a challenge pertaining to the communication and computation costs associated with clients' devices. These costs are strongly correlated with the size of the model being trained, and are significant for state-of-the-art automatic speech recognition models. We propose using federated dropout to reduce the size of client models while training a full-size model server-side. We provide empirical evidence of the effectiveness of federated dropout, and propose a novel approach to vary the dropout rate applied at each layer. Furthermore, we find that federated dropout enables a set of smaller sub-models within the larger model to independently have low word error rates, making it easier to dynamically adjust the size of the model deployed for inference.
引用
收藏
页码:8757 / 8761
页数:5
相关论文
共 50 条
  • [31] Kaldi-web: An installation-free, on-device speech recognition system
    Hu, Mathieu
    Pierron, Laurent
    Vincent, Emmanuel
    Jouvet, Denis
    INTERSPEECH 2020, 2020, : 484 - 485
  • [32] FedNST: Federated Noisy Student Training for Automatic Speech Recognition
    Mehmood, Haaris
    Dobrowolska, Agnieszka
    Saravanan, Karthikeyan
    Ozay, Mete
    INTERSPEECH 2022, 2022, : 1001 - 1005
  • [33] Enabling On-Device LLMs Personalization with Smartphone Sensing
    Zhang, Shiquan
    Ma, Ying
    Fang, Le
    Jia, Hong
    D'Alfonso, Simon
    Kostakos, Vassilis
    COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 186 - 190
  • [34] Streaming Parrotron for on-device speech-to-speech conversion
    Rybakov, Oleg
    Biadsy, Fadi
    Zhang, Xia
    Jiang, Liyang
    Meadowlark, Phoenix
    Agrawal, Shivani
    INTERSPEECH 2023, 2023, : 2033 - 2037
  • [35] Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition
    Zhen, Kai
    Nguyen, Hieu Duy
    Chinta, Raviteja
    Susanj, Nathan
    Mouchtaris, Athanasios
    Afzal, Tariq
    Rastrow, Ariya
    INTERSPEECH 2022, 2022, : 3033 - 3037
  • [36] Hiding in the Crowd: Federated Data Augmentation for On-Device Learning
    Jeong, Eunjeong
    Oh, Seungeun
    Park, Jihong
    Kim, Hyesung
    Bennis, Mehdi
    Kim, Seong-Lyun
    IEEE INTELLIGENT SYSTEMS, 2021, 36 (05) : 80 - 86
  • [37] Personalized Human Activity Recognition: Real-Time On-Device Training and Inference
    Saha, Bidyut
    Samanta, Riya
    Roy, Ram Babu
    Chakraborty, Chinmay
    Ghosh, Soumya K.
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2025, 14 (02) : 84 - 89
  • [38] ON-DEVICE END-TO-END SPEECH RECOGNITION WITH MULTI-STEP PARALLEL RNNS
    Boo, Yoonho
    Park, Jinhwan
    Lee, Lukas
    Sung, Wonyong
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 376 - 381
  • [39] Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
    Ding, Shaojin
    Rikhye, Rajeev
    Liang, Qiao
    He, Yanzhang
    Wang, Quan
    Narayanan, Arun
    O'Malley, Tom
    McGraw, Ian
    INTERSPEECH 2022, 2022, : 3744 - 3748
  • [40] Enabling On-Device CNN Training by Self-Supervised Instance Filtering and Error Map Pruning
    Wu, Yawen
    Wang, Zhepeng
    Shi, Yiyu
    Hu, Jingtong
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (11) : 3445 - 3457