ENABLING ON-DEVICE TRAINING OF SPEECH RECOGNITION MODELS WITH FEDERATED DROPOUT

Cited by: 2
Authors
Guliani, Dhruv [1]
Zhou, Lillian [1]
Ryu, Changwan [1]
Yang, Tien-Ju [1]
Zhang, Harry [1]
Xiao, Yonghui [1]
Beaufays, Francoise [1]
Motta, Giovanni [1]
Affiliation
[1] Google LLC, Mountain View, CA 94043 USA
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022
Keywords
federated learning; speech recognition; federated dropout
DOI
10.1109/ICASSP43922.2022.9746226
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default. This presents a challenge pertaining to the communication and computation costs associated with clients' devices. These costs are strongly correlated with the size of the model being trained, and are significant for state-of-the-art automatic speech recognition models. We propose using federated dropout to reduce the size of client models while training a full-size model server-side. We provide empirical evidence of the effectiveness of federated dropout, and propose a novel approach to vary the dropout rate applied at each layer. Furthermore, we find that federated dropout enables a set of smaller sub-models within the larger model to independently have low word error rates, making it easier to dynamically adjust the size of the model deployed for inference.
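The idea summarized in the abstract (clients train a smaller sub-model while the server keeps the full-size model, with a dropout rate that can differ per layer) can be illustrated with a minimal sketch. This is not the authors' implementation: the helper names (`kept_units`, `extract_submatrix`, `aggregate`), the layer widths, the per-layer rates, the stand-in for local training, and the coordinate-wise averaging rule are all assumptions chosen for illustration.

```python
import random


def kept_units(width, rate, rng):
    """Pick which units of a layer a client keeps for one round (hypothetical helper)."""
    n_keep = max(1, round(width * (1.0 - rate)))
    return sorted(rng.sample(range(width), n_keep))


def extract_submatrix(weights, rows, cols):
    """Slice the full weight matrix down to the rows/columns the client keeps."""
    return [[weights[r][c] for c in cols] for r in rows]


def aggregate(weights, updates):
    """Apply client deltas to the full matrix.

    Assumption: each entry is averaged over only the clients that trained it;
    entries no client touched are left unchanged.
    """
    n_rows, n_cols = len(weights), len(weights[0])
    total = [[0.0] * n_cols for _ in range(n_rows)]
    count = [[0] * n_cols for _ in range(n_rows)]
    for rows, cols, delta in updates:
        for i, r in enumerate(rows):
            for j, c in enumerate(cols):
                total[r][c] += delta[i][j]
                count[r][c] += 1
    return [
        [
            weights[r][c] + (total[r][c] / count[r][c] if count[r][c] else 0.0)
            for c in range(n_cols)
        ]
        for r in range(n_rows)
    ]


rng = random.Random(0)
layer_widths = [8, 8, 4]       # hypothetical layer sizes
layer_rates = [0.0, 0.5, 0.0]  # hypothetical per-layer dropout rates

# Full-size server weights connecting layer 0 to layer 1.
full = [[0.0] * layer_widths[1] for _ in range(layer_widths[0])]

updates = []
for _ in range(3):  # three simulated clients
    rows = kept_units(layer_widths[0], layer_rates[0], rng)
    cols = kept_units(layer_widths[1], layer_rates[1], rng)
    sub = extract_submatrix(full, rows, cols)
    # Stand-in for local training: pretend every kept weight moved by +1.0.
    delta = [[1.0 for _ in row] for row in sub]
    updates.append((rows, cols, delta))

new_full = aggregate(full, updates)
```

With these illustrative rates, each client receives an 8 x 4 slice of the 8 x 8 matrix, so roughly half of that layer's weights are sent and trained per client, while the server still updates the full-size matrix.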
Pages: 8757-8761
Page count: 5