ENABLING ON-DEVICE TRAINING OF SPEECH RECOGNITION MODELS WITH FEDERATED DROPOUT

被引：2

作者：

Guliani, Dhruv ^{[1
]}

Zhou, Lillian ^{[1
]}

Ryu, Changwan ^{[1
]}

Yang, Tien-Ju ^{[1
]}

Zhang, Harry ^{[1
]}

Xiao, Yonghui ^{[1
]}

Beaufays, Francoise ^{[1
]}

Motta, Giovanni ^{[1
]}

机构：

[1] Google LLC, Mountain View, CA 94043 USA

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

关键词：

federated learning; speech recognition; federated dropout;

D O I：

10.1109/ICASSP43922.2022.9746226

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default. This presents a challenge pertaining to the communication and computation costs associated with clients' devices. These costs are strongly correlated with the size of the model being trained, and are significant for state-of-the-art automatic speech recognition models. We propose using federated dropout to reduce the size of client models while training a full-size model server-side. We provide empirical evidence of the effectiveness of federated dropout, and propose a novel approach to vary the dropout rate applied at each layer. Furthermore, we find that federated dropout enables a set of smaller sub-models within the larger model to independently have low word error rates, making it easier to dynamically adjust the size of the model deployed for inference.

引用

页码：8757 / 8761

页数：5

共 50 条

[31] Kaldi-web: An installation-free, on-device speech recognition system
Hu, Mathieu
Pierron, Laurent
Vincent, Emmanuel
Jouvet, Denis
INTERSPEECH 2020, 2020, : 484 - 485
[32] FedNST: Federated Noisy Student Training for Automatic Speech Recognition
Mehmood, Haaris
Dobrowolska, Agnieszka
Saravanan, Karthikeyan
Ozay, Mete
INTERSPEECH 2022, 2022, : 1001 - 1005
[33] Enabling On-Device LLMs Personalization with Smartphone Sensing
Zhang, Shiquan
Ma, Ying
Fang, Le
Jia, Hong
D'Alfonso, Simon
Kostakos, Vassilis
COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 186 - 190
[34] Streaming Parrotron for on-device speech-to-speech conversion
Rybakov, Oleg
Biadsy, Fadi
Zhang, Xia
Jiang, Liyang
Meadowlark, Phoenix
Agrawal, Shivani
INTERSPEECH 2023, 2023, : 2033 - 2037
[35] Sub-8-Bit Quantization Aware Training for 8-Bit Neural Network Accelerator with On-Device Speech Recognition
Zhen, Kai
Nguyen, Hieu Duy
Chinta, Raviteja
Susanj, Nathan
Mouchtaris, Athanasios
Afzal, Tariq
Rastrow, Ariya
INTERSPEECH 2022, 2022, : 3033 - 3037
[36] Hiding in the Crowd: Federated Data Augmentation for On-Device Learning
Jeong, Eunjeong
Oh, Seungeun
Park, Jihong
Kim, Hyesung
Bennis, Mehdi
Kim, Seong-Lyun
IEEE INTELLIGENT SYSTEMS, 2021, 36 (05) : 80 - 86
[37] Personalized Human Activity Recognition: Real-Time On-Device Training and Inference
Saha, Bidyut
Samanta, Riya
Roy, Ram Babu
Chakraborty, Chinmay
Ghosh, Soumya K.
IEEE CONSUMER ELECTRONICS MAGAZINE, 2025, 14 (02) : 84 - 89
[38] ON-DEVICE END-TO-END SPEECH RECOGNITION WITH MULTI-STEP PARALLEL RNNS
Boo, Yoonho
Park, Jinhwan
Lee, Lukas
Sung, Wonyong
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 376 - 381
[39] Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Ding, Shaojin
Rikhye, Rajeev
Liang, Qiao
He, Yanzhang
Wang, Quan
Narayanan, Arun
O'Malley, Tom
McGraw, Ian
INTERSPEECH 2022, 2022, : 3744 - 3748
[40] Enabling On-Device CNN Training by Self-Supervised Instance Filtering and Error Map Pruning
Wu, Yawen
Wang, Zhepeng
Shi, Yiyu
Hu, Jingtong
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (11) : 3445 - 3457

← 1 2 3 4 5 →