ENABLING ON-DEVICE TRAINING OF SPEECH RECOGNITION MODELS WITH FEDERATED DROPOUT

被引：2

作者：

Guliani, Dhruv ^{[1
]}

Zhou, Lillian ^{[1
]}

Ryu, Changwan ^{[1
]}

Yang, Tien-Ju ^{[1
]}

Zhang, Harry ^{[1
]}

Xiao, Yonghui ^{[1
]}

Beaufays, Francoise ^{[1
]}

Motta, Giovanni ^{[1
]}

机构：

[1] Google LLC, Mountain View, CA 94043 USA

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

关键词：

federated learning; speech recognition; federated dropout;

D O I：

10.1109/ICASSP43922.2022.9746226

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default. This presents a challenge pertaining to the communication and computation costs associated with clients' devices. These costs are strongly correlated with the size of the model being trained, and are significant for state-of-the-art automatic speech recognition models. We propose using federated dropout to reduce the size of client models while training a full-size model server-side. We provide empirical evidence of the effectiveness of federated dropout, and propose a novel approach to vary the dropout rate applied at each layer. Furthermore, we find that federated dropout enables a set of smaller sub-models within the larger model to independently have low word error rates, making it easier to dynamically adjust the size of the model deployed for inference.

引用

页码：8757 / 8761

页数：5

共 50 条

[21] Federated selective aggregation for on-device knowledge amalgamation
Xie, Donglin
Yu, Ruonan
Fang, Gongfan
Han, Jiaqi
Song, Jie
Feng, Zunlei
Sun, Li
Song, Mingli
CHIP, 2023, 2 (03):
[22] On-device diagnostic recommendation with heterogeneous federated BlockNets
Minh Hieu NGUYEN
Thanh Trung HUYNH
Thanh Toan NGUYEN
Phi Le NGUYEN
Hien Thu PHAM
Jun JO
Thanh Tam NGUYEN
Science China(Information Sciences), 2025, 68 (04) : 33 - 49
[23] MACRO-BLOCK DROPOUT FOR IMPROVED REGULARIZATION IN TRAINING END-TO-END SPEECH RECOGNITION MODELS
Kim, Chanwoo
Indurti, Sathish
Park, Jinhwan
Sung, Wonyong
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 331 - 338
[24] Adaptive on-device location recognition
Laasonen, K
Raento, M
Toivonen, H
PERVASIVE COMPUTING, PROCEEDINGS, 2004, 3001 : 287 - 304
[25] Energy-Efficient Target Recognition using ReRAM Crossbars for Enabling On-Device Intelligence
Sanyal, Sourav
Ankit, Aayush
Vineyard, Craig M.
Roy, Kaushik
2020 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2020, : 187 - 192
[26] FAST CONTEXTUAL ADAPTATION WITH NEURAL ASSOCIATIVE MEMORY FOR ON-DEVICE PERSONALIZED SPEECH RECOGNITION
Munkhdalai, Tsendsuren
Sim, Khe Chai
Chandorkar, Angad
Gao, Fan
Chua, Mason
Strohman, Trevor
Beaufays, Francoise
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6632 - 6636
[27] VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
Wang, Quan
Moreno, Ignacio Lopez
Saglam, Mert
Wilson, Kevin
Chiao, Alan
Liu, Renjie
He, Yanzhang
Li, Wei
Pelecanos, Jason
Nika, Marily
Gruenstein, Alexander
INTERSPEECH 2020, 2020, : 2677 - 2681
[28] On-device Streaming Transformer-based End-to-End Speech Recognition
Oh, Yoo Rhee
Park, Kiyoung
INTERSPEECH 2021, 2021, : 967 - 968
[29] Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Rathod, Jash
Dawalatabad, Nauman
Singh, Shatrughan
Gowda, Dhananjaya
INTERSPEECH 2022, 2022, : 1691 - 1695
[30] STREAMING, FAST AND ACCURATE ON-DEVICE INVERSE TEXT NORMALIZATION FOR AUTOMATIC SPEECH RECOGNITION
Gaur, Yashesh
Kibre, Nick
Xue, Jian
Shu, Kangyuan
Wang, Yuhui
Alphanso, Issac
Li, Jinyu
Gong, Yifan
2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 237 - 244

← 1 2 3 4 5 →