FEDERATED SELF-TRAINING FOR DATA-EFFICIENT AUDIO RECOGNITION

被引：1

作者：

Tsouvalas, Vasileios ^{[1
]}

Saeed, Aaqib ^{[2
]}

Ozcelebi, Tanir ^{[1
]}

机构：

[1] Eindhoven Univ Technol, Eindhoven, Netherlands

[2] Philips Res, Eindhoven, Netherlands

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

关键词：

federated learning; semi-supervised learning; deep learning; audio classification; sound recognition;

D O I：

10.1109/ICASSP43922.2022.9746356

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Federated learning is a distributed machine learning paradigm dealing with decentralized and personal datasets. Since data reside on devices like smartphones, labeling is entrusted to the clients or labels are extracted in an automated way. Specifically, in the case of audio data, acquiring semantic annotations can be prohibitively expensive and time-consuming. As a result, an abundance of audio data remains unlabeled and unexploited on users' devices. Existing federated learning approaches largely focus on supervised learning without harnessing the unlabeled data. Here, we study the problem of semi-supervised learning of audio models in conjunction with federated learning. We propose FedSTAR, a self-training approach to exploit large-scale on-device unlabeled data to improve the generalization of audio recognition models. We conduct experiments on diverse public audio classification datasets and investigate the performance of our models under varying percentages of labeled data and show that with as little as 3% labeled data, FedSTAR on average can improve the recognition rate by 13.28% compared to the fully-supervised federated model.

引用

页码：476 / 480

页数：5

共 25 条

[1]

Arazo E., 2020, IEEE IJCNN, P1, DOI 10.1109/IJCNN48605.2020.9207304

[2]

Beutel DJ, 2022, Arxiv, DOI arXiv:2007.14390

[3] Contactless cardiac arrest detection using smart devices [J].

Chan, Justin ;

Rea, Thomas ;

Gollakota, Shyamnath ;

Sunshine, Jacob E. .

NPJ DIGITAL MEDICINE, 2019, 2

[4]

Hard A., 2020, TRAINING KEYWORD SPO

[5]

Hard A., 2018, ARXIV PREPRINT ARXIV

[6]

Hinton Geoffrey, 2015, DISTILLING KNOWLEDGE

[7]

Hosseini Hossein, 2020, FEDERATED LEARNING U

[8]

Jeong Wonyong, 2021, P 9 INT C LEARN REPR

[9]

Jin Y., 2020, ARXIV200211545

[10]

Lee Dong-Hyun, 2013, WORKSHOP CHALLENGES

← 1 2 3 →