Optimal Transport-Based One-Shot Federated Learning for Artificial Intelligence of Things

被引:5
作者
Chiang, Yi-Han [1 ]
Terai, Koudai [1 ]
Chiang, Tsung-Wei [2 ]
Lin, Hai [1 ]
Ji, Yusheng [3 ]
Lui, John C. S. [4 ]
机构
[1] Osaka Metropolitan Univ, Dept Elect & Elect Syst Engn, Osaka 5998531, Japan
[2] MediaTek Inc, Hsinchu, Taiwan
[3] Natl Inst Informat, Informat Syst Architecture Sci Res Div, Tokyo 1018430, Japan
[4] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
关键词
Artificial Intelligence of Things (AIoT); federated learning (FL); one-shot learning; optimal transport (OT); NETWORKS; OPTIMIZATION;
D O I
10.1109/JIOT.2023.3293230
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Federated learning (FL) is an emerging distributed machine learning (ML) paradigm in the Artificial Intelligence of Things (AIoT). FL enables AIoT devices to collaboratively train an ML model on the network edge, while protecting data privacy and solving the problem of isolated data islands. Contemporary FL is typically realized through the model aggregation of locally trained models and the model dissemination of a globally averaged model; such a procedure iteratively proceeds until a predefined convergence criterion is met. However, FL necessitates frequent information exchanges between AIoT devices and a parameter server, which inevitably induces tremendous communication costs. Therefore, this article proposes a new design for efficient one-shot FL for AIoT systems, so that the model aggregation and dissemination can be completed within a single communication round. To this end, we leverage optimal transport (OT) theory to design the coupled model averaging (CODE) algorithm to fuse the model weights of the neural networks (NNs) on AIoT devices. The CODE algorithm initially performs OT-based layer-by-layer model averaging (MA) over two NNs to form a fused NN, which will then be averaged with another NN. The CODE algorithm progressively determines a pair of NNs, and continues until all NNs have been examined to achieve one-shot FL. In addition, we provide a detailed convergence analysis for the proposed solution. Our simulation results show that the proposed solution outperforms other one-shot MA mechanisms under various parameter settings.
引用
收藏
页码:2166 / 2180
页数:15
相关论文
共 64 条
[1]  
Altschuler J, 2017, ADV NEUR IN, V30
[2]  
Andoni A, 2008, PROCEEDINGS OF THE NINETEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P343
[3]  
[Anonymous], 2023, Cisco global cloud index: forecast and methodology, 2016-2021
[4]  
Bagdasaryan E, 2020, PR MACH LEARN RES, V108, P2938
[5]  
Bhagoji AN, 2019, PR MACH LEARN RES, V97
[6]   A Survey of Recent Advances in Edge-Computing-Powered Artificial Intelligence of Things [J].
Chang, Zhuoqing ;
Liu, Shubo ;
Xiong, Xingxing ;
Cai, Zhaohui ;
Tu, Guoqing .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (18) :13849-13875
[7]   A Joint Learning and Communications Framework for Federated Learning Over Wireless Networks [J].
Chen, Mingzhe ;
Yang, Zhaohui ;
Saad, Walid ;
Yin, Changchuan ;
Poor, H. Vincent ;
Cui, Shuguang .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (01) :269-283
[8]   Semisupervised Distributed Learning With Non-IID Data for AIoT Service Platform [J].
Chiu, Te-Chuan ;
Shih, Yuan-Yao ;
Pang, Ai-Chun ;
Wang, Chieh-Sheng ;
Weng, Wei ;
Chou, Chun-Ting .
IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10) :9266-9277
[9]  
Chizat L, 2018, ADV NEUR IN, V31
[10]  
Courty N, 2017, ADV NEUR IN, V30