A Joint Communication and Learning Framework for Hierarchical Split Federated Learning

被引:16
作者
Khan, Latif U. [1 ]
Guizani, Mohsen [1 ]
Al-Fuqaha, Ala [2 ]
Hong, Choong Seon [3 ]
Niyato, Dusit [4 ]
Han, Zhu [3 ,5 ,6 ]
机构
[1] Mohamed Bin Zayed Univ Artificial Intelligence, Machine Learning Dept, Abu Dhabi, U Arab Emirates
[2] Hamad Bin Khalifa Univ, Coll Engn & Appl Sci, Comp Sci Dept, Doha, Qatar
[3] Kyung Hee Univ, Dept Comp Sci & Engn, Yongin 17104, South Korea
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[5] Univ Houston, Elect & Comp Engn Dept, Houston, TX 77004 USA
[6] Univ Houston, Comp Sci Dept, Houston, TX 77004 USA
关键词
Federated learning (FL); hierarchical FL; Internet of Things (IoT); split learning; NETWORKS;
D O I
10.1109/JIOT.2023.3315673
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In contrast to methods relying on a centralized training, emerging Internet of Things (IoT) applications can employ federated learning (FL) to train a variety of models for performance improvement and improved privacy preservation. FL calls for the distributed training of local models at end-devices, which uses a lot of processing power (i.e., CPU cycles/sec). Most end-devices have computing power limitations, such as IoT temperature sensors. One solution for this problem is split FL. However, split FL has its problems, including a single point of failure, issues with fairness, and a poor convergence rate. We provide a novel framework, called hierarchical split FL (HSFL), to overcome these issues. On grouping, our HSFL framework is built. Partial models are constructed within each group at the devices, with the remaining work done at the edge servers. Each group then performs local aggregation at the edge following the computation of local models. End devices are given access to such an edge aggregated model so they can update their models. For each group, a unique edge aggregated HSFL model is produced by this procedure after a set number of rounds. Shared among edge servers, these edge aggregated HSFL models are then aggregated to produce a global model. Additionally, we propose an optimization problem that takes into account the relative local accuracy (RLA) of devices, transmission latency, transmission energy, and edge servers' compute latency in order to reduce the cost of HSFL. The formulated problem is a mixed-integer nonlinear programming (MINLP) problem and cannot be solved easily. To tackle this challenge, we perform decomposition of the formulated problem to yield subproblems. These subproblems are edge computing resource allocation problem and joint RLA minimization, wireless resource allocation, task offloading, and transmit power allocation subproblem. Due to the convex nature of edge computing, resource allocation is done so utilizing a convex optimizer, as opposed to a block successive upper-bound minimization (BSUM)-based approach for joint RLA minimization, resource allocation, job offloading, and transmit power allocation. Finally, we present the performance evaluation findings for the proposed HSFL scheme.
引用
收藏
页码:268 / 282
页数:15
相关论文
共 37 条
[1]  
Abad MSH, 2020, INT CONF ACOUST SPEE, P8866, DOI [10.1109/icassp40776.2020.9054634, 10.1109/ICASSP40776.2020.9054634]
[2]   Machine Learning Meets Communication Networks: Current Trends and Future Challenges [J].
Ahmad, Ijaz ;
Shahabuddin, Shariar ;
Malik, Hassan ;
Harjula, Erkki ;
Leppanen, Teemu ;
Loven, Lauri ;
Anttonen, Antti ;
Sodhro, Ali Hassan ;
Mahtab Alam, Muhammad ;
Juntti, Markku ;
Yla-Jaaski, Antti ;
Sauter, Thilo ;
Gurtov, Andrei ;
Ylianttila, Mika ;
Riekki, Jukka .
IEEE ACCESS, 2020, 8 :223418-223460
[3]  
Ali S, 2020, Arxiv, DOI arXiv:2004.13875
[4]  
[Anonymous], 2021, EVOLVED UNIVERSAL TE
[5]  
[Anonymous], 2022, Hessian matrix
[6]  
Boyd S., 2004, Convex Optimization
[7]   A Joint Learning and Communications Framework for Federated Learning Over Wireless Networks [J].
Chen, Mingzhe ;
Yang, Zhaohui ;
Saad, Walid ;
Yin, Changchuan ;
Poor, H. Vincent ;
Cui, Shuguang .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2021, 20 (01) :269-283
[8]   Using Machine Learning in Communication Networks [Invited] [J].
Cote, David .
JOURNAL OF OPTICAL COMMUNICATIONS AND NETWORKING, 2018, 10 (10) :D100-D109
[9]   6G VISION AND REQUIREMENTS [J].
David, Klaus ;
Berndt, Hendrik .
IEEE VEHICULAR TECHNOLOGY MAGAZINE, 2018, 13 (03) :72-80
[10]  
Han Z, 2017, SIGNAL PROCESSING AND NETWORKING FOR BIG DATA APPLICATIONS, P1, DOI 10.1017/9781316408032