MULTI-MODEL FEDERATED LEARNING OPTIMIZATION BASED ON MULTI-AGENT REINFORCEMENT LEARNING

Cited by: 0
Authors
Atapour, S. Kaveh [1]
Seyedmohammadi, S. Jamal [2]
Sheikholeslami, S. Mohammad [3]
Abouei, Jamshid [4]
Mohammadi, Arash [2]
Plataniotis, Konstantinos N. [3]
Affiliations
[1] Tarbiat Modares Univ, Dept Comp & Elect Engn, Tehran, Iran
[2] Concordia Inst Informat Syst Engn CIISE, Montreal, PQ, Canada
[3] Univ Toronto, Edward S Rogers Sr Dept Elect Comp Engn, Toronto, ON, Canada
[4] Yazd Univ, Dept Elect Engn, Yazd, Iran
Source
2023 IEEE 9TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING, CAMSAP | 2023
Keywords
Multi-Model Federated Learning; MDP; Reinforcement Learning; Team-Q algorithm; Cooperative Multi-Agents; MODEL;
DOI
10.1109/CAMSAP58249.2023.10403421
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Subject classification codes
081203; 0835;
Abstract
This paper addresses the problem of Multi-Model Federated Learning (MMFL) in a typical wireless network, where a cellular Base Station (BS) cooperates with multiple clients to simultaneously train several Machine Learning (ML) models. Accordingly, the objective of this paper is to make an efficient joint decision for client association and communication-computation resource allocation to optimize the performance of the MMFL algorithm. In this regard, an optimization problem is formulated to minimize the average global loss of the ML models under clients' energy and delay constraints. It is shown that the problem is a mixed-integer optimization whose objective is implicit in terms of the decision variables. To solve the optimization problem, we propose a Multi-Agent Multi-Model Federated Learning (MAMMFL) scheme based on a cooperative multi-agent configuration to intelligently assign models and resources to clients. Specifically, the problem is first converted to a Markov Decision Process (MDP) problem and then divided into four sub-MDP problems, each of which corresponds to a phase in MMFL. A reinforcement learning algorithm solves each subproblem, and a team-Q algorithm is adopted to coordinate agents in a cooperative multi-agent setting. Simulation results show that the proposed method can outperform the baselines in terms of average global loss and resource consumption.
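As a rough illustration of the team-Q idea named in the abstract, the sketch below runs tabular team-Q learning on a toy single-state cooperative game. It is not the authors' MAMMFL scheme: the function names, the reward `toy_reward`, and all hyperparameters are hypothetical and chosen only to show how cooperating agents can share one Q-table over joint actions and a common reward.

```python
import itertools
import random


def toy_reward(joint_action):
    """Hypothetical shared team reward: 1 only if every agent picks action 1."""
    return 1.0 if all(a == 1 for a in joint_action) else 0.0


def greedy_joint_action(q):
    """Each agent would play its own component of this argmax joint action."""
    return max(q, key=q.get)


def team_q_learning(n_agents, n_actions, reward_fn, episodes=2000,
                    alpha=0.1, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular team-Q on a single-state repeated game (illustrative only)."""
    rng = random.Random(seed)
    joint_actions = list(itertools.product(range(n_actions), repeat=n_agents))
    # One shared Q-value per joint action; the single state is left implicit.
    q = {a: 0.0 for a in joint_actions}
    for _ in range(episodes):
        # Epsilon-greedy exploration over joint actions.
        if rng.random() < epsilon:
            action = rng.choice(joint_actions)
        else:
            action = greedy_joint_action(q)
        reward = reward_fn(action)
        # Team-Q target: common reward plus discounted best joint value.
        best_next = max(q.values())
        q[action] += alpha * (reward + gamma * best_next - q[action])
    return q


if __name__ == "__main__":
    q_table = team_q_learning(n_agents=2, n_actions=2, reward_fn=toy_reward)
    print(greedy_joint_action(q_table))
```

Because every agent sees the same reward and the same table, the agents agree on the greedy joint action without negotiation. The joint-action table grows exponentially with the number of agents, which is one reason a large problem might be split into smaller sub-problems.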
Pages: 151 - 155
Number of pages: 5
Related papers
50 in total
  • [41] Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning
    Wang, Xin
    Zhao, Chen
    Huang, Tingwen
    Chakrabarti, Prasun
    Kurths, Juergen
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 13 - 23
  • [42] Learning to Communicate for Mobile Sensing with Multi-agent Reinforcement Learning
    Zhang, Bolei
    Liu, Junliang
    Xiao, Fu
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT II, 2021, 12938 : 612 - 623
  • [43] Learning of Communication Codes in Multi-Agent Reinforcement Learning Problem
    Kasai, Tatsuya
    Tenmoto, Hiroshi
    Kamiya, Akimoto
    2008 IEEE CONFERENCE ON SOFT COMPUTING IN INDUSTRIAL APPLICATIONS SMCIA/08, 2009, : 1 - +
  • [44] Reinforcement learning of multi-agent communicative acts
    Hoet S.
    Sabouret N.
    Revue d'Intelligence Artificielle, 2010, 24 (02) : 159 - 188
  • [45] Multi-agent Reinforcement Learning for Service Composition
    Lei, Yu
    Yu, Philip S.
    PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2016), 2016, : 790 - 793
  • [46] Multi-agent reinforcement learning with cooperation based on eligibility traces
    杨玉君
    程君实
    陈佳品
    Journal of Harbin Institute of Technology(New series), 2004, (05) : 564 - 568
  • [47] Multi-agent reinforcement learning based on policies of global objective
    张化祥
    黄上腾
    Journal of Systems Engineering and Electronics, 2005, (03) : 676 - 681
  • [48] Cooperative targets assignment based on multi-agent reinforcement learning
    Ma Y.
    Wu L.
    Xu X.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (09) : 2793 - 2801
  • [49] Storing Multi-model Data in RDBMSs based on Reinforcement Learning
    Yuan, Gongsheng
    Lu, Jiaheng
    Zhang, Shuxun
    Yan, Zhengtong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3608 - 3611
  • [50] A Multi-Agent Reinforcement Learning Based Control Method for CAVs in a Mixed Platoon
    Xu, Yaqi
    Shi, Yan
    Tong, Xiaolu
    Chen, Shanzhi
    Ge, Yuming
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (11) : 16160 - 16172