Federated Reinforcement Learning with Environment Heterogeneity

被引:0
|
作者
Jin, Hao [1 ]
Peng, Yang [1 ]
Yang, Wenhao [1 ]
Wang, Shusen [2 ]
Zhang, Zhihua [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Xiaohongshu Inc, Shanghai, Peoples R China
基金
北京市自然科学基金;
关键词
GAME; GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study a Federated Reinforcement Learning (FedRL) problem in which n agents collaboratively learn a single policy without sharing the trajectories they collected during agent-environment interaction. We stress the constraint of environment heterogeneity, which means n environments corresponding to these n agents have different state transitions. To obtain a value function or a policy function which optimizes the overall performance in all environments, we propose two federated RL algorithms, QAvg and PAvg. We theoretically prove that these algorithms converge to suboptimal solutions, while such sub-optimality depends on how heterogeneous these n environments are. Moreover, we propose a heuristic that achieves personalization by embedding the n environments into n vectors. The personalization heuristic not only improves the training but also allows for better generalization to new environments.
引用
收藏
页码:18 / 37
页数:20
相关论文
共 50 条
  • [41] Manipulator Control using Federated Deep Reinforcement Learning
    Shivkumar, S.
    Kumaar, A. A. Nippun
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [42] Meta Federated Reinforcement Learning for Distributed Resource Allocation
    Ji, Zelin
    Qin, Zhijin
    Tao, Xiaoming
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (07) : 7865 - 7876
  • [43] Federated Offline Reinforcement Learning with Proximal Policy Evaluation
    Yue, Sheng
    Deng, Yongheng
    Wang, Guanbo
    Ren, Ju
    Zhang, Yaoxue
    CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (06) : 1360 - 1372
  • [44] Federated Learning Under Statistical Heterogeneity on Riemannian Manifolds
    Ahmad, Adnan
    Luo, Wei
    Robles-Kelly, Antonio
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT I, 2023, 13935 : 380 - 392
  • [45] On the Tradeoff Between Heterogeneity and Communication Complexity in Federated Learning
    Sinha, Priyanka
    Kibilda, Jacek
    Saad, Walid
    FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF, 2023, : 115 - 121
  • [46] A Flexible Distributed Building Simulator for Federated Reinforcement Learning
    Fujimura, Shugo
    Fujita, Koki
    Sun, Yuwei
    Esaki, Hiroshi
    Ochiai, Hideya
    2022 IEEE INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS (IEEE COINS 2022), 2022, : 159 - 164
  • [47] FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning
    Zhang, Junyuan
    Zhang, Shuang
    Zhang, Miao
    Wang, Runxi
    Wang, Feifei
    Zhou, Yuyin
    Liang, Paul Pu
    Qi, Liangqiong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 12098 - 12108
  • [48] Federated Learning for Data and Model Heterogeneity in Medical Imaging
    Madni, Hussain Ahmad
    Umer, Rao Muhammad
    Foresti, Gian Luca
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT II, 2024, 14366 : 167 - 178
  • [49] FedOps: A Platform of Federated Learning Operations With Heterogeneity Management
    Moon, Jihwan
    Yang, Semo
    Lee, Kangyoon
    IEEE ACCESS, 2024, 12 : 4301 - 4314
  • [50] Federated Learning with complete service commitment of data heterogeneity
    Zhou, Yizhi
    Wang, Junxiao
    Qin, Yuchen
    Kong, Xiangyu
    Xie, Xin
    Qi, Heng
    Zeng, Deze
    KNOWLEDGE-BASED SYSTEMS, 2025, 310