Multi-task coalition parallel formation strategy based on reinforcement learning

被引:0
作者
Department of Computer and Information Science, Hefei University of Technology, Hefei 230009, China [1 ]
不详 [2 ]
机构
[1] Department of Computer and Information Science, Hefei University of Technology
[2] Engineering Research Center of Safety Critical Industrial Measurement and Control Technology
来源
Zidonghua Xuebao | 2008年 / 3卷 / 349-352期
基金
中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
Markov decision process; Multi-task coalition; Parallel formation; Reinforcement learning;
D O I
10.3724/SP.J.1004.2008.00349
中图分类号
学科分类号
摘要
Agent coalition is an important manner of agents' coordination and cooperation. Forming a coalition, agents can enhance their ability to solve problems and obtain more utilities. In this paper, a novel multi-task coalition parallel formation strategy is presented, and the conclusion that the process of multi-task coalition formation is a Markov decision process is testified theoretically. Moreover, reinforcement learning is used to solve agents' behavior strategy, and the process of multi-task coalition parallel formation is described. In multi-task oriented domains, the strategy can effectively and parallel form multi-task coalitions.
引用
收藏
页码:349 / 352
页数:3
相关论文
共 50 条
[21]   Genetically-regulated Neuromodulation Facilitates Multi-Task Reinforcement Learning [J].
Cussat-Blanc, Sylvain ;
Harrington, Kyle I. S. .
GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2015, :551-558
[22]   A reinforcement learning assisted evolutionary algorithm for constrained multi-task optimization [J].
Yang, Yufei ;
Zhang, Changsheng ;
Zhang, Bin ;
Ning, Jiaxu .
INFORMATION SCIENCES, 2024, 678
[23]   A multi-task learning model with reinforcement optimization for ASD comorbidity discrimination [J].
Dong, Heyou ;
Chen, Dan ;
Chen, Yukang ;
Tang, Yunbo ;
Yin, Dingze ;
Li, Xiaoli .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2024, 243
[24]   Multi-task end-edge offloading based on Lyapunov optimization and deep reinforcement learning [J].
Xu C. ;
Tang Z.-X. ;
Jin X. ;
Xia C.-Q. .
Kongzhi yu Juece/Control and Decision, 2024, 39 (07) :2457-2464
[25]   Click is not equal to purchase: multi-task reinforcement learning for multi-behavior recommendation [J].
Huiwang Zhang ;
Pengpeng Zhao ;
Xuefeng Xian ;
Victor S. Sheng ;
Yongjing Hao ;
Zhiming Cui .
World Wide Web, 2023, 26 :4153-4172
[26]   Click is Not Equal to Purchase: Multi-task Reinforcement Learning for Multi-behavior Recommendation [J].
Zhang, Huiwang ;
Zhao, Pengpeng ;
Xian, Xuefeng ;
Sheng, Victor S. ;
Hao, Yongjing ;
Cui, Zhiming .
WEB INFORMATION SYSTEMS ENGINEERING - WISE 2022, 2022, 13724 :443-459
[27]   Click is not equal to purchase: multi-task reinforcement learning for multi-behavior recommendation [J].
Zhang, Huiwang ;
Zhao, Pengpeng ;
Xian, Xuefeng ;
Sheng, Victor S. ;
Hao, Yongjing ;
Cui, Zhiming .
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (06) :4153-4172
[28]   Multi-Task Multi-Agent Reinforcement Learning With Task-Entity Transformers and Value Decomposition Training [J].
Zhu, Yuanheng ;
Huang, Shangjing ;
Zuo, Binbin ;
Zhao, Dongbin ;
Sun, Changyin .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 :9164-9177
[29]   Evolving hierarchical memory-prediction machines in multi-task reinforcement learning [J].
Kelly, Stephen ;
Voegerl, Tatiana ;
Banzhaf, Wolfgang ;
Gondro, Cedric .
GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2021, 22 (04) :573-605
[30]   Evolving hierarchical memory-prediction machines in multi-task reinforcement learning [J].
Stephen Kelly ;
Tatiana Voegerl ;
Wolfgang Banzhaf ;
Cedric Gondro .
Genetic Programming and Evolvable Machines, 2021, 22 :573-605