Integrating Reinforcement Learning with Multi-Agent Techniques for Adaptive Service Composition

被引：34

作者：

Wang, Hongbign ^{[1
,2
]}

Chen, Xin ^{[1
,2
]}

Wu, Qin ^{[1
,2
]}

Yu, Qi ^{[3
]}

Hu, Xingguo ^{[1
,2
]}

Zheng, Zibin ^{[4
,5
]}

Bouguettaya, Athman ^{[6
]}

机构：

[1] Southeast Univ, Sch Comp Sci & Engn, SIPAILOU 2, Nanjing 210096, Jiangsu, Peoples R China

[2] Southeast Univ, Key Lab Comp Network & Informat Integrat, SIPAILOU 2, Nanjing 210096, Jiangsu, Peoples R China

[3] Rochester Inst Tech, Coll Comp & Informat Sci, Rochester, NY USA

[4] Chinese Univ Hong Kong, Shenzhen Res Inst, HSB 101, Shatin, Hong Kong, Peoples R China

[5] Chinese Univ Hong Kong, Dept Comp Sci & Engn, HSB 101, Shatin, Hong Kong, Peoples R China

[6] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia

来源：

ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS | 2017年 / 12卷 / 02期

基金：

澳大利亚研究理事会;

关键词：

Service composition; reinforcement learning; multi-agent system; game theory; ADAPTATION;

D O I：

10.1145/3058592

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Service-oriented architecture is a widely used software engineering paradigm to cope with complexity and dynamics in enterprise applications. Service composition, which provides a cost-effective way to implement software systems, has attracted significant attention from both industry and research communities. As online services may keep evolving over time and thus lead to a highly dynamic environment, service composition must be self-adaptive to tackle uninformed behavior during the evolution of services. In addition, service composition should also maintain high efficiency for large-scale services, which are common for enterprise applications. This article presents a new model for large-scale adaptive service composition based on multi-agent reinforcement learning. The model integrates reinforcement learning and game theory, where the former is to achieve adaptation in a highly dynamic environment and the latter is to enable agents to work for a common task (i.e., composition). In particular, we propose a multi-agent Q-learning algorithm for service composition, which is expected to achieve better performance when compared with the single-agent Q-learning method and multi-agent SARSA (State-Action-Reward-State-Action) method. Our experimental results demonstrate the effectiveness and efficiency of our approach.

引用

页数：42

共 50 条

[31] Modelling Stock Markets by Multi-agent Reinforcement Learning
Lussange, Johann
Lazarevich, Ivan
Bourgeois-Gironde, Sacha
Palminteri, Stefano
Gutkin, Boris
COMPUTATIONAL ECONOMICS, 2021, 57 (01) : 113 - 147
[32] A reinforcement learning scheme for a multi-agent card game
Fujita, H
Matsuno, Y
Ishii, S
2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4071 - 4078
[33] Rationality of reward sharing in multi-agent reinforcement learning
Kazuteru Miyazaki
Shigenobu Kobayashi
New Generation Computing, 2001, 19 : 157 - 172
[34] A multi-agent reinforcement learning approach to robot soccer
Duan, Yong
Cui, Bao Xia
Xu, Xin He
ARTIFICIAL INTELLIGENCE REVIEW, 2012, 38 (03) : 193 - 211
[35] Robust multi-agent reinforcement learning for noisy environments
Xinning Chen
Xuan Liu
Canhui Luo
Jiangjin Yin
Peer-to-Peer Networking and Applications, 2022, 15 : 1045 - 1056
[36] Modelling Stock Markets by Multi-agent Reinforcement Learning
Johann Lussange
Ivan Lazarevich
Sacha Bourgeois-Gironde
Stefano Palminteri
Boris Gutkin
Computational Economics, 2021, 57 : 113 - 147
[37] An overview: Attention mechanisms in multi-agent reinforcement learning
Hu, Kai
Xu, Keer
Xia, Qingfeng
Li, Mingyang
Song, Zhiqiang
Song, Lipeng
Sun, Ning
NEUROCOMPUTING, 2024, 598
[38] Multi-Agent Reinforcement Learning With Decentralized Distribution Correction
Li, Kuo
Jia, Qing-Shan
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 1684 - 1696
[39] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
Malysheva, Aleksandra
Kudenko, Daniel
Shpilman, Aleksei
2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
[40] Multi-Agent Reinforcement Learning for Adaptive User Association in Dynamic mmWave Networks
Sana, Mohamed
De Domenico, Antonio
Yu, Wei
Lostanlen, Yves
Calvanese Strinati, Emilio
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (10) : 6520 - 6534

← 1 2 3 4 5 →