Integrating Reinforcement Learning with Multi-Agent Techniques for Adaptive Service Composition

被引:34
|
作者
Wang, Hongbign [1 ,2 ]
Chen, Xin [1 ,2 ]
Wu, Qin [1 ,2 ]
Yu, Qi [3 ]
Hu, Xingguo [1 ,2 ]
Zheng, Zibin [4 ,5 ]
Bouguettaya, Athman [6 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, SIPAILOU 2, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Key Lab Comp Network & Informat Integrat, SIPAILOU 2, Nanjing 210096, Jiangsu, Peoples R China
[3] Rochester Inst Tech, Coll Comp & Informat Sci, Rochester, NY USA
[4] Chinese Univ Hong Kong, Shenzhen Res Inst, HSB 101, Shatin, Hong Kong, Peoples R China
[5] Chinese Univ Hong Kong, Dept Comp Sci & Engn, HSB 101, Shatin, Hong Kong, Peoples R China
[6] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
基金
澳大利亚研究理事会;
关键词
Service composition; reinforcement learning; multi-agent system; game theory; ADAPTATION;
D O I
10.1145/3058592
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Service-oriented architecture is a widely used software engineering paradigm to cope with complexity and dynamics in enterprise applications. Service composition, which provides a cost-effective way to implement software systems, has attracted significant attention from both industry and research communities. As online services may keep evolving over time and thus lead to a highly dynamic environment, service composition must be self-adaptive to tackle uninformed behavior during the evolution of services. In addition, service composition should also maintain high efficiency for large-scale services, which are common for enterprise applications. This article presents a new model for large-scale adaptive service composition based on multi-agent reinforcement learning. The model integrates reinforcement learning and game theory, where the former is to achieve adaptation in a highly dynamic environment and the latter is to enable agents to work for a common task (i.e., composition). In particular, we propose a multi-agent Q-learning algorithm for service composition, which is expected to achieve better performance when compared with the single-agent Q-learning method and multi-agent SARSA (State-Action-Reward-State-Action) method. Our experimental results demonstrate the effectiveness and efficiency of our approach.
引用
收藏
页数:42
相关论文
共 50 条
  • [31] Modelling Stock Markets by Multi-agent Reinforcement Learning
    Lussange, Johann
    Lazarevich, Ivan
    Bourgeois-Gironde, Sacha
    Palminteri, Stefano
    Gutkin, Boris
    COMPUTATIONAL ECONOMICS, 2021, 57 (01) : 113 - 147
  • [32] A reinforcement learning scheme for a multi-agent card game
    Fujita, H
    Matsuno, Y
    Ishii, S
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4071 - 4078
  • [33] Rationality of reward sharing in multi-agent reinforcement learning
    Kazuteru Miyazaki
    Shigenobu Kobayashi
    New Generation Computing, 2001, 19 : 157 - 172
  • [34] A multi-agent reinforcement learning approach to robot soccer
    Duan, Yong
    Cui, Bao Xia
    Xu, Xin He
    ARTIFICIAL INTELLIGENCE REVIEW, 2012, 38 (03) : 193 - 211
  • [35] Robust multi-agent reinforcement learning for noisy environments
    Xinning Chen
    Xuan Liu
    Canhui Luo
    Jiangjin Yin
    Peer-to-Peer Networking and Applications, 2022, 15 : 1045 - 1056
  • [36] Modelling Stock Markets by Multi-agent Reinforcement Learning
    Johann Lussange
    Ivan Lazarevich
    Sacha Bourgeois-Gironde
    Stefano Palminteri
    Boris Gutkin
    Computational Economics, 2021, 57 : 113 - 147
  • [37] An overview: Attention mechanisms in multi-agent reinforcement learning
    Hu, Kai
    Xu, Keer
    Xia, Qingfeng
    Li, Mingyang
    Song, Zhiqiang
    Song, Lipeng
    Sun, Ning
    NEUROCOMPUTING, 2024, 598
  • [38] Multi-Agent Reinforcement Learning With Decentralized Distribution Correction
    Li, Kuo
    Jia, Qing-Shan
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 1684 - 1696
  • [39] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [40] Multi-Agent Reinforcement Learning for Adaptive User Association in Dynamic mmWave Networks
    Sana, Mohamed
    De Domenico, Antonio
    Yu, Wei
    Lostanlen, Yves
    Calvanese Strinati, Emilio
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (10) : 6520 - 6534