Monte-Carlo Tree Search for Graph Coalition Structure Generation

被引：0

作者：

Kong, Xianglong ^{[1
]}

Tong, Xiangrong ^{[1
]}

机构：

[1] Yantai Univ, Sch Comp & Control Engn, Yantai, Peoples R China

来源：

2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020) | 2020年

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

graph coalition structure generation; monte-carlo tree search; Multi-agent Systems; Cooperative Games; ALGORITHM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The techniques used in tree search have been proved to be effective in solving coalition structure generation (CSG). Inspired by this, we propose an algorithm to solve the problem of CSG on graphs, where the goal is to divide the graph into connected subgraphs and find the optimal partition. Specifically, we develop a new search algorithm for the problem of graph coalition structure generation (GCSG), which draws on the monte-carlo tree search (MCTS) to sample and expand the GCSG search space generated by edge contraction and 2-color graph to obtain the optimal solution. The algorithm is complete and anytime. The experimental results are compared with the latest methods, and show that this method is better than other. The proposed algorithm can be applied to the study of large-scale agent GCSG problems.

引用

页码：1058 / 1063

页数：6

共 18 条

[1] Finite-time analysis of the multiarmed bandit problem
Auer, P
Cesa-Bianchi, N
Fischer, P
[J]. MACHINE LEARNING, 2002, 47 (2-3) : 235 - 256
[2] A Survey of Monte Carlo Tree Search Methods
Browne, Cameron B.
Powley, Edward
Whitehouse, Daniel
Lucas, Simon M.
Cowling, Peter I.
Rohlfshagen, Philipp
Tavener, Stephen
Perez, Diego
Samothrakis, Spyridon
Colton, Simon
[J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) : 1 - 43
[3] Chang M. K., 2020, IEEE T GEOSCIENCE RE, V99, P1
[4] Flammini M, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P1353
[5] Hipel K. W, 2018, IEEE T SYS, P1
[6] Bandit based Monte-Carlo planning
Kocsis, Levente
Szepesvari, Csaba
[J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 282 - 293
[7] A hybrid exact algorithm for complete set partitioning
Michalak, Tomasz
Rahwan, Talal
Elkind, Edith
Wooldridge, Michael
Jennings, Nicholas R.
[J]. ARTIFICIAL INTELLIGENCE, 2016, 230 : 14 - 50
[8] Myerson R. B., 1977, Mathematics of Operations Research, V2, P225, DOI 10.1287/moor.2.3.225
[9] Policiuc A. A, 2020, CORR
[10] An Anytime Algorithm for Optimal Coalition Structure Generation
Rahwan, Talal
Ramchurn, Sarvapali D.
Jennings, Nicholas R.
Giovannucci, Andrea
[J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 34 : 521 - 567

← 1 2 →