On the Combination of Game-Theoretic Learning and Multi Model Adaptive Filters

Authors
Smyrnakis, Michalis [1]
Qu, Hongyang [2]
Bauso, Dario [3, 4]
Veres, Sandor [2]
Affiliations
[1] Sci & Technol Facil Council, Daresbury, England
[2] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield, S Yorkshire, England
[3] Univ Groningen Nijenborgh, Fac Sci & Engn, Jan C Willems Ctr Syst & Control ENTEG, Groningen, Netherlands
[4] Univ Palermo, Dipartimento Ingn, Viale Sci, Palermo, Italy
Source
AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2020 | 2021 / Vol. 12613
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Game-theoretic learning; Distributed optimisation; Multi-model adaptive filters; Robot teams coordination; Fictitious play; Bayesian games; Potential games; State based games; Stochastic games; TASK ALLOCATION; COORDINATION; AGENTS; TEAM;
DOI
10.1007/978-3-030-71158-0_4
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper casts the coordination of a team of robots within the framework of game-theoretic learning algorithms. In particular, it proposes a novel variant of fictitious play that uses multi-model adaptive filters to estimate the other players' strategies. The proposed algorithm can serve as a coordination mechanism between players who must make decisions under uncertainty. Each player chooses an action after taking into account both the actions of the other players and this uncertainty, which can arise from noisy observations or from the other players being of various types. In addition, in contrast to other game-theoretic and heuristic algorithms for distributed optimisation, the optimal parameters need not be found a priori: various parameter values can be used initially as inputs to different models, so the resulting decisions aggregate the results of all the parameter values. Simulations are used to test the performance of the proposed methodology against other game-theoretic learning algorithms.
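Illustrative sketch (not part of the record, and not the authors' algorithm): the abstract's idea of fictitious play with several estimators running side by side might look roughly like the Python below. Everything specific here is an assumption made for illustration: the names `MultiModelEstimator`, `forgetting_rates`, `best_response` and `simulate`, the use of exponential-forgetting frequency estimates as the individual models, the likelihood-based model weights, and the 2x2 identical-interest payoff matrix.

```python
import numpy as np


class MultiModelEstimator:
    """Bank of simple adaptive estimators of an opponent's mixed strategy.

    Hypothetical stand-in for a multi-model adaptive filter: each model is
    an exponential-forgetting frequency estimate with its own rate, and the
    models are combined with Bayesian weights.
    """

    def __init__(self, n_actions, forgetting_rates):
        self.rates = np.asarray(forgetting_rates, dtype=float)
        n_models = len(self.rates)
        # Every model starts from a uniform estimate and an equal weight.
        self.estimates = np.full((n_models, n_actions), 1.0 / n_actions)
        self.weights = np.full(n_models, 1.0 / n_models)

    def update(self, observed_action):
        # Reweight each model by the probability it gave the observed action.
        likelihood = np.maximum(self.estimates[:, observed_action], 1e-12)
        self.weights = self.weights * likelihood
        self.weights /= self.weights.sum()
        # Each model then moves toward the observation at its own rate.
        one_hot = np.zeros(self.estimates.shape[1])
        one_hot[observed_action] = 1.0
        r = self.rates[:, None]
        self.estimates = (1.0 - r) * self.estimates + r * one_hot

    def strategy(self):
        # Aggregate estimate: weighted average over all parameter values.
        return self.weights @ self.estimates


def best_response(payoff, belief):
    # payoff[i, j] is the reward for own action i against opponent action j.
    return int(np.argmax(payoff @ belief))


def simulate(payoff, forgetting_rates, steps=50):
    # Assumes a symmetric identical-interest game, so both players can
    # share the same (symmetric) payoff matrix.
    n_actions = payoff.shape[0]
    est1 = MultiModelEstimator(n_actions, forgetting_rates)
    est2 = MultiModelEstimator(n_actions, forgetting_rates)
    a1 = a2 = None
    for _ in range(steps):
        a1 = best_response(payoff, est1.strategy())
        a2 = best_response(payoff, est2.strategy())
        est1.update(a2)  # player 1 observes player 2's action
        est2.update(a1)  # player 2 observes player 1's action
    return a1, a2


# Toy coordination game (assumed): matching on action 1 pays more.
PAYOFF = np.array([[1.0, 0.0], [0.0, 2.0]])
final_actions = simulate(PAYOFF, forgetting_rates=[0.05, 0.2, 0.5])
```

In this toy game both players end up on the higher-payoff joint action. The point of the sketch is only that no single forgetting rate has to be tuned in advance, because the belief used for the best response is a weighted aggregate over all of them.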
Pages: 73-105
Number of pages: 33