Game Theory-Based Control System Algorithms with Real-Time Reinforcement Learning HOW TO SOLVE MULTIPLAYER GAMES ONLINE

被引：136

作者：

Vamvoudakis, Kyriakos G. ^{[1
]}

Modares, Hamidreza ^{[2
]}

Kiumarsi, Bahare ^{[3
]}

Lewis, Frank L. ^{[4
,5
]}

机构：

[1] Virginia Tech, Dept Aerosp & Ocean Engn, Blacksburg, VA 24061 USA

[2] Missouri Univ Sci & Technol, Rolla, MO USA

[3] Univ Texas Arlington, Arlington, TX 76019 USA

[4] Univ Texas Arlington, Res Inst, Ft Worth, TX USA

[5] Northeastern Univ, Shenyang, Peoples R China

来源：

IEEE CONTROL SYSTEMS MAGAZINE | 2017年 / 37卷 / 01期

关键词：

OPTIMAL TRACKING CONTROL; ZERO-SUM GAMES; STACKELBERG STRATEGY; FEEDBACK; EQUATION;

D O I：

10.1109/MCS.2016.2621461

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Complex human-engineered systems involve an interconnection of multiple decision makers (or agents) whose collective behavior depends on a compilation of local decisions that are based on partial information about each other and the state of the environment [1]-[4]. Strategic interactions among agents in these systems can be modeled as a multiplayer simultaneous-move game [5]-[8]. The agents involved can have conflicting objectives, and it is natural to make decisions based upon optimizing individual payoffs or costs. © 2016 IEEE.

引用

页码：33 / 52

页数：20

共 66 条

[11]

Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f

[12]

Camerer C. F., 2003, Behavioral game theory: Experiments in strategic interaction, DOI [DOI 10.1016/J.SOCEC.2003.10.009, 10.1016/j.socec.2003.10.009]

[13] STATE-SPACE SOLUTIONS TO STANDARD H-2 AND H-INFINITY CONTROL-PROBLEMS [J].

DOYLE, JC ;

GLOVER, K ;

KHARGONEKAR, PP ;

FRANCIS, BA .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1989, 34 (08) :831-847

[14]

Engwerda J, 2005, LQ dynamic optimization and differential games

[15]

Euwe M., 1958, LOGICAL APPROACH CHE

[16] GAME-THEORY AND TRANSPORTATION SYSTEMS MODELING [J].

FISK, CS .

TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 1984, 18 (4-5) :301-313

[17] A max-plus-based algorithm for a Hamilton-Jacobi-Bellman equation of nonlinear filtering [J].

Fleming, WH ;

McEneaney, WM .

SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2000, 38 (03) :683-710

[18] On global existence of solutions to coupled matrix Riccati equations in closed-loop Nash games [J].

Freiling, G ;

Jank, G ;

AbouKandil, H .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1996, 41 (02) :264-269

[19]

Harrop Ronald, 1961, Z MATH LOGIK GRUNDLA, V7, P136

[20] Approximate N-Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System [J].

Johnson, Marcus ;

Kamalapurkar, Rushikesh ;

Bhasin, Shubhendu ;

Dixon, Warren E. .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (08) :1645-1658

← 1 2 3 4 5 6 7 →