共 255 条
[1]
Andriotis CP(2021)Cooperative zone-based rebalancing of idle overhead hoist transportations using multi-agent reinforcement learning with graph representation learning IISE Trans 0 1-17
[2]
Papakonstantinou KG(2019)Managing engineering systems with large state and action spaces through deep reinforcement learning Reliab Eng Syst 191 106483-135
[3]
Arel I(2010)Reinforcement learning-based multi-agent system for network traffic signal control IET Intell Transp Syst 4 128-279
[4]
Liu C(2020)The hanabi challenge: a new frontier for ai research Artif Intell 280 103216-840
[5]
Urbanik T(2013)The arcade learning environment: an evaluation platform for general agents J Artif Intell Res 47 253-405
[6]
Kohls AG(2002)The complexity of decentralized control of markov decision processes Math Oper Res 27 819-469
[7]
Edward H(2012)Convergence of a multi-agent projected stochastic gradient algorithm for non-convex optimization IEEE Trans Autom Control 58 391-250
[8]
Bellemare MG(2000)The o.d.e. method for convergence of stochastic approximation and reinforcement learning SIAM J Control Optim 38 447-172
[9]
Naddaf Y(2002)Multiagent learning using a variable learning rate Artif Intell 136 215-1512
[10]
Veness J(2008)A comprehensive survey of multiagent reinforcement learning IEEE Trans Syst, Man, Cybern, Part C (Appl Rev) 38 156-4305