[1]
Agogino AK (2012) A multiagent approach to managing air traffic flow. Auton Agents Multiagent Syst 24:1-25
[2]
Dietterich T (2000) Hierarchical reinforcement learning with the MAXQ value function decomposition. J Artif Intell Res 13:227-303
[3]
Kok JR (2006) Collaborative multiagent reinforcement learning by payoff propagation. J Mach Learn Res 7:1789-1828
[4]
Ma A (2020) Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning. Auton Robot 44:485-503
[5]
Milchtaich I (2004) Social optimality and cooperation in nonatomic congestion games. J Econ Theory 114:56-87
[6]
Peng XB (2017) DeepLoco: dynamic locomotion skills using hierarchical deep reinforcement learning. ACM Trans Graph 36:1-13
[7]
Penn M (2011) Congestion games with failures. Discr Appl Math 159:1508-1525
[8]
Rasmussen D (2017) A neural model of hierarchical reinforcement learning. PLoS One 12:e0180234
[9]
Rosenthal RW (1973) A class of games possessing pure-strategy Nash equilibria. Int J Game Theory 2:65-67
[10]
Sutton RS (1999) Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning. Artif Intell 112:181-211