49 entries in total
[41]
Sutton RS, 2000, ADV NEUR IN, V12, P1057
[42]
Veličković P., 2017, P INT C LEARN REPR
[43]
Wang WX, 2020, AAAI CONF ARTIF INTE, V34, P7293
[45]
Whiteson, 2018, QMIX MONOTONIC VALUE
[47]
Yang YD, 2018, PR MACH LEARN RES, V80
[49]
Zhang Z., 2019, Integrating independent and centralized multi-agent reinforcement learning for traffic signal network optimization