37 entries in total
[1]
Agarwal A., 2020, C LEARN THEOR, P64
[2]
Andreas J, 2017, PR MACH LEARN RES, V70
[4]
Cooperative Multi-agent Policy Gradient [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT I, 2019, 11051: 459-476
[6]
Cutkosky A, 2019, ADV NEUR IN, P15210
[7]
D'Eramo C., 2020, INT C LEARN REPR
[9]
Fazel M, 2018, PR MACH LEARN RES, V80
[10]
Foerster JN, 2016, ADV NEUR IN, V29