13 entries in total
- [1] Ariely D, 2010, UPSIDE IRRATIONALITY
- [2] A comprehensive survey of multiagent reinforcement learning [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) : 156 - 172
- [3] Ferguson T.S., 1989, Stat. Sci., V4, P282, DOI 10.1214/ss/1177012493
- [4] Haarnoja T, 2018, PR MACH LEARN RES, V80
- [5] Hoffman M., P 34 INT C MACH LEAR
- [7] Mnih V., 2013, PLAYING ATARI DEEP R, V1312, P5602, DOI 10.48550/ARXIV.1312.5602
- [8] Mnih V, 2016, PR MACH LEARN RES, V48
- [9] Human-level control through deep reinforcement learning [J]. NATURE, 2015, 518 (7540) : 529 - 533
- [10] Nowé A, 2012, ADAPT LEARN OPTIM, V12, P441