共 16 条
- [1] [Anonymous], 2015, 2015 12 INT C EXPO E
- [2] Behrisch M., 2011, Sumo-simulation of urban mobility: An overview, V2011
- [3] Casas N, 2017, Arxiv, DOI [arXiv:1703.09035, DOI 10.48550/ARXIV.1703.09035]
- [4] An Introduction to Deep Reinforcement Learning [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2018, 11 (3-4): : 219 - 354
- [6] Traffic signal timing via deep reinforcement learning [J]. Li, Li (li-li@tsinghua.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (03): : 247 - 254
- [7] Mnih V, 2013, Arxiv, DOI [arXiv:1312.5602, 10.48550/ARXIV.1312.5602]
- [9] Schulman J., 2015, arXiv, DOI [arXiv:1502.05477, DOI 10.48550/ARXIV.1502.05477]
- [10] Schulman J, 2017, Arxiv, DOI arXiv:1707.06347