共 16 条
- [2] [Anonymous], 2014, MARKOV DECISION PROC
- [3] Behrisch, 2012, INT J ADV SYST MEAS, V5, P128
- [4] CHIU S, 1993, SECOND IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, P1371, DOI 10.1109/FUZZY.1993.327593
- [6] Genders W., Using a deep reinforcement learning agent for traffic signal control
- [7] Genders W, 2016, ARXIV PREPRINT ARXIV
- [8] Traffic signal timing via deep reinforcement learning [J]. Li, Li (li-li@tsinghua.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc. (03): : 247 - 254
- [9] Human-level control through deep reinforcement learning [J]. NATURE, 2015, 518 (7540) : 529 - 533