共 66 条
- [1] Abbeel P., 2004, P 21 INT C MACH LEAR, P1, DOI [10.1145/1015330.1015430, DOI 10.1145/1015330.1015430]
- [2] Arjovsky M, 2017, Arxiv, DOI arXiv:1701.07875
- [4] Bai X., 2019, P ADV NEUR INF PESS, p10 735
- [5] Ballas N., 2015, DELVING DEEPER CONVO
- [7] Bloem M, 2014, IEEE DECIS CONTR P, P4911, DOI 10.1109/CDC.2014.7040156
- [8] DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2722 - 2730
- [9] Chen HK, 2019, AAAI CONF ARTIF INTE, P3312
- [10] Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1187 - 1196