共 29 条
[21]
Schulman J., 2017, arXiv
[22]
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[23]
Theodorou EA, 2010, J MACH LEARN RES, V11, P3137
[24]
Todorov E., 2006, Advances in neural information processing systems, V19
[25]
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2020,
:7151-7160
[26]
Toussaint M., 2009, P 26 ANN INT C MACH, P1049, DOI 10.1145/1553374.1553508
[27]
Wang JX, 2017, Arxiv, DOI [arXiv:1611.05763, 10.48550/arXiv.1611.05763]
[28]
Ziebart B.D., 2010, MODELING PURPOSEFUL
[29]
Navigate Like a Cabbie: Probabilistic Reasoning from Observed Context-Aware Behavior
[J].
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING (UBICOMP 2008),
2008,
:322-331