89 references in total
[1]
Abbeel P, Coates A, Ng AY (2010) Autonomous helicopter aerobatics through apprenticeship learning. The International Journal of Robotics Research 29:1608-1639
[2]
Aminikhanghahi S, Cook DJ (2017) A survey of methods for time series change point detection. Knowledge and Information Systems 51:339-367
[3]
Argall BD, Chernova S, Veloso M, Browning B (2009) A survey of robot learning from demonstration. Robotics and Autonomous Systems 57:469-483
[4]
Baxter J, Bartlett PL (2001) Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research 15:319-350
[5]
Bellman R (1958) On a routing problem. Quarterly of Applied Mathematics 16:87-90
[6]
Buehler H et al. (2019) Deep hedging. Quantitative Finance 19:1271-1291
[7]
Deisenroth MP et al. (2013) A survey on policy search for robotics. Foundations and Trends in Robotics 2:1-142
[8]
Englert P et al. (2013) Probabilistic model-based imitation learning. Adaptive Behavior 21:388-403
[9]
Giuliani M et al. (2019) Detecting the state of the climate system via artificial intelligence to improve seasonal forecasts and inform reservoir operations. Water Resources Research 55:9133-9147
[10]
Hussein A et al. (2017) Imitation learning: A survey of learning methods. ACM Computing Surveys (CSUR) 50:1-35