46 entries in total
[11]
Garbarino EC (1996) Average reward reinforcement learning: Foundations, algorithms, and empirical results. Machine Learning 22:159-195
[12]
Edell JA (2009) Robust navigation in an unknown environment with minimal sensing and representation. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 39:212-229
[13]
Helmert M (2008) Comparing the power of robots. The International Journal of Robotics Research 27:5-23
[14]
Hoffmann J (2005) An MDP-based recommender system. Journal of Machine Learning Research 6:1265-1295
[15]
Nebel B (2013) A survey of point-based POMDP solvers. Autonomous Agents and Multi-Agent Systems 27:1-51
[16]
Kupcsik A (2014) Deploying a modeling framework for reusable robot behavior to enable informed strategies for domestic service robots. Robotics and Autonomous Systems 62:619-631
[17]
Deisenroth MP (2006) Branching and pruning: An optimal temporal POCL planner based on constraint programming. Artificial Intelligence 170:298-335
[18]
Peters J (2010) Comparison of optimal solutions to real-time path planning for a mobile vehicle. IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans 40:721-731
[19]
Loh AP
[20]
Vadakkepat P