12 entries in total
- [1] Sutton R.S., Barto A.G., Reinforcement Learning: An Introduction, pp. 1-5, (1998)
- [2] Gao Y., Chen S.F., Lu X., Research on reinforcement learning technology: A review, Acta Automatica Sinica, 30, 1, pp. 86-100, (2004)
- [3] Wu J., Xu X., Wang J., et al., Recent advances of reinforcement learning in multi-robot systems: A survey, Control and Decision, 26, 11, pp. 1601-1610, (2011)
- [4] Chen Z.H., Yang Z.H., Wang H.B., et al., Overview of reinforcement learning from knowledge expression and handling, Control and Decision, 23, 9, pp. 962-968, (2008)
- [5] Zhu M.Q., Li M., Zhang Q., A dyna Q-learning algorithm in underground path planning, Industrial and Mine Automation, 12, pp. 71-75, (2012)
- [6] Bianchi R.A.C., Ribeiro C.H.C., Costa A.H.R., Accelerating autonomous learning by using heuristic selection of actions, Journal of Heuristics, 14, 2, pp. 135-168, (2008)
- [7] Grzes M., Improving exploration in reinforcement learning through domain knowledge and parameter analysis, pp. 34-36, (2010)
- [8] Knox W.B., Stone P., Augmenting reinforcement learning with human feedback, The 28th ICML Workshop on New Developments in Imitation Learning, (2011)
- [9] Belkin M., Niyogi P., Laplacian eigenmaps for dimensionality reduction and data representation, Neural Computation, 15, 6, pp. 1373-1396, (2003)
- [10] Zhu M.Q., Cheng Y.H., Li M., et al., A hybrid transfer algorithm for reinforcement learning based on spectral method, Acta Automatica Sinica, 38, 11, pp. 1765-1776, (2012)