共 18 条
- [1] Sutton R.S., Barto A.G., Reinforcement Learning: An Introduction, (1998)
- [2] Gao Y., Chen S.-F., Lu X., Research on reinforcement learning technology: A review, Acta Automatica Sinica, 30, 1, pp. 86-100, (2004)
- [3] Zhao D.-B., Liu D.-R., Yi J.-Q., An overview on the adaptive dynamic programming based urban city traffic signal optimal control, Acta Automatica Sinica, 35, 6, pp. 676-681, (2009)
- [4] Barto A.G., Mahadevan S., Recent advances in hierarchical reinforcement learning, Discrete Event Dynamic Systems, 13, 4, pp. 341-379, (2003)
- [5] Pan S.J., Yang Q., A survey on transfer learning, IEEE Transactions on Knowledge and Data Engineering, 22, 10, pp. 1345-1359, (2010)
- [6] Taylor M.E., Stone P., Transfer learning for reinforcement learning domains: A survey, The Journal of Machine Learning Research, 10, pp. 1633-1685, (2009)
- [7] Wang H., Gao Y., Cheng X.-G., Transfer of reinforcement learning: The state of the art, Acta Electronica Sinica, 36, 12 a, pp. 39-43, (2008)
- [8] Mahadevan S., Maggioni M., Proto-value functions: A Lapla-cian framework for learning representation and control in Markov decision processes, The Journal of Machine Learning Research, 8, pp. 2169-2231, (2007)
- [9] Chiu C.C., Soo V.W., Automatic complexity reduction in reinforcement learning, Computational Intelligence, 26, 1, pp. 1-25, (2010)
- [10] Simsek O., Wolfe A.P., Barto A.G., Identifying useful sub-goals in reinforcement learning by local graph partitioning, Proceedings of the 22nd International Conference on Machine Learning, pp. 816-823, (2005)