共 57 条
[1]
Belkin M., Niyogi P., Laplacian eigenmaps for dimensionality reduction and data representation, Neural Comput, 15, 6, pp. 1373-1396, (2003)
[2]
Bellman R.E., Dynamic Programming, (1957)
[3]
Bengio Y., Lamblin P., Popovici D., Larochelle H., Greedy layer-wise training of deep networks, Advances in Neural Information Processing Systems, (2007)
[4]
Bohmer W., Grunewalder S., Nickisch H., Obermayer K., Generating feature spaces for linear algorithms with regularized sparse kernel slow feature analysis, Mach Learn, 89, 1-2, pp. 67-86, (2012)
[5]
Bohmer W., Grunewalder S., Shen Y., Musial M., Obermayer K., Construction of approximation spaces for reinforcement learning, J Mach Learn Res, 14, pp. 2067-2118, (2013)
[6]
Bohmer W., Obermayer K., Towards structural generalization: Factored approximate planning, ICRA Workshop on Autonomous Learning, (2013)
[7]
Boutilier C., Dean T., Hanks S., Decision-theoretic planning: structural assumptions and computational leverage, J Artif Intell Res, 11, pp. 1-94, (1999)
[8]
Boyan J.A., Moore A.W., Generalization in reinforcement learning: Safely approximating the value function, Advances in Neural Information Processing Systems, pp. 369-376, (1995)
[9]
Bradtke S.J., Barto A.G., Linear least-squares algorithms for temporal difference learning, Mach Learn, 22, 1-3, pp. 33-57, (1996)
[10]
Dzeroski S., Raedt L.D., Drissens K., Relational reinforcement learning, Mach Learn, 43, pp. 7-52, (2001)