135 entries in total
[1]
Munos R., From bandits to Monte-Carlo tree search: The optimistic principle applied to optimization and planning, Foundations and Trends in Machine Learning, 7, 1, pp. 1-129, (2014)
[2]
Sutton R.S., Barto A.G., Reinforcement Learning: An Introduction, (1998)
[3]
Bertsekas D.P., Dynamic Programming and Optimal Control, (1995)
[4]
Szepesvari C., Algorithms for reinforcement learning, Synthesis Lectures on Artificial Intelligence and Machine Learning, 4, 1, pp. 1-103, (2010)
[5]
Krizhevsky A., Sutskever I., Hinton G.E., ImageNet classification with deep convolutional neural networks, Proceedings of the International Conference on Neural Information Processing Systems, pp. 1097-1105, (2012)
[6]
Sermanet P., Kavukcuoglu K., Chintala S., et al., Pedestrian detection with unsupervised multi-stage feature learning, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3626-3633, (2013)
[7]
Dahl G.E., Yu D., Deng L., Acero A., Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, IEEE Transactions on Audio, Speech, and Language Processing, 20, 1, pp. 30-42, (2011)
[8]
Graves A., Mohamed A.R., Hinton G., Speech recognition with deep recurrent neural networks, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 6645-6649, (2013)
[9]
Huval B., Coates A., Ng A., Deep learning for class-generic object detection, (2013)
[10]
Makantasis K., Karantzalos K., Doulamis A., et al., Deep learning-based man-made object detection from hyperspectral data, Proceedings of the Advances in Visual Computing-11th International Symposium, pp. 717-727, (2015)