共 23 条
- [21] Bottou L., Large-scale machine learning with stochastic gradient descent, Proceedings of the 19th International Conference on Computational Statistics, pp. 177-186, (2010)
- [22] Kingma D.P., Ba J., Adam: a method for stochastic optimization, Proceedings of the 3rd International Conference on Learning Representations, (2015)
- [23] Srivastava N., Hinton G., Krizhevsky A., Et al., Dropout: a simple way to prevent neural networks from overfitting, The Journal of Machine Learning Research, 15, 1, pp. 1929-1958, (2014)