共 25 条
- [1] Natural gradient works efficiently in learning [J]. NEURAL COMPUTATION, 1998, 10 (02) : 251 - 276
- [2] [Anonymous], 1988, TECHNICAL REPORT
- [3] Optimization Methods for Large-Scale Machine Learning [J]. SIAM REVIEW, 2018, 60 (02) : 223 - 311
- [4] Dean J., 2012, ADV NEURAL INFORM PR, V1, P1223
- [5] Deng L, 2013, INT CONF ACOUST SPEE, P8604, DOI 10.1109/ICASSP.2013.6639345
- [6] Duchi J, 2011, J MACH LEARN RES, V12, P2121
- [7] Graves A., 2013, ARXIV PREPRINT ARXIV
- [8] Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947
- [9] Hinton GE., 2012, ARXIV PREPRINT ARXIV