21 entries in total
[2]
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[3]
Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107
[4]
Understanding and Optimizing Asynchronous Low-Precision Stochastic Gradient Descent, 2017, 44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), P561-574
[5]
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[7]
Gruau F, 1994, Neural network synthesis using cellular encoding and the genetic algorithm
[8]
Hara K, 2015, IEEE IJCNN
[9]
He K, 2015, INDIAN J CHEM B
[10]
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI 10.1162/neco.1997.9.8.1735