共 51 条
[1]
Baker B, 2017, Arxiv, DOI arXiv:1611.02167
[2]
LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
[J].
IEEE TRANSACTIONS ON NEURAL NETWORKS,
1994, 5 (02)
:157-166
[3]
Bi MX, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P3259
[4]
Cai H., 2019, ICLR
[5]
Deb K., 2000, Parallel Problem Solving from Nature PPSN VI. 6th International Conference. Proceedings (Lecture Notes in Computer Science Vol.1917), P849
[6]
Elsken T., 2018, P 6 INT C LEARN REPR, P1
[7]
Elsken T, 2017, Arxiv, DOI arXiv:1711.04528
[8]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778
[9]
Hsu CH, 2018, Arxiv, DOI [arXiv:1806.10332, DOI 10.48550/ARXIV.1806.10332]
[10]
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]