共 98 条
- [32] Gehring J, 2017, PR MACH LEARN RES, V70
- [33] A Convolutional Encoder Model for Neural Machine Translation [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 123 - 135
- [35] Glorot X., 2010, P 13 INT C ART INT S, P249
- [36] Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
- [37] Attention-based LSTM with Semantic Consistency for Videos Captioning [J]. MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE, 2016, : 357 - 361
- [38] Deep Residual Learning for Image Recognition [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
- [39] Hinton G. E., 1993, Advances in neural information processing systems, V6, DOI DOI 10.1021/JP906511Z
- [40] What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7366 - 7375