35 entries in total
[1]
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3185-3194
[2]
Bidirectional Long-Short Term Memory for Video Description
[J].
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE,
2016,
:436-440
[3]
Chen David, 2011, ACL, P190
[4]
Less Is More: Picking Informative Frames for Video Captioning
[J].
COMPUTER VISION - ECCV 2018, PT XIII,
2018, 11217
:367-384
[5]
Collobert R., 2008, Proceedings of the 25th international conference on Machine learning, V25, P160, DOI 10.1145/1390156.1390177
[6]
Denkowski M, 2014, P 9 WORKSH STAT MACH, P376
[8]
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[9]
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[10]
He K., 2016, IEEE C COMPUT VIS PA, DOI [10.1007/978-3-319-46493-0_38, 10.1109/CVPR.2016.90]