共 44 条
[21]
Kingma D. P., P 3 INT C LEARN REPR
[22]
Kiros R, 2014, Arxiv, DOI arXiv:1411.2539
[23]
Visual Semantic Search: Retrieving Videos via Complex Textual Queries
[J].
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2014,
:2657-2664
[24]
Liu W, 2015, PROC CVPR IEEE, P3707, DOI 10.1109/CVPR.2015.7298994
[25]
Otani M., 2016, LNCS, V9913, P651, DOI [10.1007/978-3-319-46604-0 _46, DOI 10.1007/978-3-319-46604-0_46]
[26]
Plummer B.A., 2017, IEEE C COMP VIS PATT
[28]
Query-Focused Extractive Video Summarization
[J].
COMPUTER VISION - ECCV 2016, PT VIII,
2016, 9912
:3-19
[29]
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:1049-1058
[30]
Smoliar S. W., 1994, IEEE Multimedia, V1, P62, DOI 10.1109/93.311653