14 entries in total
[1]
Chen Feiyu, 2020, T MULTIMEDIA
[2]
Gabeur Valentin, 2020, EUROPEAN C COMPUTER
[3]
Ging S., 2020, arXiv
[4]
Dense-Captioning Events in Videos [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 706-715
[5]
End-to-End Learning of Visual Representations from Uncurated Instructional Videos [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 9876-9886
[6]
Patrick M, 2021, arXiv, DOI arXiv:2010.02824
[7]
Rohrbach A, 2015, PROC CVPR IEEE, P3202, DOI 10.1109/CVPR.2015.7298940
[8]
Rouditchenko A, 2021, arXiv, DOI arXiv:2006.09199
[9]
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 10635-10644