36 entries in total
[1]
[Anonymous], 2004, WORKSH TEXT SUMM BRA
[3]
Chen Shizhe, 2019, ARXIV190705092
[4]
Personalized Key Frame Recommendation [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017: 315-324
[5]
Cho K., 2014, P EMPIRICAL METHODS, P1724, DOI 10.3115/V1/D14-1179
[6]
Visual Dialog [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 1080-1089
[7]
A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013: 2634-2641
[8]
Denkowski M., 2014, P 9 WORKSH STAT MACH, P376
[9]
Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 6087-6096
[10]
Fjord, 2018, LIBROSA LIBROSA 0 6, DOI 10.5281/ZENODO.1342708