共 30 条
[1]
[Anonymous], 2016, ARXIV161004062
[2]
VQA: Visual Question Answering
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:2425-2433
[3]
Movie Fill in the Blank with Adaptive Temporal Attention and Description Update
[J].
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT,
2017,
:1039-1048
[4]
Cooijmans T., 2016, CoRR
[5]
Corrado G., 2013, WORKSH P INT C LEARN, V1301, P3781
[7]
YouTube2Text: Recognizing and Describing Arbitrary Activities Using Semantic Hierarchies and Zero-shot Recognition
[J].
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2013,
:2712-2719
[9]
Deep Residual Learning for Image Recognition
[J].
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2016,
:770-778