共 79 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
[Anonymous], 2017, IEEE INT C COMPUT VI, DOI [10.1109/iccv.201, DOI 10.1109/ICCV.2017.322]
[3]
Bastings J., 2017, P 2017 C EMPIRICAL M, P1957
[4]
Cho K., 2014, ARXIV14061078, DOI 10.3115/v1/D14-1179
[5]
Detecting Visual Relationships with Deep Relational Networks
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:3298-3308
[6]
Learning Everything about Anything: Webly-Supervised Visual Concept Learning
[J].
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2014,
:3270-3277
[7]
Learning Spatiotemporal Features with 3D Convolutional Networks
[J].
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2015,
:4489-4497
[9]
Fukui A, 2016, P C EMP METH NAT LAN, P457, DOI [10.18653/v1/d16-1044, DOI 10.18653/V1/D16-1044]
[10]
Galleguillos C, 2008, PROC CVPR IEEE, P3552