64 entries in total
[1] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018: 6077-6086
[2] SPICE: Semantic Propositional Image Caption Evaluation [J]. Computer Vision - ECCV 2016, Pt V, 2016, 9909: 382-398
[3] [Anonymous], 2012, Association for Computational Linguistics
[4] [Anonymous], 2007, P 2 WORKSH STAT MACH
[5] [Anonymous], 2012, P 13 C EUR CHAPT ASS
[6] Bahdanau D, 2016, arXiv, DOI arXiv:1409.0473
[7] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J]. 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), 2017: 6298-6306
[8] Visual Dialog [J]. 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), 2017: 1080-1089
[9] Flick C., 2018, PROC ACL WORKSHOP, P1
[10] StyleNet: Generating Attractive Visual Captions with Styles [J]. 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), 2017: 955-964