共 57 条
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[2]
SPICE: Semantic Propositional Image Caption Evaluation
[J].
COMPUTER VISION - ECCV 2016, PT V,
2016, 9909
:382-398
[3]
Convolutional Image Captioning
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:5561-5570
[4]
Artetxe Mikel, 2017, 6 INT C LEARN REPR I
[5]
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
[J].
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017),
2017,
:6298-6306
[6]
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:521-530
[7]
Chen X., 2015, MICROSOFT COCO CAPTI
[8]
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:8299-8308
[9]
Towards Diverse and Natural Image Descriptions via a Conditional GAN
[J].
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV),
2017,
:2989-2998
[10]
Denkowski M., 2014, PROC ACL WORKSHOP, P376