35 entries in total
[1]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 6077-6086
[2]
SPICE: Semantic Propositional Image Caption Evaluation [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909: 382-398
[3]
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[4]
Banerjee S, 2005, P ACL WORKSH INTR EX, P65, DOI 10.3115/1626355.1626389
[5]
SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 6298-6306
[6]
Cornia M, 2020, PROC CVPR IEEE, P10575, DOI 10.1109/CVPR42600.2020.01059
[7]
Guo Longteng, 2019, IEEE T MULTIMEDIA
[8]
Herdade S, 2019, ADV NEUR IN, V32
[9]
Attention on Attention for Image Captioning [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 4633-4642
[10]
Ji Jiayi, 2021, P AAAI C ART INT