共 62 条
[1]
Anderson P, 2016, Arxiv, DOI arXiv:1607.08822
[2]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[3]
[Anonymous], 2004, ROUGE PACKAGE AUTOMA
[4]
Banerjee S., 2005, P ACL WORKSH INTR EX, P65
[6]
A Hierarchical Multimodal Attention-based Neural Network for Image Captioning
[J].
SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL,
2017,
:889-892
[7]
Meshed-Memory Transformer for Image Captioning
[J].
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020),
2020,
:10575-10584
[9]
Fan ZH, 2021, Arxiv, DOI arXiv:2106.10936
[10]
Every Picture Tells a Story: Generating Sentences from Images
[J].
COMPUTER VISION-ECCV 2010, PT IV,
2010, 6314
:15-+