48 entries in total
[1] SPICE: Semantic Propositional Image Caption Evaluation [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909: 382-398
[2] [Anonymous], 2008, P 3 WORKSH STAT MACH
[3] Arjovsky M, 2017, PR MACH LEARN RES, V70
[4] Pseudo Content Hallucination for Unpaired Image Captioning [J]. PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024: 320-329
[6] Self-Distillation for Few-Shot Image Captioning [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021: 545-555
[7] Meshed-Memory Transformer for Image Captioning [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020: 10575-10584
[8] Devlin Jacob, 2018, 181004805 ARXIV
[9] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[10] Unsupervised Image Captioning [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 4120-4129