共 49 条
[1]
nocaps: novel object captioning at scale
[J].
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019),
2019,
:8947-8956
[2]
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
[J].
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR),
2018,
:6077-6086
[3]
SPICE: Semantic Propositional Image Caption Evaluation
[J].
COMPUTER VISION - ECCV 2016, PT V,
2016, 9909
:382-398
[4]
[Anonymous], 2005, The statistics of word cooccurrences: Word pairs and collocations
[5]
[Anonymous], 2000, How children learn the meanings of words
[6]
[Anonymous], 2021, P ICML
[7]
[Anonymous], 2014, P 9 WORKSHOP STAT MA, DOI DOI 10.3115/V1/W14-3348
[8]
[Anonymous], 2017, P NIPS
[9]
CaMEL: Mean Teacher Learning for Image Captioning
[J].
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR),
2022,
:4087-4094