共 47 条
- [1] SPICE: Semantic Propositional Image Caption Evaluation [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 382 - 398
- [2] Chaorui Deng, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12358), P712, DOI 10.1007/978-3-030-58601-0_42
- [3] Chen SZ, 2020, PROC CVPR IEEE, P9959, DOI 10.1109/CVPR42600.2020.00998
- [4] "Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 527 - 543
- [5] Cornia M, 2020, PROC CVPR IEEE, P10575, DOI 10.1109/CVPR42600.2020.01059
- [6] Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8299 - 8308
- [7] Imageability ratings for 3,000 monosyllabic words [J]. BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 2004, 36 (03): : 384 - 387
- [8] Denkowski M., 2014, P 9 WORKSHOP STAT MA, P376
- [9] Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10687 - 10696
- [10] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171