共 32 条
- [1] SPICE: Semantic Propositional Image Caption Evaluation [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 382 - 398
- [2] Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4260 - 4269
- [3] Ashwin V., 2018, P AAAI C ART INT, V32
- [4] Banerjee Satanjeev, 2005, ACL WORKSHOPS, P65
- [5] Chen SZ, 2020, PROC CVPR IEEE, P9959, DOI 10.1109/CVPR42600.2020.00998
- [6] Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10687 - 10696
- [7] Injecting Semantic Concepts into End-to-End Image Captioning [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17988 - 17998
- [8] Fei JJ, 2023, Arxiv, DOI arXiv:2307.16525
- [9] StyleNet: Generating Attractive Visual Captions with Styles [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 955 - 964
- [10] MSCap: Multi-Style Image Captioning with Unpaired Stylized Text [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4199 - 4208