69 entries in total
- [1] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 6077 - 6086
- [2] SPICE: Semantic Propositional Image Caption Evaluation [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 382 - 398
- [3] Banerjee S., 2005, P ACL WORKSH INTR EX, V29, P65, DOI 10.3115/1626355.1626389
- [5] ATTRIBUTE CONDITIONED FASHION IMAGE CAPTIONING [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022: 1921 - 1925
- [6] Human-like Controllable Image Captioning with Verb-specific Semantic Roles [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021: 16841 - 16851
- [7] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017: 6298 - 6306
- [8] Chen SZ, 2020, PROC CVPR IEEE, P9959, DOI 10.1109/CVPR42600.2020.00998
- [9] Cheng WH, 2021, ACM COMPUT SURV, V54, DOI [10.1145/3552468.3554360, 10.1145/3447239]
- [10] Leveraging Weakly Annotated Data for Fashion Image Retrieval and Label Prediction [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017: 2268 - 2274