共 46 条
- [1] Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6077 - 6086
- [2] SPICE: Semantic Propositional Image Caption Evaluation [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 382 - 398
- [3] Banerjee S., 2005, P ACL WORKSH INTR EX, P65
- [4] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6298 - 6306
- [6] Cho KYHY, 2014, Arxiv, DOI [arXiv:1406.1078, DOI 10.48550/ARXIV.1406.1078]
- [8] Every Picture Tells a Story: Generating Sentences from Images [J]. COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 : 15 - +
- [10] Gupta A, 2012, LECT NOTES COMPUT SC, V7667, P196, DOI 10.1007/978-3-642-34500-5_24