共 149 条
[1]
Yuan A(2019)3G structure for image caption generation Neurocomputing 45 539-559
[2]
Li X(2023)From show to tell: a survey on deep learning-based image captioning IEEE Trans. Pattern Anal. Mach. Intell. 80 18413-18443
[3]
Lu X(2020)The synergy of double attention: combine sentence-level and word-level attention for image captioning Comput. Vis. Image Underst. 8 154953-154965
[4]
Stefanini M(2019)Multilayer dense attention model for image caption IEEE Access 79 11531-11549
[5]
Cornia M(2021)MRRC: multiple role representation crossover interpretation for image captioning with R-CNN feature distribution composition (FDC) Multimed. Tools Appl. 54 3157-3171
[6]
Baraldi L(2021)Cross-domain image captioning via cross-modal retrieval and model adaptation IEEE Trans. Image Process. 82 1223-1236
[7]
Cascianelli S(2022)Image captioning with novel topics guidance and retrieval-based topics re-weighting IEEE Trans. Multimed. 23 2413-2427
[8]
Fiameni G(2021)Adaptive attention-based high-level semantic introduction for image caption ACM Trans. Multimed. Comput. Commun. Appl. 11 134-143
[9]
Cucchiara R(2020)Stack-VS: stacked visual-semantic attention for image caption generation IEEE Access 17 1-20
[10]
Wei H(2022)A detailed review of prevailing image captioning methods using deep learning techniques Multimed. Tools Appl. 7 66680-66688