共 19 条
[1]
Hossain MZ(2019)A comprehensive survey of deep learning for image captioning ACM Comput Surv 51 1-36
[2]
Sohel F(2020)Dual-CNN: a convolutional language decoder for paragraph image captioning Neurocomputing 396 92-101
[3]
Shiratuddin MF(2018)Captioning transformer with stacked attention modules Appl Sci 8 739-73
[4]
Laga HJACS(2017)Visual genome: connecting language and vision using crowdsourced dense image annotations Int J Comput Vis 123 32-99
[5]
Li R(2015)Faster r-cnn: towards real-time object detection with region proposal networks Adv Neural Inf Process Syst 28 91-undefined
[6]
Liang H(undefined)undefined undefined undefined undefined-undefined
[7]
Shi Y(undefined)undefined undefined undefined undefined-undefined
[8]
Feng F(undefined)undefined undefined undefined undefined-undefined
[9]
Wang XJN(undefined)undefined undefined undefined undefined-undefined
[10]
Zhu X(undefined)undefined undefined undefined undefined-undefined