Generating a description for an image/video is termed as the visual captioning task. It requires the model to capture the semantic information of visual content and translate them into syntactically and semantically human language. Connecting both research communities of computer vision (CV) and natural language processing (NLP), visual captioning presents the big challenge to bridge the gap between low-level visual features and high-level language information. Thanks to recent advances in deep learning, which are widely applied to the fields of visual and language modeling, the visual captioning methods depending on the deep neural networks has demonstrated state-of-the-art performances. In this paper, we aim to present a comprehensive survey of existing deep learning-based visual captioning methods. Relying on the adopted mechanism and technique to narrow the semantic gap, we divide visual captioning methods into various groups. Representative categories in each group are summarized, and their strengths and limitations are discussed. The quantitative evaluations of state-of-the-art approaches on popular benchmark datasets are also presented and analyzed. Furthermore, we provide the discussions on future research directions.
机构:
Yunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
Yunnan Univ, Engn Res Ctr Cyberspace, Kunming, Yunnan, Peoples R ChinaYunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
Song, Bingbing
Wei, Ping
论文数: 0引用数: 0
h-index: 0
机构:
Yunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
Yunnan Univ, Engn Res Ctr Cyberspace, Kunming, Yunnan, Peoples R ChinaYunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
Wei, Ping
Wu, Sixing
论文数: 0引用数: 0
h-index: 0
机构:
Yunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
Yunnan Univ, Engn Res Ctr Cyberspace, Kunming, Yunnan, Peoples R ChinaYunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
Wu, Sixing
Lin, Yu
论文数: 0引用数: 0
h-index: 0
机构:
Kunming Inst Phys, Kunming, Yunnan, Peoples R ChinaYunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
Lin, Yu
Zhou, Wei
论文数: 0引用数: 0
h-index: 0
机构:
Yunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
Yunnan Univ, Engn Res Ctr Cyberspace, Kunming, Yunnan, Peoples R ChinaYunnan Univ, Natl Pilot Sch Software, Kunming, Yunnan, Peoples R China
机构:
Department of Computer Science and Engineering, Ahsanullah University of Science and Technology, DhakaDepartment of Computer Science and Engineering, Ahsanullah University of Science and Technology, Dhaka
Dash A.
Seum A.
论文数: 0引用数: 0
h-index: 0
机构:
Department of Computer Science and Engineering, Ahsanullah University of Science and Technology, DhakaDepartment of Computer Science and Engineering, Ahsanullah University of Science and Technology, Dhaka
Seum A.
Raj A.H.
论文数: 0引用数: 0
h-index: 0
机构:
Department of Computer Science and Engineering, Ahsanullah University of Science and Technology, DhakaDepartment of Computer Science and Engineering, Ahsanullah University of Science and Technology, Dhaka
机构:
Yunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
Zhou, Fangrong
Wen, Gang
论文数: 0引用数: 0
h-index: 0
机构:
Yunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
Wen, Gang
Ma, Yi
论文数: 0引用数: 0
h-index: 0
机构:
Yunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
Ma, Yi
Geng, Hao
论文数: 0引用数: 0
h-index: 0
机构:
Yunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
Geng, Hao
Huang, Ran
论文数: 0引用数: 0
h-index: 0
机构:
Yunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
Huang, Ran
Pei, Ling
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai Key Lab Nav & Locat Based Serv, Shanghai 200240, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
Pei, Ling
Yu, Wenxian
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai Key Lab Nav & Locat Based Serv, Shanghai 200240, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
Yu, Wenxian
Chu, Lei
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai Key Lab Nav & Locat Based Serv, Shanghai 200240, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
Chu, Lei
Qiu, Robert
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai Key Lab Nav & Locat Based Serv, Shanghai 200240, Peoples R ChinaYunnan Power Grid Co Ltd, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China