IMAGE CAPTIONING WITH ATTRIBUTE REFINEMENT

被引:0
|
作者
Huang, Yiqing [1 ]
Li, Cong [1 ]
Li, Tianpeng [1 ]
Wan, Weitao [1 ]
Chen, Jiansheng [1 ]
机构
[1] Tsinghua Univ, Beijing 100084, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年
基金
中国国家自然科学基金;
关键词
Image captioning; attribute recognition; Semantic attention; Deep Neural Network; Conditional Random Field;
D O I
10.1109/icip.2019.8803108
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Semantic attention has long been adopted to image captioning models to enhance the image captioning performances. The models pre-trained for attribute recognition are utilized to generate image attributes in image captioning. Generally, these models are not jointly trained with image captioning models. In this paper, we propose attribute refinement network, which incorporates attribute recognition with image captioning to boost the performance on both tasks. We model the correlation between attributes with the semantic information from image captioning to improve the recognition accuracy. In turn, better attribute recognition results effectively enhance image captioning performance. Our model achieves CIDEr-D/SPICE scores of 115.1 and 20.9 respectively on the MS COCO test set, comprehensively yields improvement over all compared methods.
引用
收藏
页码:1820 / 1824
页数:5
相关论文
共 50 条
  • [21] Image Captioning with Relational Knowledge
    Yang, Huan
    Song, Dandan
    Liao, Lejian
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2018, 11013 : 378 - 386
  • [22] Image Captioning with Memorized Knowledge
    Hui Chen
    Guiguang Ding
    Zijia Lin
    Yuchen Guo
    Caifeng Shan
    Jungong Han
    Cognitive Computation, 2021, 13 : 807 - 820
  • [23] Deep Image Captioning: An Overview
    Hrga, I.
    Ivasic-Kos, M.
    2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 995 - 1000
  • [24] Image Captioning by Asking Questions
    Yang, Xiaoshan
    Xu, Changsheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (02)
  • [25] Boosted Transformer for Image Captioning
    Li, Jiangyun
    Yao, Peng
    Guo, Longteng
    Zhang, Weicun
    APPLIED SCIENCES-BASEL, 2019, 9 (16):
  • [26] Comparative Study on Image Captioning
    Patel, Hardik K.
    Rathod, Jagdish M.
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2022, 13 (04): : 874 - 884
  • [27] Image Captioning With Visual-Semantic Double Attention
    He, Chen
    Hu, Haifeng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (01)
  • [28] Visuals to Text: A Comprehensive Review on Automatic Image Captioning
    Ming, Yue
    Hu, Nannan
    Fan, Chunxiao
    Feng, Fan
    Zhou, Jiangwan
    Yu, Hui
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 9 (08) : 1339 - 1365
  • [29] A Survey on Enhancing Image Captioning with Advanced Strategies and Techniques
    Thobhani, Alaa
    Zou, Beiji
    Kui, Xiaoyan
    Abdussalam, Amr
    Asim, Muhammad
    Shah, Sajid
    Elaffendi, Mohammed
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2025, 142 (03): : 2247 - 2280
  • [30] Research Progress on Image Captioning
    Li Z.
    Wei H.
    Zhang C.
    Ma H.
    Shi Z.
    1951, Science Press (58): : 1951 - 1974