IMAGE CAPTIONING WITH ATTRIBUTE REFINEMENT

Cited by: 0
Authors
Huang, Yiqing [1]
Li, Cong [1]
Li, Tianpeng [1]
Wan, Weitao [1]
Chen, Jiansheng [1]
Affiliations
[1] Tsinghua Univ, Beijing 100084, Peoples R China
Source
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019
Funding
National Natural Science Foundation of China
Keywords
Image captioning; attribute recognition; Semantic attention; Deep Neural Network; Conditional Random Field;
DOI
10.1109/icip.2019.8803108
Chinese Library Classification number
TB8 [Photographic technology]
Discipline classification code
0804
Abstract
Semantic attention has long been used in image captioning models to improve captioning performance. Models pre-trained for attribute recognition are used to generate the image attributes for captioning. Generally, these models are not jointly trained with the image captioning models. In this paper, we propose an attribute refinement network, which incorporates attribute recognition with image captioning to boost performance on both tasks. We model the correlation between attributes using the semantic information from image captioning to improve recognition accuracy. In turn, better attribute recognition results effectively enhance image captioning performance. Our model achieves CIDEr-D and SPICE scores of 115.1 and 20.9, respectively, on the MS COCO test set, improving over all compared methods.
Pages: 1820-1824
Number of pages: 5