IMAGE CAPTIONING WITH ATTRIBUTE REFINEMENT

Cited by: 0

Authors:

Huang, Yiqing [1]

Li, Cong [1]

Li, Tianpeng [1]

Wan, Weitao [1]

Chen, Jiansheng [1]

Affiliations:

[1] Tsinghua Univ, Beijing 100084, Peoples R China

Source:

2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019

Funding:

National Natural Science Foundation of China;

Keywords:

Image captioning; attribute recognition; Semantic attention; Deep Neural Network; Conditional Random Field;

DOI:

10.1109/icip.2019.8803108

Chinese Library Classification (CLC) number:

TB8 [Photographic technology];

Discipline classification code:

0804 ;

Abstract:

Semantic attention has long been adopted in image captioning models to enhance captioning performance. Models pre-trained for attribute recognition are used to generate the image attributes for captioning. Generally, these models are not jointly trained with the image captioning models. In this paper, we propose an attribute refinement network, which incorporates attribute recognition with image captioning to boost the performance on both tasks. We model the correlation between attributes using the semantic information from image captioning to improve recognition accuracy. In turn, better attribute recognition results effectively enhance image captioning performance. Our model achieves CIDEr-D and SPICE scores of 115.1 and 20.9, respectively, on the MS COCO test set, improving over all compared methods.


Pages: 1820-1824

Page count: 5
