A Unified Modular Framework with Deep Graph Convolutional Networks for Multi-label Image Recognition

被引:0
作者
Lin, Qifan [1 ,2 ]
Chen, Zhaoliang [1 ,2 ]
Wang, Shiping [1 ,2 ]
Guo, Wenzhong [1 ,2 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Fujian, Peoples R China
[2] Fuzhou Univ, Fujian Prov Key Lab Network Comp & Intelligent In, Fuzhou, Fujian, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II | 2021年 / 13020卷
基金
中国国家自然科学基金;
关键词
Multi-label image recognition; Convolutional neural networks; Graph convolutional networks; Feature extraction;
D O I
10.1007/978-3-030-88007-1_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of handheld photographic devices, a large number of unlabeled images have been uploaded to the Internet. In order to retrieve these images, image recognition techniques have become particularly important. As there is often more than one object in a picture, multi-label image annotation techniques are of practical interest. To enhance its performance by fully exploiting the interrelationships between labels, we propose a unified modular framework with deep graph convolutional networks (MDGCN). It consists of two modules for extracting image features and label semantic respectively, after which the features are fused to obtain the final recognition results. With classical multi-label soft-margin loss, our model can be trained in an endto-end schema. It is important to note that a deep graph convolutional network is used in our framework to learn semantic associations. Moreover, a special normalization method is employed to strengthen its own connection and avoid features from disappearing in the deep graph network propagation. The results of experiments on two multi-label image classification benchmark datasets show that our framework has advanced performance compared to the state-of-the-art methods.
引用
收藏
页码:54 / 65
页数:12
相关论文
共 24 条
[1]  
Chen SF, 2018, AAAI CONF ARTIF INTE, P6714
[2]   Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition [J].
Chen, Tianshui ;
Xu, Muxin ;
Hui, Xiaolu ;
Wu, Hefeng ;
Lin, Liang .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :522-531
[3]  
Chen TS, 2018, AAAI CONF ARTIF INTE, P6730
[4]   Multi-Label Image Recognition with Graph Convolutional Networks [J].
Chen, Zhao-Min ;
Wei, Xiu-Shen ;
Wang, Peng ;
Guo, Yanwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5172-5181
[5]   Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks [J].
Chiang, Wei-Lin ;
Liu, Xuanqing ;
Si, Si ;
Li, Yang ;
Bengio, Samy ;
Hsieh, Cho-Jui .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :257-266
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[8]   Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based onWeakly Supervised Learning [J].
Ge, Weifeng ;
Yang, Sibei ;
Yu, Yizhou .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1277-1286
[9]  
Gong Y., 2014, INT C LEARN REPR
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778