Label-aware graph representation learning for multi-label image classification

被引:13
作者
Chen, Yilu [1 ]
Zou, Changzhong [1 ]
Chen, Jianli [2 ]
机构
[1] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Fujian, Peoples R China
[2] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label image classification; Graph neural network; Graph representation; Semantic decoupling;
D O I
10.1016/j.neucom.2022.04.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label image classification (MLIC) is a quintessential but challenging issue in the field of Computer Vision. Since the label co-occurrence is a crucial component of MLIC, previous existing approaches resort to the label co-occurrence for either modeling label correlations or modeling visual feature relationships. However, these methods ignore either the feature interaction or the label characteristics in MLIC. In this paper, we propose a label-aware graph representation learning (LGR) for MLIC that can explore the label interaction via a graph neural network built on the label co-occurrence and mine the feature correlations via another graph neural network also based on the label co-occurrence. Moreover, to decouple semantic visual features, current approaches resort to the word embedding guided semantic decoupling methods. However, the word embedding cannot clearly represent the label semantic information of MLIC. Hence, we reconstruct the semantic decoupling method by using the graph label representation. Extensive experiments on three benchmark datasets well demonstrate that our proposed framework can signifi-cantly achieve the state-of-the-art performance. In addition, a series of ablative studies further demon-strate the positive impacts of our proposed model.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:50 / 61
页数:12
相关论文
共 50 条
[1]  
[Anonymous], 2016, IEEE T MULTIMEDIA
[2]  
[Anonymous], 2007, INT J DATA WAREHOUSI
[3]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[4]  
Chen JJ, 2014, INT C TRANS OPT NETW
[5]  
Chen SF, 2018, AAAI CONF ARTIF INTE, P6714
[6]   Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition [J].
Chen, Tianshui ;
Xu, Muxin ;
Hui, Xiaolu ;
Wu, Hefeng ;
Lin, Liang .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :522-531
[7]  
Chen TS, 2018, AAAI CONF ARTIF INTE, P6730
[8]   Multi-Label Image Recognition with Graph Convolutional Networks [J].
Chen, Zhao-Min ;
Wei, Xiu-Shen ;
Wang, Peng ;
Guo, Yanwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5172-5181
[9]   A survey and analysis on automatic image annotation [J].
Cheng, Qimin ;
Zhang, Qian ;
Fu, Peng ;
Tu, Conghuan ;
Li, Sen .
PATTERN RECOGNITION, 2018, 79 :242-259
[10]  
Cho K., 2014, ARXIV14061078, DOI [10.48550/arXiv.1406.1078, DOI 10.3115/V1/D14-1179]