Learning label correlations for multi-label image recognition with graph networks

被引:28
作者
Li, Qing [1 ,2 ]
Peng, Xiaojiang [2 ]
Qiao, Yu [2 ,3 ]
Peng, Qiang [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Comp Vis & Pattern Recognit, Shenzhen, Peoples R China
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, SIAT Branch, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label image recognition; Graph convolutional networks; Label correlation graph; Sparse correlation constraint;
D O I
10.1016/j.patrec.2020.07.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label image recognition is a task that predicts a set of object labels in an image. As the objects co-occur in the physical world, it is desirable to model label dependencies. Previous existing methods resort to either recurrent networks or pre-defined label correlation graphs for this purpose. In this paper, instead of using a pre-defined graph which is inflexible and may be sub-optimal for multi-label classification, we propose the A-GCN, which leverages the popular Graph Convolutional Networks with an Adaptive label correlation graph to model label dependencies. Specifically, we introduce a plug-and-play Label Graph (LG) module to learn label correlations with word embeddings, and then utilize traditional GCN to map this graph into label-dependent object classifiers which are further applied to image features. The basic LG module incorporates two 1 x 1 convolutional layers and uses the dot product to generate label graphs. In addition, we propose a sparse correlation constraint to enhance the LG module, and also explore different LG architectures. We validate our method on two diverse multi-label datasets: MS-COCO and Fashion550K. Experimental results show that our A-GCN significantly improves baseline methods and achieves performance superior or comparable to the state of the art. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:378 / 384
页数:7
相关论文
共 44 条
[1]  
[Anonymous], 2016, arXiv
[2]  
[Anonymous], 2017, CVPR
[3]  
[Anonymous], 2014, ARXIV PREPRINT ARXIV
[4]  
[Anonymous], 2008, ISMIR
[5]  
[Anonymous], 2013, ARXIV13124894
[6]   Matrix Completion for Weakly-Supervised Multi-Label Image Classification [J].
Cabral, Ricardo ;
De la Torre, Fernando ;
Costeira, Joao Paulo ;
Bernardino, Alexandre .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (01) :121-135
[7]  
Chen Q, 2012, PROC CVPR IEEE, P3426, DOI 10.1109/CVPR.2012.6248083
[8]  
Chen SF, 2018, AAAI CONF ARTIF INTE, P6714
[9]  
Chen T., 2018, 32 AAAI C ART INT
[10]  
Chen Zhiqin, 2019, P IEEE CVF C COMP VI, DOI DOI 10.1109/CVPR.2019.00532