Learning label correlations for multi-label image recognition with graph networks

被引:25
作者
Li, Qing [1 ,2 ]
Peng, Xiaojiang [2 ]
Qiao, Yu [2 ,3 ]
Peng, Qiang [1 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Comp Vis & Pattern Recognit, Shenzhen, Peoples R China
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, SIAT Branch, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label image recognition; Graph convolutional networks; Label correlation graph; Sparse correlation constraint;
D O I
10.1016/j.patrec.2020.07.040
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label image recognition is a task that predicts a set of object labels in an image. As the objects co-occur in the physical world, it is desirable to model label dependencies. Previous existing methods resort to either recurrent networks or pre-defined label correlation graphs for this purpose. In this paper, instead of using a pre-defined graph which is inflexible and may be sub-optimal for multi-label classification, we propose the A-GCN, which leverages the popular Graph Convolutional Networks with an Adaptive label correlation graph to model label dependencies. Specifically, we introduce a plug-and-play Label Graph (LG) module to learn label correlations with word embeddings, and then utilize traditional GCN to map this graph into label-dependent object classifiers which are further applied to image features. The basic LG module incorporates two 1 x 1 convolutional layers and uses the dot product to generate label graphs. In addition, we propose a sparse correlation constraint to enhance the LG module, and also explore different LG architectures. We validate our method on two diverse multi-label datasets: MS-COCO and Fashion550K. Experimental results show that our A-GCN significantly improves baseline methods and achieves performance superior or comparable to the state of the art. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:378 / 384
页数:7
相关论文
共 44 条
  • [1] [Anonymous], 2016, arXiv
  • [2] [Anonymous], 2017, CVPR
  • [3] [Anonymous], 2014, ARXIV PREPRINT ARXIV
  • [4] [Anonymous], 2008, ISMIR
  • [5] [Anonymous], 2013, ARXIV13124894
  • [6] Matrix Completion for Weakly-Supervised Multi-Label Image Classification
    Cabral, Ricardo
    De la Torre, Fernando
    Costeira, Joao Paulo
    Bernardino, Alexandre
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (01) : 121 - 135
  • [7] Chen Q, 2012, PROC CVPR IEEE, P3426, DOI 10.1109/CVPR.2012.6248083
  • [8] Chen SF, 2018, AAAI CONF ARTIF INTE, P6714
  • [9] Chen T., 2018, 32 AAAI C ART INT
  • [10] Chen Zhiqin, 2019, P IEEE CVF C COMP VI, DOI DOI 10.1109/CVPR.2019.00532