Learning label correlations for multi-label image recognition with graph networks

Cited by: 25
Authors
Li, Qing [1, 2]
Peng, Xiaojiang [2]
Qiao, Yu [2, 3]
Peng, Qiang [1]
Affiliations
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen Key Lab Comp Vis & Pattern Recognit, Shenzhen, Peoples R China
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, SIAT Branch, Shenzhen, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-label image recognition; Graph convolutional networks; Label correlation graph; Sparse correlation constraint;
DOI
10.1016/j.patrec.2020.07.040
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Multi-label image recognition is the task of predicting the set of object labels present in an image. Because objects co-occur in the physical world, it is desirable to model label dependencies. Existing methods resort to either recurrent networks or pre-defined label correlation graphs for this purpose. In this paper, instead of using a pre-defined graph, which is inflexible and may be sub-optimal for multi-label classification, we propose the A-GCN, which leverages the popular Graph Convolutional Networks (GCN) with an Adaptive label correlation graph to model label dependencies. Specifically, we introduce a plug-and-play Label Graph (LG) module to learn label correlations with word embeddings, and then use a traditional GCN to map this graph into label-dependent object classifiers, which are further applied to image features. The basic LG module incorporates two 1 × 1 convolutional layers and uses the dot product to generate label graphs. In addition, we propose a sparse correlation constraint to enhance the LG module, and also explore different LG architectures. We validate our method on two diverse multi-label datasets: MS-COCO and Fashion550K. Experimental results show that our A-GCN significantly improves on baseline methods and achieves performance superior or comparable to the state of the art. (C) 2020 Elsevier B.V. All rights reserved.
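The pipeline described in the abstract (label embeddings, two 1 × 1 projections, a dot-product label graph, a GCN that outputs per-label classifiers, and classifiers applied to image features) can be sketched in NumPy as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the sigmoid and row normalisation of the graph, the two-layer GCN with LeakyReLU, the L1 form of the sparsity penalty, and all function names and sizes are hypothetical choices that the abstract does not specify.

```python
import numpy as np

def label_graph(E, W_theta, W_phi):
    """Adaptive label graph built from label word embeddings E of shape (C, d).

    A 1 x 1 convolution over a set of label embeddings acts on each label
    independently, so each one is written here as a plain matrix product.
    """
    theta = E @ W_theta            # (C, k) first 1 x 1 projection
    phi = E @ W_phi                # (C, k) second 1 x 1 projection
    logits = theta @ phi.T         # (C, C) dot-product affinities
    # Assumption: squash affinities to (0, 1) with a sigmoid.
    return 1.0 / (1.0 + np.exp(-logits))

def gcn_classifiers(A, E, W1, W2, slope=0.2):
    """Two-layer GCN mapping label embeddings to classifiers of shape (C, D)."""
    # Assumption: row-normalise the graph before propagation.
    A_hat = A / A.sum(axis=1, keepdims=True)
    H = A_hat @ E @ W1
    H = np.where(H > 0, H, slope * H)   # LeakyReLU (assumed activation)
    return A_hat @ H @ W2

def l1_sparsity(A):
    """Assumed form of a sparse correlation constraint: mean absolute edge weight."""
    return np.abs(A).mean()

# Toy sizes (hypothetical): C labels, d-dim embeddings, D-dim image features.
rng = np.random.default_rng(0)
C, d, k, hidden, D, batch = 5, 8, 4, 6, 10, 3
E = rng.normal(size=(C, d))
W_theta = rng.normal(size=(d, k)) * 0.1
W_phi = rng.normal(size=(d, k)) * 0.1
W1 = rng.normal(size=(d, hidden)) * 0.1
W2 = rng.normal(size=(hidden, D)) * 0.1

A = label_graph(E, W_theta, W_phi)            # learned label correlation graph
classifiers = gcn_classifiers(A, E, W1, W2)   # one classifier per label
features = rng.normal(size=(batch, D))        # stand-in for CNN image features
scores = features @ classifiers.T             # (batch, C) per-label logits
penalty = l1_sparsity(A)                      # would be added to the training loss
```

In a trainable version the projection and GCN weights would be learned jointly with the image backbone, so the graph adapts to the data instead of being fixed from co-occurrence statistics.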
Pages: 378 - 384
Page count: 7
Related papers
50 in total
  • [41] Multi-label classification by exploiting label correlations
    Yu, Ying
    Pedrycz, Witold
    Miao, Duoqian
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (06) : 2989 - 3004
  • [42] Disentangling, Embedding and Ranking Label Cues for Multi-Label Image Recognition
    Chen, Zhao-Min
    Cui, Quan
    Wei, Xiu-Shen
    Jin, Xin
    Guo, Yanwen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1827 - 1840
  • [43] Label Co-Occurrence Learning With Graph Convolutional Networks for Multi-Label Chest X-Ray Image Classification
    Chen, Bingzhi
    Li, Jinxing
    Lu, Guangming
    Yu, Hongbing
    Zhang, David
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (08) : 2292 - 2302
  • [44] Semantic-Aware Graph Matching Mechanism for Multi-Label Image Recognition
    Wu, Yanan
    Feng, Songhe
    Wang, Yang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6788 - 6803
  • [45] Transformer-based Dual Relation Graph for Multi-label Image Recognition
    Zhao, Jiawei
    Yan, Ke
    Zhao, Yifan
    Guo, Xiaowei
    Huang, Feiyue
    Li, Jia
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021 : 163 - 172
  • [46] Graph attention mechanism with global contextual information for multi-label image recognition
    Ban, Xiaoxiao
    Li, Peihua
    Wang, Qilong
    Zhou, Shoujun
    Guo, Shijie
    Wang, Yuanquan
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (06)
  • [47] STMG: Swin transformer for multi-label image recognition with graph convolution network
    Wang, Yangtao
    Xie, Yanzhao
    Fan, Lisheng
    Hu, Guangxing
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (12) : 10051 - 10063
  • [48] STMG: Swin transformer for multi-label image recognition with graph convolution network
    Yangtao Wang
    Yanzhao Xie
    Lisheng Fan
    Guangxing Hu
    Neural Computing and Applications, 2022, 34 : 10051 - 10063
  • [49] Capsule Graph Neural Network for Multi-Label Image Recognition (Student Abstract)
    Zheng, Xiangping
    Liang, Xun
    Wu, Bo
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022 : 13117 - 13118
  • [50] Multi-Label Classification with Label Graph Superimposing
    Wang, Ya
    He, Dongliang
    Li, Fu
    Long, Xiang
    Zhou, Zhichao
    Ma, Jinwen
    Wen, Shilei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12265 - 12272