Multi-Label Image Classification Based on Object Detection and Dynamic Graph Convolutional Networks

被引:0
|
作者
Liu, Xiaoyu [1 ]
Hu, Yong [1 ]
机构
[1] Sichuan Univ, Sch Cyber Sci & Engn, Chengdu 610207, Peoples R China
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2024年 / 80卷 / 03期
关键词
Deep learning; multi-label image recognition; object detection; graph convolution networks;
D O I
10.32604/cmc.2024.053938
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label image classification is recognized as an important task within the field of computer vision, a discipline that has experienced a significant escalation in research endeavors in recent years. The widespread adoption of convolutional neural networks (CNNs) has catalyzed the remarkable success of architectures such as ResNet-101 within the domain of image classification. However, in multi-label image classification tasks, it is crucial to consider the correlation between labels. In order to improve the accuracy and performance of multi-label classification and fully combine visual and semantic features, many existing studies use graph convolutional networks (GCN) for modeling. Object detection and multi-label image classification exhibit a degree of conceptual overlap; however, the integration of these two tasks within a unified framework has been relatively underexplored in the existing literature. In this paper, we come up with Object-GCN framework, a model combining object detection network YOLOv5 and graph convolutional network, and we carry out a thorough experimental analysis using a range of well-established public datasets. The designed framework Object-GCN achieves significantly better performance than existing studies in public datasets COCO2014, VOC2007, VOC2012. The final results achieved are 86.9%, 96.7%, and 96.3% mean Average Precision (mAP) across the three datasets.
引用
收藏
页码:4413 / 4432
页数:20
相关论文
共 50 条
  • [1] Multi-Label Image Recognition with Graph Convolutional Networks
    Chen, Zhao-Min
    Wei, Xiu-Shen
    Wang, Peng
    Guo, Yanwen
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5172 - 5181
  • [2] Multiple Semantic Embedding with Graph Convolutional Networks for Multi-Label Image Classification
    Zhou, Tong
    Feng, Songhe
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 449 - 461
  • [3] An Attention-Driven Multi-label Image Classification with Semantic Embedding and Graph Convolutional Networks
    Sun, Dengdi
    Ma, Leilei
    Ding, Zhuanlian
    Luo, Bin
    COGNITIVE COMPUTATION, 2023, 15 (04) : 1308 - 1319
  • [4] An Attention-Driven Multi-label Image Classification with Semantic Embedding and Graph Convolutional Networks
    Dengdi Sun
    Leilei Ma
    Zhuanlian Ding
    Bin Luo
    Cognitive Computation, 2023, 15 : 1308 - 1319
  • [5] Modular Graph Transformer Networks for Multi-Label Image Classification
    Nguyen, Hoang D.
    Vu, Xuan-Son
    Le, Duc-Trong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9092 - 9100
  • [6] Label Correlation Based Graph Convolutional Network for Multi-label Text Classification
    Huy-The Vu
    Minh-Tien Nguyen
    Van-Chien Nguyen
    Manh-Tran Tien
    Van-Hau Nguyen
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [7] Active learning in multi-label image classification with graph convolutional network embedding
    Xie, Xiurui
    Tian, Maojun
    Luo, Guangchun
    Liu, Guisong
    Wu, Yizhe
    Qin, Ke
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 148 : 56 - 65
  • [8] Hierarchical Multi-Label Attribute Classification With Graph Convolutional Networks on Anime Illustration
    Lan, Ziwen
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    IEEE ACCESS, 2023, 11 : 35447 - 35456
  • [9] Multi-label classification of fundus images based on graph convolutional network
    Yinlin Cheng
    Mengnan Ma
    Xingyu Li
    Yi Zhou
    BMC Medical Informatics and Decision Making, 21
  • [10] Multi-label classification of fundus images based on graph convolutional network
    Cheng, Yinlin
    Ma, Mengnan
    Li, Xingyu
    Zhou, Yi
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (SUPPL 2)