Global-guided weakly-supervised learning for multi-label image classification*

被引:4
作者
Dai, Yong [1 ]
Song, Weiwei [1 ]
Gao, Zhi [2 ]
Fang, Leyuan [1 ,3 ]
机构
[1] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[3] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China
关键词
Global correlation Feature disentanglement Label-related regions Weakly-supervised learning Multi-label classification;
D O I
10.1016/j.jvcir.2023.103823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label classification with region-free labels is attracting increasing attention compared to that with region-based labels due to the time-consuming manual region-labeling process. Existing methods usually employ attention-based technology to discover the conspicuous label-related regions in a weakly-supervised manner with only image-level region-free labels, while the region covering is not precise without exploring global clues of multi-level features. To address this issue, a novel Global-guided Weakly-Supervised Learning (GWSL) method for multi-label classification is proposed. The GWSL first extracts the multi-level features to estimate their global correlation map which is further utilized to guide feature disentanglement in the proposed Feature Disentanglement and Localization (FDL) networks. Specifically, the FDL networks then adaptively combine the different correlated features and localize the fine-grained features for identifying multiple labels. The proposed method is optimized in an end-to-end manner under weakly supervision with only image-level labels. Experimental results demonstrate that the proposed method outperforms the state-of-the-arts for multi-label learning problems on several publicly available image datasets. To facilitate similar researches in the future, the codes are directly available online at https://github.com/Yong-DAI/GWSL.
引用
收藏
页数:10
相关论文
共 54 条
[1]   Combining instance-based learning and logistic regression for multilabel classification [J].
Cheng, Weiwei ;
Huellermeier, Eyke .
MACHINE LEARNING, 2009, 76 (2-3) :211-225
[2]   Feature disentangling and reciprocal learning with label-guided similarity for multi-label image retrieval [J].
Dai, Yong ;
Song, Weiwei ;
Li, Yi ;
Di Stefano, Luigi .
NEUROCOMPUTING, 2022, 511 :353-365
[3]   Skip-connected network with gram matrix for product image retrieval [J].
Dai, Yong ;
Li, Yi ;
Sun, Bin ;
Liu, Li-Jun .
NEUROCOMPUTING, 2021, 447 :307-318
[4]   Multi-label learning for concept-oriented labels of product image data [J].
Dai, Yong ;
Li, Yi ;
Li, Shu-Tao .
IMAGE AND VISION COMPUTING, 2020, 93
[5]   Pedestrian Attribute Recognition At Far Distance [J].
Deng, Yubin ;
Luo, Ping ;
Loy, Chen Change ;
Tang, Xiaoou .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :789-792
[6]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[7]   Multi-label learning: a review of the state of the art and ongoing research [J].
Gibaja, Eva ;
Ventura, Sebastian .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 4 (06) :411-444
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[10]   Learning discriminative representations for multi-label image recognition [J].
Hassanin, Mohammed ;
Radwan, Ibrahim ;
Khan, Salman ;
Tahtali, Murat .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 83