PaCL: Part-level Contrastive Learning for Fine-grained Few-shot Image Classification

被引:13
作者
Wang, Chuanming [1 ]
Fu, Huiyuan [1 ]
Ma, Huadong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年
基金
中国国家自然科学基金;
关键词
few-shot; fine-grained; image classification; contrastive learning; NETWORK;
D O I
10.1145/3503161.3547997
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Recently, it is gaining increasingly attention to incorporate self-supervised technologies into few-shot learning. Previous methods have exclusively focused on image-level self-supervision, but they ignore that capturing subtle part features plays an important role in distinguishing fine-grained images. In this paper, we propose an approach named PaCL that embeds part-level contrastive learning into fine-grained few-shot image classification, strengthening the models' capability to extract discriminative features from indistinguishable images. PaCL treats parts as the inputs of contrastive learning, and it uses a transformation module to involve image-specific information into pre-defined meta parts, generating multiple features from each meta part depending on different images. To alleviate the impact of changes in views or occlusions, we propose to adopt part prototypes in contrastive learning. Part prototypes are generated by aggregating the features of each certain type of part, which are more reliable than directly using part features. A few-shot classifier is adopted to predict query images, which calculates the classification loss to optimize the transformation module and meta parts in conjunction with the loss calculated in contrastive learning. The optimization process will enforce the model to learn to extract discriminative and diverse features from different parts of the objects, even for the samples of unseen classes. Extensive studies show that our proposed method improves the performance of fine-grained few-shot image classification across several backbones, datasets, and tasks, achieving superior results compared with state-of-the-art methods.
引用
收藏
页码:6416 / 6424
页数:9
相关论文
共 45 条
[1]  
[Anonymous], 2020, IEEE C COMP VIS PATT, DOI DOI 10.1109/CEIDP49254.2020.9437448
[2]   SELF-SUPERVISED LEARNING FOR FEW-SHOT IMAGE CLASSIFICATION [J].
Chen, Da ;
Chen, Yuefeng ;
Li, Yuhong ;
Mao, Feng ;
He, Yuan ;
Xue, Hui .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :1745-1749
[3]  
Chen Ting, 2020, ICML, P1597
[4]   Steady-state Non-Line-of-Sight Imaging [J].
Chen, Wenzheng ;
Daneau, Simon ;
Mannan, Fahim ;
Heide, Felix .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3783-6792
[5]   Exploring Simple Siamese Representation Learning [J].
Chen, Xinlei ;
He, Kaiming .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15745-15753
[6]   Imposing Semantic Consistency of Local Descriptors for Few-Shot Learning [J].
Cheng, Jun ;
Hao, Fusheng ;
Liu, Liu ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :1587-1600
[7]  
Finn C, 2017, PR MACH LEARN RES, V70
[8]   Boosting Few-Shot Visual Learning with Self-Supervision [J].
Gidaris, Spyros ;
Bursuc, Andrei ;
Komodakis, Nikos ;
Perez, Patrick ;
Cord, Matthieu .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8058-8067
[9]  
Grill J.-B., 2020, arXiv, V33, P21271
[10]   Collect and Select: Semantic Alignment Metric Learning for Few-Shot Learning [J].
Hao, Fusheng ;
He, Fengxiang ;
Cheng, Jun ;
Wang, Lei ;
Cao, Jianzhong ;
Tao, Dacheng .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8459-8468