SRCPT: Spatial Reconstruction Contrastive Pretext Task for Improving Few-Shot Image Classification

被引：0

作者：

Wang, ZhenBang ^{[1
]}

Duan, PengFei ^{[1
]}

Rong, Yi ^{[1
,2
]}

机构：

[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan, Hubei, Peoples R China

[2] Wuhan Univ Technol, Sanya Sci & Educ Innovat Pk, Sanya, Hainan, Peoples R China

来源：

2024 16TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, ICMLC 2024 | 2024年

关键词：

few-shot learning; local spatial feature reconstruction; contrastive learning;

D O I：

10.1145/3651671.3651701

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Self-supervised learning (SSL) has been widely applied in the pretraining phase of models. Among these SSL methods, the various data augmentation used in contrastive learning for constructing positive and negative sample pairs conveniently contribute to alleviating the issue of data scarcity in few-shot learning (FSL) tasks. Therefore, many approaches have introduced contrastive learning into FSL tasks. However, most of these methods only utilize the global embedding information of the entire image, making it challenging to capture and fully leverage the local visual information and structural details of image samples. To address this, we proposes a novel Spatial Reconstruction Contrastive Pretext Task (SRCPT) to enhance the FSL training objective. By constructing a two-branch network, the model can use local patches of the image for feature map reconstruction and employ spatial reconstruction weights to create a contrastive learning objective. The enhanced FSL objective of SRCPT encourages the model to capture more transferable spatial structures and local feature information, enabling the model to adapt well to new categories even with a few samples. Extensive experiments demonstrate that our proposed SRCPT method achieves state-of-the-art performance in three popular benchmark datasets across three types of few-shot image classification tasks.

引用

页码：424 / 432

页数：9

共 31 条

[1]

Chen HX, 2020, Arxiv, DOI arXiv:2011.14479

[2]

Chen Wei-Yu, 2019, P INT C LEARN REPR, DOI DOI 10.1109/MSR.2015.54

[3]

Chen Y., 2020, A new meta-baseline for few-shot learning

[4]

Finn C, 2017, PR MACH LEARN RES, V70

[5] Boosting Few-Shot Visual Learning with Self-Supervision [J].

Gidaris, Spyros ;

Bursuc, Andrei ;

Komodakis, Nikos ;

Perez, Patrick ;

Cord, Matthieu .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8058-8067

[6] Dynamic Few-Shot Visual Learning without Forgetting [J].

Gidaris, Spyros ;

Komodakis, Nikos .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4367-4375

[7] A Broad Study on the Transferability of Visual Representations with Contrastive Learning [J].

Islam, Ashraful ;

Chen, Chun-Fu ;

Panda, Rameswar ;

Karlinsky, Leonid ;

Radke, Richard ;

Feris, Rogerio .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :8825-8835

[8] Task Agnostic Meta-Learning for Few-Shot Learning [J].

Jamal, Muhammad Abdullah ;

Qi, Guo-Jun .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11711-11719

[9]

Jong-Chyi Su, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12352), P645, DOI 10.1007/978-3-030-58571-6_38

[10]

Koch G., 2015, P ICML

← 1 2 3 4 →