Meta-collaborative comparison for effective cross-domain few-shot learning

被引：2

作者：

Zhou, Fei ^{[1
]}

Wang, Peng ^{[2
]}

Zhang, Lei ^{[1
]}

Wei, Wei ^{[1
]}

Zhang, Yanning ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China

[2] Univ Elect Sci & Technol China UESTC, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 156卷

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Cross-domain few-shot learning; Meta-learning; Deep neural network;

D O I：

10.1016/j.patcog.2024.110790

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent advancements in cross-domain few-shot learning (CD-FSL) primarily focus on learning to compare global representations between query and support images for classification. However, due to the notorious cross-domain semantic gap, the ideal global representations can be totally different across domains, thereby solely learning to compare global representations is not sufficient to achieve effective generalization in challenging cases. To mitigate this problem, we present a Me ta-collaborative Co mparison Net work (MeCoNet) for CD-FSL, which imitates humans to recognize unfamiliar objects through collaborative comparison on both global and local representations. Following this idea, paralleling with a conventional global comparison branch, we additionally feed random crops of both query and support images into a feature encoder to separately extract their local representations. Subsequently, we associate these local representations across images through bipartite graph matching for local comparison. Thanks to the complementary global and local comparisons, we can obtain a more generalizable classifier for CD-FSL by meta-integrating them for final prediction. Experimental results on eight benchmarks demonstrate that the proposed model generalizes to multiple target domains with state-of-the-art performance without the need for fine-tuning.

引用

页数：11

共 39 条

[1] Matching Feature Sets for Few-Shot Image Classification [J].

Afrasiyabi, Arman ;

Larochelle, Hugo ;

Lalonde, Jean-Francois ;

Gagne, Christian .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :9004-9014

[2]

Baik S, 2020, ADV NEUR IN, V33

[3]

Bardes Adrien, 2022, ICLR 2022 10 INT C L

[4]

Caron M, 2020, ADV NEUR IN, V33

[5] Spatial Structure Constraints for Weakly Supervised Semantic Segmentation [J].

Chen, Tao ;

Yao, Yazhou ;

Huang, Xingguo ;

Li, Zechao ;

Nie, Liqiang ;

Tang, Jinhui .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :1136-1148

[6]

Das Debasmit, 2021, INT C LEARN REPR

[7] Selecting Relevant Features from a Multi-domain Representation for Few-Shot Classification [J].

Dvornik, Nikita ;

Schmid, Cordelia ;

Mairal, Julien .

COMPUTER VISION - ECCV 2020, PT X, 2020, 12355 :769-786

[8] StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning [J].

Fu, Yuqian ;

Xie, Yu ;

Fu, Yanwei ;

Jiang, Yu-Gang .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :24575-24584

[9] Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data [J].

Fu, Yuqian ;

Fu, Yanwei ;

Jiang, Yu-Gang .

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :5326-5334

[10]

Garcia Victor., 2017, ARXIV171104043

← 1 2 3 4 →