Cross-Domain Few-Shot Classification based on Lightweight Res2Net and Flexible GNN

被引：14

作者：

Chen, Yu ^{[1
]}

Zheng, Yunan ^{[1
]}

Xu, Zhenyu ^{[1
]}

Tang, Tianhang ^{[1
]}

Tang, Zixin ^{[1
]}

Chen, Jie ^{[1
]}

Liu, Yiguang ^{[1
]}

机构：

[1] Sichuan Univ, Dept Comp Sci, Chengdu 610065, Sichuan, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2022年 / 247卷

关键词：

Cross-domain; Few-shot classification; GNN; Res2Net; Multi-scale representation;

D O I：

10.1016/j.knosys.2022.108623

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cross-Domain Few-Shot Classification aims to recognize new categories from unseen domains while each category has only a few support examples. But existing networks cannot be effectively applied to cross-domain scenario. To solve this problem, in this paper, we propose two new strategies, respectively for the encoder , the metric function of metric-based network: First, we propose a precise metric function named FGNN(Flexible GNN) to better measure the distance between images whether labeled or unlabeled; Second, based on the idea of multi-scale representation, we build a new hierarchical residual-like block which is applicable to lightweight ResNet structures such as ResNet-10. The constructed network-LR2Net(Lightweight Res2Net), performs much better than ResNet and provides a new scale-based strategy to constantly increase precision. Various feature encoders combined with metric function GNN or FGNN are verified through a lot of contrast experiments using leave-one-out setting on four datasets-CUB, Cars, Places and Plantae. As a result, the highest average precision of our combined networks achieves up to 2.22% and 2.26% improvement compared to the state-of-art under the 5-way 1-shot and 5-way 5-shot cross-domain classification. (C)& nbsp;2022 Elsevier B.V. All rights reserved.

引用

页数：12

共 43 条

[1]

[Anonymous], 2010, Caltech-UCSD Birds 200

[2]

Asperti Andrea, 2018, International Journal of Neural Networks and Advanced Applications, V5, P17

[3]

Bousmalis K, 2016, ADV NEUR IN, V29

[4] How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks) [J].

Bulat, Adrian ;

Tzimiropoulos, Georgios .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1021-1030

[5]

Cai J., 2020, ARXIV200510544

[6] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[7]

Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709

[8]

Chen W.-Y., 2019, ICLR POSTER

[9]

Cheng B., 2019, ARXIV191004751

[10]

Codella N.C.F., 2019, ARXIV

← 1 2 3 4 5 →