Cross-Domain Few-Shot Classification based on Lightweight Res2Net and Flexible GNN

被引:14
作者
Chen, Yu [1 ]
Zheng, Yunan [1 ]
Xu, Zhenyu [1 ]
Tang, Tianhang [1 ]
Tang, Zixin [1 ]
Chen, Jie [1 ]
Liu, Yiguang [1 ]
机构
[1] Sichuan Univ, Dept Comp Sci, Chengdu 610065, Sichuan, Peoples R China
关键词
Cross-domain; Few-shot classification; GNN; Res2Net; Multi-scale representation;
D O I
10.1016/j.knosys.2022.108623
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cross-Domain Few-Shot Classification aims to recognize new categories from unseen domains while each category has only a few support examples. But existing networks cannot be effectively applied to cross-domain scenario. To solve this problem, in this paper, we propose two new strategies, respectively for the encoder , the metric function of metric-based network: First, we propose a precise metric function named FGNN(Flexible GNN) to better measure the distance between images whether labeled or unlabeled; Second, based on the idea of multi-scale representation, we build a new hierarchical residual-like block which is applicable to lightweight ResNet structures such as ResNet-10. The constructed network-LR2Net(Lightweight Res2Net), performs much better than ResNet and provides a new scale-based strategy to constantly increase precision. Various feature encoders combined with metric function GNN or FGNN are verified through a lot of contrast experiments using leave-one-out setting on four datasets-CUB, Cars, Places and Plantae. As a result, the highest average precision of our combined networks achieves up to 2.22% and 2.26% improvement compared to the state-of-art under the 5-way 1-shot and 5-way 5-shot cross-domain classification. (C)& nbsp;2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 43 条
[1]  
[Anonymous], 2010, Caltech-UCSD Birds 200
[2]  
Asperti Andrea, 2018, International Journal of Neural Networks and Advanced Applications, V5, P17
[3]  
Bousmalis K, 2016, ADV NEUR IN, V29
[4]   How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks) [J].
Bulat, Adrian ;
Tzimiropoulos, Georgios .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1021-1030
[5]  
Cai J., 2020, ARXIV200510544
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]  
Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[8]  
Chen W.-Y., 2019, ICLR POSTER
[9]  
Cheng B., 2019, ARXIV191004751
[10]  
Codella N.C.F., 2019, ARXIV