A Broader Study of Cross-Domain Few-Shot Learning

被引:217
作者
Guo, Yunhui [1 ]
Codella, Noel C. [2 ]
Karlinsky, Leonid [2 ]
Codella, James V. [2 ]
Smith, John R. [2 ]
Saenko, Kate [3 ]
Rosing, Tajana [1 ]
Feris, Rogerio [2 ]
机构
[1] Univ Calif San Diego, San Diego, CA 92093 USA
[2] IBM Res AI, Cambridge, MA USA
[3] Boston Univ, Boston, MA USA
来源
COMPUTER VISION - ECCV 2020, PT XXVII | 2020年 / 12372卷
关键词
Cross-domain; Few-shot learning; Benchmark; Transfer learning;
D O I
10.1007/978-3-030-58583-9_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent progress on few-shot learning largely relies on annotated data for meta-learning: base classes sampled from the same domain as the novel classes. However, in many applications, collecting data for meta-learning is infeasible or impossible. This leads to the cross-domain few-shot learning problem, where there is a large shift between base and novel class domains. While investigations of the cross-domain few-shot scenario exist, these works are limited to natural images that still contain a high degree of visual similarity. No work yet exists that examines few-shot learning across different imaging methods seen in real world scenarios, such as aerial and medical imaging. In this paper, we propose the Broader Study of Cross-Domain Few-Shot Learning (BSCD-FSL) benchmark, consisting of image data from a diverse assortment of image acquisition methods. This includes natural images, such as crop disease images, but additionally those that present with an increasing dissimilarity to natural images, such as satellite images, dermatology images, and radiology images. Extensive experiments on the proposed benchmark are performed to evaluate state-of-art meta-learning approaches, transfer learning approaches, and newer methods for cross-domain few-shot learning. The results demonstrate that state-of-art meta-learning methods are surprisingly outperformed by earlier meta-learning approaches, and all meta-learning methods underperform in relation to simple fine-tuning by 12.8% average accuracy. In some cases, meta-learning even underperforms networks with random weights. Performance gains previously observed with methods specialized for cross-domain few-shot learning vanish in this more challenging benchmark. Finally, accuracy of all methods tend to correlate with dataset similarity to natural images, verifying the value of the benchmark to better represent the diversity of data seen in practice and guiding future research. Code for the experiments in this work can be found at https://github.com/IBM/cdfsl-benchmark.
引用
收藏
页码:124 / 141
页数:18
相关论文
共 69 条
[1]   Machine Learning and Health Care Disparities in Dermatology [J].
Adamson, Adewole S. ;
Smith, Avery .
JAMA DERMATOLOGY, 2018, 154 (11) :1247-1248
[2]  
[Anonymous], ICLR 2019 OpenReview
[3]  
[Anonymous], 2007, P ACM MM
[4]  
[Anonymous], 2016, OPTIMIZATION MODEL F
[5]  
Bertinetto L, 2019, Arxiv, DOI arXiv:1805.08136
[6]  
Bousmalis K, 2016, ADV NEUR IN, V29
[7]  
Chen W.-Y., 2019, P INT C LEARN REPR I
[8]   Multi-Level Semantic Feature Augmentation for One-Shot Learning [J].
Chen, Zitian ;
Fu, Yanwei ;
Zhang, Yinda ;
Jiang, Yu-Gang ;
Xue, Xiangyang ;
Sigal, Leonid .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) :4594-4605
[9]   Describing Textures in the Wild [J].
Cimpoi, Mircea ;
Maji, Subhransu ;
Kokkinos, Iasonas ;
Mohamed, Sammy ;
Vedaldi, Andrea .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3606-3613
[10]  
Codella N, 2019, Arxiv, DOI arXiv:1902.03368