Meta transfer evidence deep learning for trustworthy few-shot classification

被引:1
作者
Liu, Tong [1 ]
Wen, Chaoyu [1 ]
Xiong, Qiangwei [1 ]
Li, Jin [1 ]
机构
[1] Yunnan Univ, Software Sch, Kunming, Peoples R China
关键词
Trustworthy classification; Meta-learning; Transfer learning; Uncertainty quantification; OOD detection; Active learning;
D O I
10.1016/j.eswa.2024.125371
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although the standard meta-learning methods have demonstrated strong performance in few-shot image classification scenarios, the models typically lack the capability to assess the reliability of their predictions, which can lead to risks in certain applications. Aiming at this problem, we first propose a meta-learning-based Evidential Deep Learning (EDL) called Meta Evidence Deep Learning (MetaEDL), which enables reliable prediction in the few-show image classification scenario. Being the same as general meta-learning methods, MetaEDL commonly employed shallow neural networks as feature extractors to avoid overfitting when dealing with few-shot samples, which significantly restricts the model's ability to extract features. To further address this limitation, we propose a Meta Transfer Evidence Deep Learning (MetaTEDL) to address the few-shot trustworthy classification issue. MetaTEDL adopts a large-scale pre-trained neural network as its feature extractor. In the meta-training process, we only train two lightweight neuron operations Scaling and Shifting to reduce the risk of over-fitting. Then, two evidential head neural networks are trained to integrate evidence from different sources, aiming to improve the quality of the evidence output. We conduct comprehensive experiments on several challenging few-shot classification benchmarks. The results indicate that our proposed method not only outperforms other conventional meta-learning methods in terms of few-shot classification performance, but also has good UQ (uncertainty quantification), Uncertainty-guided active learning, and OOD (Out-of-Distribution) detection capabilities.
引用
收藏
页数:12
相关论文
共 54 条
[11]  
Gal Y, 2016, PR MACH LEARN RES, V48
[12]  
Gharoun H., 2023, Meta-learning approaches for few-shot learning: A survey of recent advances
[13]  
Griewank A., 2008, EVALUATING DERIVATIV
[14]  
Gruber SG, 2022, ADV NEUR IN
[15]   Trusted Multi-View Classification With Dynamic Evidential Fusion [J].
Han, Zongbo ;
Zhang, Changqing ;
Fu, Huazhu ;
Zhou, Joey Tianyi .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) :2551-2566
[16]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[17]   Aleatoric and epistemic uncertainty in machine learning: an introduction to concepts and methods [J].
Huellermeier, Eyke ;
Waegeman, Willem .
MACHINE LEARNING, 2021, 110 (03) :457-506
[18]  
Jeong T, 2020, ADV NEUR IN, V33
[19]  
Josang A, 2016, ARTIF INTELL-FOUND, P1, DOI 10.1007/978-3-319-42337-1
[20]  
Kotz S., 2019, Continuous Multivariate Distributions, Models and Applications, V2nd edn