Low-resolution few-shot learning via multi-space knowledge distillation

被引：0

作者：

Liu, Ke ^{[1
,2
]}

Ye, Xinchen ^{[1
,2
]}

Sun, Baoli ^{[1
,2
]}

Yang, Hairui ^{[1
,2
]}

Li, Haojie ^{[3
]}

Xu, Rui ^{[1
,2
]}

Wang, Zhihui ^{[1
,2
]}

机构：

[1] Dalian Univ Technol, DUT Sch Software Technol, Dalian, Peoples R China

[2] Dalian Univ Technol, DUT RU Int Sch Informat Sci Engn, Dalian, Peoples R China

[3] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao, Peoples R China

来源：

INFORMATION SCIENCES | 2024年 / 677卷

关键词：

Few-shot learning; Low-resolution classification; Multi-space knowledge distillation;

D O I：

10.1016/j.ins.2024.120968

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Existing few -shot classification models usually rely on limited known support images to form class centers, and classify query images based on the distance between their embedding and the class centers. However, these models assume that the query image is high -resolution (HR), and thus suffer from significant performance degradation when applied to low -resolution (LR) images. Due to the lack of discriminative information in LR images, there is a noticeable discrepancy between the embeddings of LR query images and the class centers formed by HR support images. To address this issue, we first formulate the problem of Low -Resolution Few -Shot Learning (LRFSL), where the support images are HR while the query images are only available in LR. Then, we propose an end -to -end pipeline that leverages mutual learning between a super -resolution (SR) network and a few -shot classification network. To further reduce the domain discrepancy between the embeddings of the SR images and HR class centers, we introduce a multi -space knowledge distillation strategy that aims to transfer pixel -level, feature -level, and logit-level knowledge of the HR domain to the SR domain. We conduct extensive experiments on classic few -shot datasets: miniImageNet, tieredImageNet, and the fine-grained few -shot dataset CUB. Experimental results show that our method can handle few -shot classification with LR input, and achieve performance that is almost comparable to using HR images as input. Specifically, our method achieves an average accuracy improvement of 26.47% with the Meta -Baseline Model and 7.44% with the Meta DeepBDC Model across all datasets compared to LR Query.

引用

页数：12

共 41 条

[1] Variational Information Distillation for Knowledge Transfer
Ahn, Sungsoo
Hu, Shell Xu
Damianou, Andreas
Lawrence, Neil D.
Dai, Zhenwen
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9155 - 9163
[2] Convolutional low-resolution fine-grained classification
Cai, Dingding
Chen, Ke
Qian, Yanlin
Kamarainen, Joni-Kristian
[J]. PATTERN RECOGNITION LETTERS, 2019, 119 : 166 - 171
[3] Memory Matching Networks for One-Shot Image Recognition
Cai, Qi
Pan, Yingwei
Yao, Ting
Yan, Chenggang
Mei, Tao
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4080 - 4088
[4] Video anomaly detection with spatio-temporal dissociation
Chang, Yunpeng
Tu, Zhigang
Xie, Wei
Luo, Bin
Zhang, Shifu
Sui, Haigang
Yuan, Junsong
[J]. PATTERN RECOGNITION, 2022, 122
[5] Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning
Chen, Yinbo
Liu, Zhuang
Xu, Huijuan
Darrell, Trevor
Wang, Xiaolong
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9042 - 9051
[6] CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification
Chikontwe, Philip
Kim, Soopil
Park, Sang Hyun
[J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14534 - 14543
[7] On the Efficacy of Knowledge Distillation
Cho, Jang Hyun
Hariharan, Bharath
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4793 - 4801
[8] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9] Diversity with Cooperation: Ensemble Methods for Few-Shot Classification
Dvornik, Nikita
Schmid, Cordelia
Mairal, Julien
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3722 - 3730
[10] Finn C, 2017, PR MACH LEARN RES, V70

← 1 2 3 4 5 →