An Applicative Survey on Few-shot Learning

被引:1
作者
Zhang J. [1 ]
Zhang X. [1 ]
Lv L. [1 ]
Di Y. [1 ]
Chen W. [1 ]
机构
[1] The State Key Lab of CAD & CG, Zhejiang University, Hangzhou
关键词
feature representation; Few-shot learning; meta-learning; metric learning; neural network; supervised learning;
D O I
10.2174/1872212115666210715121344
中图分类号
学科分类号
摘要
Background: Learning discriminative representation from large-scale data sets has made a breakthrough in decades. However, it is still a thorny problem to generate representative embedding from limited examples, for example, a class containing only one image. Recently, deep learn-ing-based Few-Shot Learning (FSL) has been proposed. It tackles this problem by leveraging prior knowledge in various ways. Objective: In this work, we review recent advances of FSL in a perspective of high-dimensional representation learning. The results of the analysis can provide insights and directions for future work. Methods: We first present the definition of general FSL. Then, we propose a general framework for the FSL problem and give the taxonomy under the framework. We survey two FSL directions: learning policy and meta-learning. Results: We review the advanced applications of FSL, including image classification, object detec-tion, image segmentation and other tasks etc., as well as the corresponding benchmarks to provide an overview of recent progress. Conclusion: In future work, FSL needs to be further studied in medical images, language models and reinforcement learning. In addition, cross-domain FSL, successive FSL and associated FSL are more challenging and valuable research directions. © 2022 Bentham Science Publishers.
引用
收藏
页码:104 / 124
页数:20
相关论文
共 163 条
[1]  
Krizhevsky Alex, Sutskever Ilya, Hinton Geoffrey E, Imagenet classification with deep convolutional neural networks, Adv. Neur. Info. Process. Sys, pp. 1097-1105, (2012)
[2]  
Deng J., Dong W., Socher R., Li L-J., Li K., Li F-F., Imagenet: A large-scale hierarchical image database, 2009 IEEE conference on computer vision and pattern recognition, pp. 248-255, (2009)
[3]  
Jouppi N. P., Young C., Patil N., Patterson D., Agrawal G., Bajwa R., Bates S., Bhatia S., Boden N., Borchers Al, In-datacenter performance analysis of a tensor processing unit, Proceedings of the 44th Annual International Symposium on Computer Architecture, pp. 1-12, (2017)
[4]  
Taigman Y., Yang M., Marc'Aurelio Ranzato, Lior Wolf, Deepface: Closing the gap to human-level performance in face verification, Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1701-1708, (2014)
[5]  
He K., Zhang X., Ren S., Sun J., Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, Proceedings of the IEEE international conference on computer vi-sion, pp. 1026-1034, (2015)
[6]  
Silver D., Huang A., Chris M.J., Arthur G., Laurent S., Van Den D. G., Julian S., Ioannis A., Veda P., Marc Lanctot, Mastering the game of go with deep neural networks and tree search, Nature, 529, 7587, pp. 484-489, (2016)
[7]  
Lake B., Salakhutdinov R., Gross J., Tenenbaum J., One shot learning of simple visual concepts, Proceedings of the annual meeting of the cognitive science society, 33, (2011)
[8]  
Finn C., Levine S., Meta-learning: From fewshot learning to rapid reinforcement learning, ICML, pp. 1-9, (2019)
[9]  
Haralick Robert M., Karthikeyan Shanmugam, Din-stein ' Hak, Textural features for image classification, IEEE Transactions on systems, man, and cybernetics, 6, (1973)
[10]  
Mohanaiah P, Sathyanarayana P, GuruKumar L, Image texture feature extraction using glcm approach, Int. J. Sci. Res. Publ, 3, 3, (2013)