Critical direction projection networks for few-shot learning

Cited by: 7
Authors
Bi, Sheng [1, 2]
Wang, Yongxing [1]
Li, Xiaoxiao [1]
Dong, Min [1]
Zhu, Jinhui [3]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[2] Shenzhen Acad Robot, Shenzhen 518000, Guangdong, Peoples R China
[3] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep learning; Few-shot learning; Point cloud; 3D object classification
DOI
10.1007/s10489-020-02110-7
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
With the development of deep learning, visual systems perform better than human beings in many classification tasks. However, the scarcity of labelled data is the most critical problem in such visual systems. Few-shot learning addresses this problem: a classifier must learn to recognize classes that are absent from the training data when given only a few examples of each. In this paper, critical direction projection (CDP) networks are proposed for few-shot learning. CDP involves two crucial steps: the first is to find the critical directions for each category in the embedding space, and the second is to measure the similarity between samples and critical directions according to the projection length. CDP networks are compatible with existing classification networks and achieve state-of-the-art performance on several benchmark datasets. Moreover, CDP achieves outstanding performance on both 2D image and 3D object classification. This study is a new attempt to achieve 3D object classification in a few-shot learning scenario. To summarize, our major research contributions are as follows: 1) a novel metric learning method, CDP, is proposed; 2) a new feature extraction module, EffNet, is introduced; and 3) a benchmark for few-shot 3D object classification is provided.
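The two steps named in the abstract can be illustrated with a small sketch. The abstract does not say how the critical directions are found, so this example assumes, purely for illustration, that each class direction is the normalized mean of that class's support embeddings; the paper's actual procedure may differ. The function names (`critical_direction`, `projection_length`, `classify`) and the toy 2D embeddings are hypothetical and do not come from the paper.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def unit(v: Vector) -> List[float]:
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def critical_direction(support: Sequence[Vector]) -> List[float]:
    """ASSUMPTION: use the normalized mean of the support embeddings
    as a stand-in for the class's critical direction."""
    dim = len(support[0])
    mean = [sum(v[i] for v in support) / len(support) for i in range(dim)]
    return unit(mean)


def projection_length(query: Vector, direction: Vector) -> float:
    """Signed length of the query's projection onto a unit direction."""
    return sum(q * d for q, d in zip(query, direction))


def classify(query: Vector, directions: Dict[str, Vector]) -> str:
    """Pick the class whose direction yields the longest projection."""
    return max(directions, key=lambda label: projection_length(query, directions[label]))


# Toy 2-way, 2-shot episode with hand-made 2D embeddings (illustrative only).
support_sets = {
    "chair": [[2.0, 0.1], [1.8, -0.1]],
    "table": [[0.1, 1.5], [-0.1, 2.5]],
}
directions = {label: critical_direction(vs) for label, vs in support_sets.items()}
prediction = classify([1.0, 0.2], directions)
```

In this sketch the similarity score is a plain dot product with a unit vector, so a query is scored by how far it extends along a class direction rather than by its distance to a class centroid. In a real pipeline the embeddings would come from a feature extractor such as the EffNet module the abstract mentions.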
Pages: 5400-5413
Page count: 14