Task-wise attention guided part complementary learning for few-shot image classification

被引:55
作者
Cheng, Gong [1 ,2 ,3 ]
Li, Ruimin [1 ,2 ,3 ]
Lang, Chunbo [1 ,2 ,3 ]
Han, Junwei [2 ]
机构
[1] Northwestern Polytech Univ Shenzhen, Res & Dev Inst, Shenzhen 518057, Peoples R China
[2] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[3] CETC Key Lab Aerosp Informat Applicat, Shijiazhuang 050081, Hebei, Peoples R China
基金
中国国家自然科学基金;
关键词
few-shot learning; meta-learning; task-wise attention; part complementary learning; NETWORKS;
D O I
10.1007/s11432-020-3156-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A general framework to tackle the problem of few-shot learning is meta-learning, which aims to train a well-generalized meta-learner (or backbone network) to learn a base-learner for each future task with small training data. Although a lot of work has produced relatively good results, there are still some challenges for few-shot image classification. First, meta-learning is a learning problem over a collection of tasks and the meta-learner is usually shared among all tasks. To achieve image classification of novel classes in different tasks, it is needed to learn a base-learner for each task. Under the circumstances, how to make the base-learner specialized, and thus respond to different inputs in an extremely task-wise manner for different tasks is a big challenge at present. Second, classification network usually inclines to identify local regions from the most discriminative object parts rather than the whole objects for recognition, thereby resulting in incomplete feature representations. To address the first challenge, we propose a task-wise attention (TWA) module to guide the base-learner to extract task-specific image features. To address the second challenge, under the guidance of TWA, we propose a part complementary learning (PCL) module to extract and fuse the features of multiple complementary parts of target objects, and thus we can obtain more specific and complete information. In addition, the proposed TWA module and PCL module can be embedded into a unified network for end-to-end training. Extensive experiments on two commonly-used benchmark datasets and comparison with state-of-the-art methods demonstrate the effectiveness of our proposed method.
引用
收藏
页数:14
相关论文
共 53 条
[21]   Edge-Labeling Graph Neural Network for Few-shot Learning [J].
Kim, Jongmin ;
Kim, Taesup ;
Kim, Sungwoong ;
Yoo, Chang D. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11-20
[22]   Meta-Learning with Differentiable Convex Optimization [J].
Lee, Kwonjoon ;
Maji, Subhransu ;
Ravichandran, Avinash ;
Soatto, Stefano .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10649-10657
[23]   Finding Task-Relevant Features for Few-Shot Learning by Category Traversal [J].
Li, Hongyang ;
Eigen, David ;
Dodge, Samuel ;
Zeiler, Matthew ;
Wang, Xiaogang .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1-10
[24]   Sub-sampled Cross-Component Prediction For Chroma Component Coding [J].
Li, Junru ;
Wang, Meng ;
Zhang, Li ;
Zhang, Kai ;
Wang, Shiqi ;
Wang, Shanshe ;
Ma, Siwei ;
Gao, Wen .
2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, :203-212
[25]   Revisiting Local Descriptor based Image-to-Class Measure for Few-shot Learning [J].
Li, Wenbin ;
Wang, Lei ;
Xu, Jinglin ;
Huo, Jing ;
Gao, Yang ;
Luo, Jiebo .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7253-7260
[26]  
Li WB, 2019, AAAI CONF ARTIF INTE, P8642
[27]  
Li Z., 2017, Meta-SGD: Learning to Learn Quickly for Few-Shot Learning
[28]   Dense Classification and Implanting for Few-Shot Learning [J].
Lifchitz, Yann ;
Avrithis, Yannis ;
Picard, Sylvaine ;
Bursuc, Andrei .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9250-9259
[29]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944
[30]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37