Multi-spectral template matching based object detection in a few-shot learning manner

被引:12
作者
Feng, Chen [1 ]
Cao, Zhiguo [1 ]
Xiao, Yang [1 ]
Fang, Zhiwen [2 ]
Zhou, Joey Tianyi [3 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Natl Key Lab Sci & Technol Multispectral Informat, Wuhan 430074, Hubei, Peoples R China
[2] Southern Med Univ, Sch Biomed Engn, Guangdong Prov Key Lab Med Image Proc, Guangzhou 510515, Guangdong, Peoples R China
[3] ASTAR, Inst High Performance Comp, Singapore, Singapore
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Multi-spectral template matching; Object detection; Few-shot learning; Knowledge transfer; Multi-domain subspace alignment;
D O I
10.1016/j.ins.2022.12.067
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-spectral template matching (MSTM) based object detection approaches can be widely used in robotics and aerospace systems for fine-grained object discovery. However, the performance of existing nearest neighbor search based nonparametric para-digms (e.g., correlation coefficient and lp-norm) turns out to be unsatisfactory. These para-digms tend to suffer from two defects: 1) they fail to select from the raw features the discriminative ones that can help distinguish between the target and background; 2) the domain shift between the template and search spectra has not been well addressed within the feature space. In this work, we propose a data-driven MSTM method to address these two issues. First, Exemplar-SVM (E-SVM) is applied to execute feature selection and target/ background categorization jointly, which is facilitated by its max-margin mechanism. To enable the learning process where the template is regarded as a single positive sample, knowledge transfer is executed to attain negative samples from other domains, e.g., large-scale public datasets. Then, the hard negative samples are mined to help train a dis-criminative classifier. Concerning practical applications, we also augment the template with different image degradations and extend E-SVM from the original one-shot learning approach to its few-shot version. Second, a multi-domain adaptation approach via unsu-pervised multi-domain subspace alignment is proposed to tackle multi-domain shift prob-lem. Here the multiple domains relate to template, search, and negative ones considering both offline learning and online matching. The wide-range experimental results on two multi-spectral datasets demonstrate the effectiveness of our method. The tailored dataset and code will be released publicly.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:20 / 36
页数:17
相关论文
共 50 条
[1]  
[Anonymous], 2015, INT C LEARN REPR ICL
[2]  
[Anonymous], 2009, CVPR
[3]  
[Anonymous], 2018, P ADV NEUR INF PROC
[4]  
[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[5]  
Babayan P.V., 2018, PROC MEDITERRANEAN C
[6]   Multispectral piecewise planar stereo using Manhattan-world assumption [J].
Barrera, Fernando ;
Lumbreras, Felipe ;
Sappa, Angel D. .
PATTERN RECOGNITION LETTERS, 2013, 34 (01) :52-61
[7]  
Brown M, 2011, PROC CVPR IEEE, P177, DOI 10.1109/CVPR.2011.5995637
[8]   Memory Matching Networks for One-Shot Image Recognition [J].
Cai, Qi ;
Pan, Yingwei ;
Yao, Ting ;
Yan, Chenggang ;
Mei, Tao .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4080-4088
[9]  
[陈刚 Chen Gang], 2012, [红外技术, Infrared Technology], V34, P229
[10]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893