Weakly supervised vehicle detection in satellite images via multi-instance discriminative learning

被引:57
作者
Cao, Liujuan [1 ]
Luo, Feng [1 ]
Chen, Li [1 ]
Sheng, Yihan [1 ]
Wang, Haibin [1 ]
Wang, Cheng [1 ]
Ji, Rongrong [1 ]
机构
[1] Xiamen Univ, Sch Informat Sci & Engn, Xiamen 361005, Peoples R China
关键词
Multiple instance learning; Density estimation; Multiple instance SVM; Vehicle detection;
D O I
10.1016/j.patcog.2016.10.033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vehicle detection in satellite images has attracted extensive research interest with widespreading application potentials. The main challenge lies in the difficulty of labeling sufficient training instances (vehicle rectangles) across all resolutions and imaging conditions of satellite images, which degenerates the performance of vehicle detectors trained correspondingly. To tackle this challenge, in this paper we propose an intelligent and labor light scheme for large-scale training of vehicle detectors. Our scheme only requires region-level group annotation, i.e. whether this region contains vehicle(s) or not, without explicitly labeling the bounding boxes of vehicles. To this end, a novel weakly supervised, multi-instance learning algorithm is designed to learn instance-wise vehicle detectors from such "weak labels". In particular, a density estimator is firstly adopted to estimate the density map of vehicle instances from the positive regions. Then, a multi-instance SVM is trained to classify and locate vehicle instances from this map. We have carried out extensive experiments on a large-scale satellite image collection that contains various resolutions and imaging conditions. We have demonstrated that the proposed scheme has achieved superior performance by comparing to a set of state-of-the-art and alternative approaches.
引用
收藏
页码:417 / 424
页数:8
相关论文
共 33 条
[1]  
Andrews Stuart, 2002, Proceedings of the 15th International Conference on Neural Information Processing Systems. NIPS'02, P561
[2]  
[Anonymous], 2009, INT ARCH PHOTOGRAMM
[3]  
[Anonymous], 2013, International Conference on Machine Learning
[4]  
[Anonymous], PHOTOGRAMM FERNERKUN
[5]   PERSPECTIVE ON PERFORMANCE [J].
BENTLEY, J .
COMMUNICATIONS OF THE ACM, 1984, 27 (11) :1087-1092
[6]  
Bentley J., 1984, Communications of the ACM, V27, P865, DOI 10.1145/358234.381162
[7]  
Bilen H., ARXIV151102853
[8]  
Bilen H, 2015, PROC CVPR IEEE, P1081, DOI 10.1109/CVPR.2015.7298711
[9]  
Bunescu RC, 2007, P 24 INT C MACH LEAR, P105, DOI 10.1145/1273496.1273510
[10]  
Cai D., LITEKMEANS FASTEST M