SM plus : REFINED SCALE MATCH FOR TINY PERSON DETECTION

被引:19
作者
Jiang, Nan [1 ]
Yu, Xuehui [1 ]
Peng, Xiaoke [1 ]
Gong, Yuqi [1 ]
Han, Zhenjun [1 ]
机构
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
tiny object detection; pre-training strategy;
D O I
10.1109/ICASSP39728.2021.9414162
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Detecting tiny objects (e.g., less than 20 x 20 pixels) in large-scale images is an important yet open problem. Modern CNN-based detectors are challenged by the scale mismatch between the dataset for network pre-training and the target dataset for detector training. In this paper, we investigate the scale alignment between pre-training and target datasets, and propose a new refined Scale Match method (termed SM+) for tiny person detection. SM+ improves the scale match from image level to instance level, and effectively promotes the similarity between pre-training and target dataset. Moreover, considering SM+ possibly destroys the image structure, a new probabilistic structure inpainting (PSI) method is proposed for the background processing. Experiments conducted across various detectors show that SM+ noticeably improves the performance on TinyPerson, and outperforms the state-of-the-art detectors with a significant margin.
引用
收藏
页码:1815 / 1819
页数:5
相关论文
共 18 条
[1]  
[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[2]  
BERTALMIO M, 2001, PROC CVPR IEEE, P355, DOI DOI 10.1109/CVPR.2001.990497
[3]  
Collins RT., 2000, SYSTEM VIDEO SURVEIL, V2000, P1
[4]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[5]  
Dollár P, 2009, PROC CVPR IEEE, P304, DOI 10.1109/CVPRW.2009.5206631
[6]  
Fang Hao-Shu, 2019, ICCV
[7]   W4:: Real-time surveillance of people and their activities [J].
Haritaoglu, I ;
Harwood, D ;
Davis, LS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) :809-830
[8]  
Kaiming He, 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2049, DOI 10.1109/CVPR.2011.5995495
[9]  
Kullback Solomon, 1951, AMS
[10]   The Open Images Dataset V4 Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale [J].
Kuznetsova, Alina ;
Rom, Hassan ;
Alldrin, Neil ;
Uijlings, Jasper ;
Krasin, Ivan ;
Pont-Tuset, Jordi ;
Kamali, Shahab ;
Popov, Stefan ;
Malloci, Matteo ;
Kolesnikov, Alexander ;
Duerig, Tom ;
Ferrari, Vittorio .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (07) :1956-1981