StairsNet: Mixed Multi-scale Network for Object Detection

被引:0
作者
Gao, Weiyi [1 ,2 ]
Cao, Wenlong [1 ,2 ]
Zhai, Jian [2 ]
Rui, Jianwu [2 ]
机构
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Software, Natl Engn Res Ctr Fundamental Software, Beijing, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I | 2018年 / 10735卷
基金
中国国家自然科学基金;
关键词
StairsNet; Object detection; ResNet; Inception;
D O I
10.1007/978-3-319-77380-3_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is common to choose image classification network as backbone in the object detector. The art-of-the-state image classification network exhibits excellent performance on image classification, but that network hurts the detection efficiency, mainly due to the coarseness of features from several convolution and pooling layers. In this paper, we present a single deep neural network with inceptions, called StairsNet, to take advantage of the art-of-the-state image classification network in object detection. In contrast to previous single network SSD [13] which uses VGG-16 as a feature to extract network, our approach applies recently state-of-the-art classification network Residual Network (ResNets [5]). Meanwhile, to avoid coarseness of the last CNN feature, StairsNet not only utilizes various of scale features, but also mixes different scale features to predict. To this end, we insert two stairs-like architectures into the network: top stairway network that mixes multiscale feature maps as input to predict bounding boxes and bottom stairway network that turns into two different scale feature branches. Our StairsNet significantly increases the PASCAL-style mean Average Precision (mAP) from 75.0% (SSD + ResNet-101) to 77.7%. Code is available at https://github.com/gwyve/caffe/tree/StairsNet.
引用
收藏
页码:303 / 314
页数:12
相关论文
共 27 条
[1]   Measuring the Objectness of Image Windows [J].
Alexe, Bogdan ;
Deselaers, Thomas ;
Ferrari, Vittorio .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2189-2202
[2]  
[Anonymous], 2017, 31 AAAI C ART INT AA
[3]  
[Anonymous], COMPUTER SCI
[4]  
[Anonymous], PROC CVPR IEEE
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
[Anonymous], IEEE T PATTERN ANAL
[7]  
[Anonymous], 2013, Advances in Neural Information Processing Systems
[8]  
[Anonymous], 2015, P INT C MACHINE LEAR
[9]  
[Anonymous], 2013, ARXIV PREPRINT ARXIV
[10]  
[Anonymous], 2014, P 2 INT C LEARNING R