A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

被引:1116
作者
Cai, Zhaowei [1 ]
Fan, Quanfu [2 ]
Feris, Rogerio S. [2 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, SVCL, San Diego, CA 92103 USA
[2] IBM TJ Watson Res, Yorktown Hts, NY USA
来源
COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷
基金
美国国家科学基金会;
关键词
Object detection; Multi-scale; Unified neural network;
D O I
10.1007/978-3-319-46493-0_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produce a strong multi-scale object detector. The unified network is learned end-to-end, by optimizing a multi-task loss. Feature upsampling by deconvolution is also explored, as an alternative to input upsampling, to reduce the memory and computation costs. State-of-the-art object detection performance, at up to 15 fps, is reported on datasets, such as KITTI and Caltech, containing a substantial number of small objects.
引用
收藏
页码:354 / 370
页数:17
相关论文
共 50 条
[21]   Deep Learning for Multi-scale Object Detection: A Survey [J].
Chen K.-Q. ;
Zhu Z.-L. ;
Deng X.-M. ;
Ma C.-X. ;
Wang H.-A. .
Ruan Jian Xue Bao/Journal of Software, 2021, 32 (04) :1201-1227
[22]   Multi-Scale Visual Attention Deep Convolutional Neural Network for Multi-Focus Image Fusion [J].
Lai, Rui ;
Li, Yongxue ;
Guan, Juntao ;
Xiong, Ai .
IEEE ACCESS, 2019, 7 :114385-114399
[23]   StairsNet: Mixed Multi-scale Network for Object Detection [J].
Gao, Weiyi ;
Cao, Wenlong ;
Zhai, Jian ;
Rui, Jianwu .
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 :303-314
[24]   Multi-scale fully convolutional neural network for building extraction [J].
Cui W. ;
Xiong B. ;
Zhang L. .
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2019, 48 (05) :597-608
[25]   Learning Environmental Sounds with Multi-scale Convolutional Neural Network [J].
Zhu, Boqing ;
Wang, Changjian ;
Liu, Feng ;
Lei, Jin ;
Huang, Zhen ;
Peng, Yuxing ;
Li, Fei .
2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
[26]   Multi-Scale convolutional neural network for finger vein recognition [J].
Liu, Junbo ;
Ma, Hui ;
Guo, Zishuo .
INFRARED PHYSICS & TECHNOLOGY, 2024, 143
[27]   MULTI-SCALE 3D DEEP CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION [J].
He, Mingyi ;
Li, Bo ;
Chen, Huahui .
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, :3904-3908
[28]   3D multi-scale deep convolutional neural networks for pulmonary nodule detection [J].
Peng, Haixin ;
Sun, Huacong ;
Guo, Yanfei .
PLOS ONE, 2021, 16 (01)
[29]   Multi-scale oriented object detection in aerial images based on convolutional neural networks with global attention [J].
Fei, Jingjing ;
Wang, Zhicheng ;
Yu, Zhaohui ;
Gu, Xi ;
Wei, Gang .
MIPPR 2019: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2020, 11432
[30]   Exploring Multi-scale Deep Feature Fusion for Object Detection [J].
Zhang, Quan ;
Lai, Jianhuang ;
Xie, Xiaohua ;
Zhu, Junyong .
PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 :40-52