A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

被引:1074
作者
Cai, Zhaowei [1 ]
Fan, Quanfu [2 ]
Feris, Rogerio S. [2 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, SVCL, San Diego, CA 92103 USA
[2] IBM TJ Watson Res, Yorktown Hts, NY USA
来源
COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷
基金
美国国家科学基金会;
关键词
Object detection; Multi-scale; Unified neural network;
D O I
10.1007/978-3-319-46493-0_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produce a strong multi-scale object detector. The unified network is learned end-to-end, by optimizing a multi-task loss. Feature upsampling by deconvolution is also explored, as an alternative to input upsampling, to reduce the memory and computation costs. State-of-the-art object detection performance, at up to 15 fps, is reported on datasets, such as KITTI and Caltech, containing a substantial number of small objects.
引用
收藏
页码:354 / 370
页数:17
相关论文
共 50 条
[1]   Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection [J].
Song, Kaiyou ;
Yang, Hua ;
Yin, Zhouping .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) :2972-2985
[2]   Multi-scale Dilated Convolutional Neural Network for Object Detection in UAV Images [J].
Zhang R. ;
Shao Z. ;
Aleksei P. ;
Wang J. .
Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2020, 45 (06) :895-903
[3]   A Deep Multi-scale Convolutional Neural Network for Classifying Heartbeats [J].
Bai, Mengyao ;
Xu, Yongjun ;
Wang, Lianyan ;
Wei, Zhihui .
2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
[4]   A Multi-Scale Fusion Convolutional Neural Network for Face Detection [J].
Chen, Qiaosong ;
Meng, Xiaomin ;
Li, Wen ;
Fu, Xingyu ;
Deng, Xin ;
Wang, Jin .
2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, :1013-1018
[5]   MRMNet: Multi-scale residual multi-branch neural network for object detection [J].
Dong, Yongsheng ;
Liu, Yafeng ;
Li, Xuelong .
NEUROCOMPUTING, 2024, 596
[6]   A maritime targets detection method based on hierarchical and multi-scale deep convolutional neural network [J].
Chen, Wei ;
Li, Juelong ;
Xing, Jianchun ;
Yang, Qiliang ;
Zhou, Qizhen .
TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806
[7]   Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images [J].
Fu, Kun ;
Chang, Zhonghan ;
Zhang, Yue ;
Xu, Guangluan ;
Zhang, Keshu ;
Sun, Xian .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 161 (161) :294-308
[8]   3D Object Detection Algorithm for Panoramic Images With Multi-Scale Convolutional Neural Network [J].
Wang, Dianwei ;
He, Yanhui ;
Liu, Ying ;
Li, Daxiang ;
Wu, Shiqian ;
Qin, Yongrui ;
Xu, Zhijie .
IEEE ACCESS, 2019, 7 :171461-171470
[9]   Multi-scale object detection in remote sensing imagery with convolutional neural networks [J].
Deng, Zhipeng ;
Sun, Hao ;
Zhou, Shilin ;
Zhao, Juanping ;
Lei, Lin ;
Zou, Huanxin .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 :3-22
[10]   MDFN: Multi-scale deep feature learning network for object detection [J].
Ma, Wenchi ;
Wu, Yuanwei ;
Cen, Feng ;
Wang, Guanghui .
PATTERN RECOGNITION, 2020, 100