A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

被引:1074
|
作者
Cai, Zhaowei [1 ]
Fan, Quanfu [2 ]
Feris, Rogerio S. [2 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, SVCL, San Diego, CA 92103 USA
[2] IBM TJ Watson Res, Yorktown Hts, NY USA
来源
COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷
基金
美国国家科学基金会;
关键词
Object detection; Multi-scale; Unified neural network;
D O I
10.1007/978-3-319-46493-0_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produce a strong multi-scale object detector. The unified network is learned end-to-end, by optimizing a multi-task loss. Feature upsampling by deconvolution is also explored, as an alternative to input upsampling, to reduce the memory and computation costs. State-of-the-art object detection performance, at up to 15 fps, is reported on datasets, such as KITTI and Caltech, containing a substantial number of small objects.
引用
收藏
页码:354 / 370
页数:17
相关论文
共 50 条
  • [1] Multi-Scale Attention Deep Neural Network for Fast Accurate Object Detection
    Song, Kaiyou
    Yang, Hua
    Yin, Zhouping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) : 2972 - 2985
  • [2] Multi-scale Dilated Convolutional Neural Network for Object Detection in UAV Images
    Zhang R.
    Shao Z.
    Aleksei P.
    Wang J.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2020, 45 (06): : 895 - 903
  • [3] A Deep Multi-scale Convolutional Neural Network for Classifying Heartbeats
    Bai, Mengyao
    Xu, Yongjun
    Wang, Lianyan
    Wei, Zhihui
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [4] A Multi-Scale Fusion Convolutional Neural Network for Face Detection
    Chen, Qiaosong
    Meng, Xiaomin
    Li, Wen
    Fu, Xingyu
    Deng, Xin
    Wang, Jin
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 1013 - 1018
  • [5] MRMNet: Multi-scale residual multi-branch neural network for object detection
    Dong, Yongsheng
    Liu, Yafeng
    Li, Xuelong
    NEUROCOMPUTING, 2024, 596
  • [6] A maritime targets detection method based on hierarchical and multi-scale deep convolutional neural network
    Chen, Wei
    Li, Juelong
    Xing, Jianchun
    Yang, Qiliang
    Zhou, Qizhen
    TENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2018), 2018, 10806
  • [7] Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images
    Fu, Kun
    Chang, Zhonghan
    Zhang, Yue
    Xu, Guangluan
    Zhang, Keshu
    Sun, Xian
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 161 (161) : 294 - 308
  • [8] 3D Object Detection Algorithm for Panoramic Images With Multi-Scale Convolutional Neural Network
    Wang, Dianwei
    He, Yanhui
    Liu, Ying
    Li, Daxiang
    Wu, Shiqian
    Qin, Yongrui
    Xu, Zhijie
    IEEE ACCESS, 2019, 7 : 171461 - 171470
  • [9] Multi-scale object detection in remote sensing imagery with convolutional neural networks
    Deng, Zhipeng
    Sun, Hao
    Zhou, Shilin
    Zhao, Juanping
    Lei, Lin
    Zou, Huanxin
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 : 3 - 22
  • [10] MDFN: Multi-scale deep feature learning network for object detection
    Ma, Wenchi
    Wu, Yuanwei
    Cen, Feng
    Wang, Guanghui
    PATTERN RECOGNITION, 2020, 100