A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

Cited by: 1074
Authors
Cai, Zhaowei [1]
Fan, Quanfu [2]
Feris, Rogerio S. [2]
Vasconcelos, Nuno [1]
Affiliations
[1] Univ Calif San Diego, SVCL, San Diego, CA 92103 USA
[2] IBM TJ Watson Res, Yorktown Hts, NY USA
Source
COMPUTER VISION - ECCV 2016, PT IV | 2016 / Vol. 9908
Funding
National Science Foundation (USA);
Keywords
Object detection; Multi-scale; Unified neural network;
DOI
10.1007/978-3-319-46493-0_22
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A unified deep neural network, denoted the multi-scale CNN (MS-CNN), is proposed for fast multi-scale object detection. The MS-CNN consists of a proposal sub-network and a detection sub-network. In the proposal sub-network, detection is performed at multiple output layers, so that receptive fields match objects of different scales. These complementary scale-specific detectors are combined to produce a strong multi-scale object detector. The unified network is learned end-to-end, by optimizing a multi-task loss. Feature upsampling by deconvolution is also explored, as an alternative to input upsampling, to reduce the memory and computation costs. State-of-the-art object detection performance, at up to 15 fps, is reported on datasets, such as KITTI and Caltech, containing a substantial number of small objects.
Pages: 354 - 370
Number of pages: 17
Related papers
50 in total
  • [21] Deep Learning for Multi-scale Object Detection: A Survey
    Chen K.-Q.
    Zhu Z.-L.
    Deng X.-M.
    Ma C.-X.
    Wang H.-A.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (04): 1201 - 1227
  • [22] Multi-Scale Visual Attention Deep Convolutional Neural Network for Multi-Focus Image Fusion
    Lai, Rui
    Li, Yongxue
    Guan, Juntao
    Xiong, Ai
    IEEE ACCESS, 2019, 7 : 114385 - 114399
  • [23] StairsNet: Mixed Multi-scale Network for Object Detection
    Gao, Weiyi
    Cao, Wenlong
    Zhai, Jian
    Rui, Jianwu
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 303 - 314
  • [24] Multi-Scale convolutional neural network for finger vein recognition
    Liu, Junbo
    Ma, Hui
    Guo, Zishuo
    INFRARED PHYSICS & TECHNOLOGY, 2024, 143
  • [25] Learning Environmental Sounds with Multi-scale Convolutional Neural Network
    Zhu, Boqing
    Wang, Changjian
    Liu, Feng
    Lei, Jin
    Huang, Zhen
    Peng, Yuxing
    Li, Fei
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018
  • [26] MULTI-SCALE 3D DEEP CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    He, Mingyi
    Li, Bo
    Chen, Huahui
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017: 3904 - 3908
  • [27] Multi-scale fully convolutional neural network for building extraction
    Cui W.
    Xiong B.
    Zhang L.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2019, 48 (05): 597 - 608
  • [28] Multi-scale oriented object detection in aerial images based on convolutional neural networks with global attention
    Fei, Jingjing
    Wang, Zhicheng
    Yu, Zhaohui
    Gu, Xi
    Wei, Gang
    MIPPR 2019: REMOTE SENSING IMAGE PROCESSING, GEOGRAPHIC INFORMATION SYSTEMS, AND OTHER APPLICATIONS, 2020, 11432
  • [29] 3D multi-scale deep convolutional neural networks for pulmonary nodule detection
    Peng, Haixin
    Sun, Huacong
    Guo, Yanfei
    PLOS ONE, 2021, 16 (01)
  • [30] Exploring Multi-scale Deep Feature Fusion for Object Detection
    Zhang, Quan
    Lai, Jianhuang
    Xie, Xiaohua
    Zhu, Junyong
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 40 - 52