A Multi-strategy Region Proposal Network

被引:8
|
作者
Chen, Yu-Peng [1 ,2 ]
Li, Ying [1 ,2 ]
Wang, Gang [1 ,2 ]
Xu, Qian [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun, Jilin, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Region proposal generation; Convolutional neural network; Classification;
D O I
10.1016/j.eswa.2018.06.043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Faster Region-based Convolutional Network (Faster R-CNN) was recently proposed achieving outstanding performance for object detection. Specially, a Region Proposal Network (RPN) is designed to efficiently predict region proposals with a wide range of scales and aspect ratios in Faster R-CNN. Nevertheless, once the number and quality of region proposals generated by RPN are not ideal the object detection performance of Faster R-CNN is affected. In this paper, multiple strategies are applied to address these limitations and improve RPN. Hence, a novel architecture for region proposal generation is presented which is named as Multi-strategy Region Proposal Network (MSRPN). Four improvements are presented in MSRPN. Firstly, a novel skip-layer connection network is designed for combining multi-level features and boosting the ability of pooling layers. Thereupon, the quality of region proposals is strengthened. Secondly, improved anchor boxes are introduced with adaptive aspect ratio and evenly distributed interval of selected scales. In this way, the number of predicted region proposals for detection is seriously reduced and the efficiency of object localization is increased. Particularly, the capability of small object detection is enhanced by applying the first and second improvements. Thirdly, classification layer and regression layer are unified as a single convolutional layer. Furthermore, the model complexity of output layer is reduced. Thus, the speed of training and testing is accelerated. Fourthly, the bounding box regression part of multi-task loss function in RPN is improved. Consequently, the performance of bounding box regression is promoted. In the experiment, MSRPN is compared with the Fast Region-based Convolutional Network (Fast R-CNN), Faster R-CNN, Inside-Outside Net (ION), Multi-region CNN (MR-CNN) and HyperNet approaches. MSRPN achieves the state-of-the-art mean average precision (mAP) of 78.9%, 74.8% and 32.1% on PASCAL VOC 2007, 2012 and MS COCO data sets with the deep VGG-16 model, surpassing other five object detection methods. Simultaneously, the above experiment results are obtained by MSRPN with only 150 region proposals per image. Additionally, MSRPN gets excellent performance on small object detection. Furthermore, MSRPN runs at 6 fps which is faster than other methods. In conclusion, the MSRPN method can provide important support for the intelligent object detection systems. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [21] Multi object detection and classification in solid waste management using region proposal network and YOLO model
    Jansi Rani, S., V
    Raman, V. Raghu
    Ram, M. Rahul
    Sivasubramaniya, Prithvi Raj A. Sri
    GLOBAL NEST JOURNAL, 2022, 24 (04): : 743 - 751
  • [22] Strategy Selection Versus Strategy Blending: A Predictive Perspective on Single- and Multi-Strategy Accounts in Multiple-Cue Estimation
    Herzog, Stefan M.
    von Helversen, Bettina
    JOURNAL OF BEHAVIORAL DECISION MAKING, 2018, 31 (02) : 233 - 249
  • [23] Region Proposal Network Based on Effective Receptive Field
    Zhang S.
    Dong S.
    Jiao L.
    Wang Q.
    Wang H.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (05): : 393 - 400
  • [24] A flight maneuver recognition method based on multi-strategy affine canonical time warping
    Wei, Zhenglei
    Ding, Dali
    Zhou, Huan
    Zhang, Zhuoran
    Xie, Lei
    Wang, Le
    APPLIED SOFT COMPUTING, 2020, 95
  • [25] A multi-strategy integrated multi-objective artificial bee colony for unsupervised band selection of hyperspectral images
    Zhang Yong
    He Chun-lin
    Song Xian-fang
    Sun Xiao-yan
    SWARM AND EVOLUTIONARY COMPUTATION, 2021, 60
  • [26] Multi-Scale Proposal Regression Network for Temporal Action Proposal Generation
    Zheng, Jingye
    Chen, Dihu
    Hu, Haifeng
    IEEE ACCESS, 2019, 7 : 183860 - 183868
  • [27] Real-Time Object Detection With Reduced Region Proposal Network via Multi-Feature Concatenation
    Shih, Kuan-Hung
    Chiu, Ching-Te
    Lin, Jiou-Ai
    Bu, Yen-Yu
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (06) : 2164 - 2173
  • [28] REAL-TIME OBJECT DETECTION VIA PRUNING AND A CONCATENATED MULTI-FEATURE ASSISTED REGION PROPOSAL NETWORK
    Shih, Kuan-Hung
    Chiu, Ching-Te
    Pu, Yen-Yu
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1398 - 1402
  • [29] Class incremental learning with KL constraint and multi-strategy exemplar selection for classification based on MMFA model
    Li, Yang
    Du, Lan
    Chen, Jian
    INFORMATION SCIENCES, 2024, 681
  • [30] Hourly forecasting of solar irradiance based on CEEMDAN and multi-strategy CNN-LSTM neural networks
    Gao, Bixuan
    Huang, Xiaoqiao
    Shi, Junsheng
    Tai, Yonghang
    Zhang, Jun
    RENEWABLE ENERGY, 2020, 162 : 1665 - 1683