A Multi-strategy Region Proposal Network

Cited by: 8
Authors
Chen, Yu-Peng [1, 2]
Li, Ying [1, 2]
Wang, Gang [1, 2]
Xu, Qian [1, 2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun, Jilin, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Object detection; Region proposal generation; Convolutional neural network; Classification;
DOI
10.1016/j.eswa.2018.06.043
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
The Faster Region-based Convolutional Network (Faster R-CNN) was recently proposed and achieves outstanding performance for object detection. Specifically, Faster R-CNN uses a Region Proposal Network (RPN) to efficiently predict region proposals with a wide range of scales and aspect ratios. Nevertheless, when the number and quality of region proposals generated by the RPN are not ideal, the object detection performance of Faster R-CNN suffers. In this paper, multiple strategies are applied to address these limitations and improve the RPN. The result is a novel architecture for region proposal generation, named the Multi-strategy Region Proposal Network (MSRPN). MSRPN introduces four improvements. First, a novel skip-layer connection network is designed to combine multi-level features and boost the ability of pooling layers, which strengthens the quality of region proposals. Second, improved anchor boxes are introduced with adaptive aspect ratios and evenly distributed intervals between the selected scales. In this way, the number of predicted region proposals needed for detection is greatly reduced and the efficiency of object localization is increased. In particular, the first and second improvements enhance the capability of small object detection. Third, the classification layer and the regression layer are unified into a single convolutional layer. This reduces the model complexity of the output layer and thus accelerates training and testing. Fourth, the bounding box regression part of the multi-task loss function in the RPN is improved, which improves bounding box regression performance. In the experiments, MSRPN is compared with the Fast Region-based Convolutional Network (Fast R-CNN), Faster R-CNN, Inside-Outside Net (ION), Multi-region CNN (MR-CNN) and HyperNet approaches.
MSRPN achieves state-of-the-art mean average precision (mAP) of 78.9%, 74.8% and 32.1% on the PASCAL VOC 2007, PASCAL VOC 2012 and MS COCO data sets, respectively, with the deep VGG-16 model, surpassing the other five object detection methods. These results are obtained with only 150 region proposals per image. Additionally, MSRPN performs well on small object detection. Furthermore, MSRPN runs at 6 fps, which is faster than the other methods. In conclusion, the MSRPN method can provide important support for intelligent object detection systems. (C) 2018 Elsevier Ltd. All rights reserved.
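As a rough illustration of the anchor-box idea mentioned in the abstract, the sketch below builds anchors at one feature-map location from evenly spaced scales and a set of aspect ratios. It is not the MSRPN method: the abstract does not give the scale values, the ratios, or how the aspect ratios are adapted, so the numbers and the function names `evenly_spaced_scales` and `make_anchors` are hypothetical, and the ratios here are fixed rather than adaptive.

```python
import math


def evenly_spaced_scales(min_size, max_size, count):
    """Return `count` anchor side lengths spaced evenly from min_size to max_size."""
    if count == 1:
        return [float(min_size)]
    step = (max_size - min_size) / (count - 1)
    return [min_size + i * step for i in range(count)]


def make_anchors(center_x, center_y, scales, aspect_ratios):
    """Build (x1, y1, x2, y2) anchors centered on a single location.

    Each anchor has area scale**2, and its height/width equals the ratio.
    """
    anchors = []
    for scale in scales:
        for ratio in aspect_ratios:
            width = scale / math.sqrt(ratio)
            height = scale * math.sqrt(ratio)
            anchors.append((
                center_x - width / 2.0,
                center_y - height / 2.0,
                center_x + width / 2.0,
                center_y + height / 2.0,
            ))
    return anchors


# Hypothetical settings, chosen only for illustration (not from the paper).
scales = evenly_spaced_scales(64, 512, 4)
ratios = [0.5, 1.0, 2.0]
anchors = make_anchors(300.0, 200.0, scales, ratios)
print(len(anchors))  # one anchor per (scale, ratio) pair
```

With four scales and three ratios this yields twelve anchors per location. Choosing fewer, better-placed anchors is one plausible way to lower the number of proposals a detector has to score, which is the effect the abstract attributes to its improved anchor boxes.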
Pages: 1-17
Number of pages: 17
Related papers
50 in total
  • [1] Hierarchical objectness network for region proposal generation and object detection
    Wang, Juan
    Tao, Xiaoming
    Xu, Mai
    Duan, Yiping
    Lu, Jianhua
    PATTERN RECOGNITION, 2018, 83 : 260 - 272
  • [2] MULTI-STREAM REGION PROPOSAL NETWORK FOR PEDESTRIAN DETECTION
    Lei, Jianjun
    Chen, Yue
    Peng, Bo
    Huang, Qingming
    Ling, Nam
    Hou, Chunping
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [3] A Multi-Strategy Integration Prediction Model for Carbon Price
    Dong, Hongwei
    Hu, Yue
    Yang, Yihe
    Jiang, Wenjing
    ENERGIES, 2023, 16 (12)
  • [4] A multi-strategy approach for mining multimedia data repositories
    Viktor, HL
    Paquet, E
    DATA MINING VI: DATA MINING, TEXT MINING AND THEIR BUSINESS APPLICATIONS, 2005, : 63 - 73
  • [5] Weakly Supervised Region Proposal Network and Object Detection
    Tang, Peng
    Wang, Xinggang
    Wang, Angtian
    Yan, Yongluan
    Liu, Wenyu
    Huang, Junzhou
    Yuille, Alan
    COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 : 370 - 386
  • [6] Multi-strategy active learning for power quality disturbance identification
    Zhang, Haoyi
    Wu, Wei
    Li, Kaicheng
    Zheng, Xinyue
    Xu, Xuebin
    Wei, Xuan
    Zhao, Chen
    APPLIED SOFT COMPUTING, 2024, 154
  • [7] A Framework for Classification in Data Streams Using Multi-strategy Learning
    Pesaranghader, Ali
    Viktor, Herna L.
    Paquet, Eric
    DISCOVERY SCIENCE, (DS 2016), 2016, 9956 : 341 - 355
  • [8] An Automatic Facial Expression Recognition System Employing Convolutional Neural Network with Multi-strategy Gravitational Search Algorithm
    Alenazy, Wael Mohammad
    Alqahtani, Abdullah Saleh
    IETE TECHNICAL REVIEW, 2022, 39 (01) : 72 - 85
  • [9] Focal Loss for Region Proposal Network
    Chen, Chengpeng
    Song, Xinhang
    Jiang, Shuqiang
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 368 - 380
  • [10] Geospatial Object Detection via Deconvolutional Region Proposal Network
    Wang, Chen
    Shi, Jun
    Yang, Xiaqing
    Zhou, Yuanyuan
    Wei, Shunjun
    Li, Liang
    Zhang, Xiaoling
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (08) : 3014 - 3027