A Multi-strategy Region Proposal Network

被引:8
|
作者
Chen, Yu-Peng [1 ,2 ]
Li, Ying [1 ,2 ]
Wang, Gang [1 ,2 ]
Xu, Qian [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun, Jilin, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Region proposal generation; Convolutional neural network; Classification;
D O I
10.1016/j.eswa.2018.06.043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Faster Region-based Convolutional Network (Faster R-CNN) was recently proposed achieving outstanding performance for object detection. Specially, a Region Proposal Network (RPN) is designed to efficiently predict region proposals with a wide range of scales and aspect ratios in Faster R-CNN. Nevertheless, once the number and quality of region proposals generated by RPN are not ideal the object detection performance of Faster R-CNN is affected. In this paper, multiple strategies are applied to address these limitations and improve RPN. Hence, a novel architecture for region proposal generation is presented which is named as Multi-strategy Region Proposal Network (MSRPN). Four improvements are presented in MSRPN. Firstly, a novel skip-layer connection network is designed for combining multi-level features and boosting the ability of pooling layers. Thereupon, the quality of region proposals is strengthened. Secondly, improved anchor boxes are introduced with adaptive aspect ratio and evenly distributed interval of selected scales. In this way, the number of predicted region proposals for detection is seriously reduced and the efficiency of object localization is increased. Particularly, the capability of small object detection is enhanced by applying the first and second improvements. Thirdly, classification layer and regression layer are unified as a single convolutional layer. Furthermore, the model complexity of output layer is reduced. Thus, the speed of training and testing is accelerated. Fourthly, the bounding box regression part of multi-task loss function in RPN is improved. Consequently, the performance of bounding box regression is promoted. In the experiment, MSRPN is compared with the Fast Region-based Convolutional Network (Fast R-CNN), Faster R-CNN, Inside-Outside Net (ION), Multi-region CNN (MR-CNN) and HyperNet approaches. MSRPN achieves the state-of-the-art mean average precision (mAP) of 78.9%, 74.8% and 32.1% on PASCAL VOC 2007, 2012 and MS COCO data sets with the deep VGG-16 model, surpassing other five object detection methods. Simultaneously, the above experiment results are obtained by MSRPN with only 150 region proposals per image. Additionally, MSRPN gets excellent performance on small object detection. Furthermore, MSRPN runs at 6 fps which is faster than other methods. In conclusion, the MSRPN method can provide important support for the intelligent object detection systems. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [41] Depth Driven People Counting Using Deep Region Proposal Network
    Song, Diping
    Qiao, Yu
    Corbetta, Alessandro
    2017 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (IEEE ICIA 2017), 2017, : 416 - 421
  • [42] Two-Line Element Outlier and Space Event Detection Method Based on Multi-Strategy Genetic Algorithm
    Zhang, Haoyue
    Zhao, Chunmei
    He, Zhengbin
    APPLIED SCIENCES-BASEL, 2024, 14 (09):
  • [43] SA-RPN: A Spacial Aware Region Proposal Network for Acne Detection
    Zhang, Jianwei
    Zhang, Lei
    Wang, Junyou
    Wei, Xin
    Li, Jiaqi
    Jiang, Xian
    Du, Dan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (11) : 5439 - 5448
  • [44] Region Proposal and Regression Network for Fishing Spots Detection From Sea Environment
    Fu, An
    Patil, Kalpesh Ravindra
    Iiyama, Masaaki
    IEEE ACCESS, 2021, 9 : 68366 - 68375
  • [45] Object detection with class aware region proposal network and focused attention objective
    Tao, Xiaoyu
    Gong, Yihong
    Shi, Weiwei
    Cheng, De
    PATTERN RECOGNITION LETTERS, 2020, 130 (130) : 353 - 361
  • [46] DOMAIN-INVARIANT REGION PROPOSAL NETWORK FOR CROSS-DOMAIN DETECTION
    Yang, Xuebin
    Wan, Shouhong
    Jin, Peiquan
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [47] Efficient and robust optic disc detection and fovea localization using region proposal network and cascaded network
    Huang, Yijin
    Zhong, Zhiquan
    Yuan, Jin
    Tang, Xiaoying
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 60
  • [48] Multi-strategy text data augmentation for enhanced aspect-based sentiment analysis in resource-limited scenarios
    Zhao, Chuanjun
    Sun, Xuzhuang
    Feng, Rong
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (08) : 11129 - 11148
  • [49] TRPN: A Text Region Proposal Network in the wild under the constraint of low memory GPU
    Keserwani, Prateek
    Ali, Tofik
    Roy, Partha Pratim
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 286 - 291
  • [50] Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection
    Li, Hongyang
    Liu, Yu
    Ouyang, Wanli
    Wang, Xiaogang
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2019, 127 (03) : 225 - 238