Enhancing representation learning by exploiting effective receptive fields for object detection

Times cited: 7
Authors
Wang, Qijin [1 ,2 ,3 ]
Zhang, Shengyu [1 ]
Qian, Yu [3 ]
Zhang, Guangcai [4 ]
Wang, Hongqiang [1 ,3 ]
Affiliations
[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] Anhui Xinhua Univ, Hefei 230088, Peoples R China
[4] Anhui Normal Univ, Coll Comp & Informat, Wuhu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Object detection; Region proposal network (RPN); Convolutional neural network (CNN); Effective receptive field; PROPOSAL;
DOI
10.1016/j.neucom.2022.01.020
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most state-of-the-art object detectors depend on multiple anchors (reference boxes) for representation learning. However, such an anchor-based representation does not fully match the visual information perceived by the sliding windows, which degrades overall detection performance. In this paper, we present an effective receptive field (eRF)-dependent region proposal network (eRPN) for proposal generation, which enhances the anchor-based representation via eRFs. Specifically, we define an eRF for each sliding window on the feature map and encode only objects within the eRF, for unbiased representation learning. The size of the eRF depends on the backbone network. An eRF-based matching rule is devised and combined with the commonly used IoU rule for pertinent sample selection. We also design an eRF filter module, which can be appended to the RPN to eliminate redundant low-quality region proposals at inference time. eRPN enhances representation learning from two perspectives, input information and sample balance, making region proposal generation more robust. We evaluate eRPN by combining it with two commonly used detection heads: Faster RCNN and Faster RCNN with FPN (FasterFPN). Experimental results on the PASCAL VOC and MS COCO benchmarks demonstrate the effectiveness of the proposed method in learning representations for object detection. © 2022 Elsevier B.V. All rights reserved.
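The abstract describes the eRF-based matching rule only at a high level, so the following is a minimal sketch of one possible reading, not the authors' implementation. It assumes a square eRF centred on the sliding-window position, treats an object as "within the eRF" when its ground-truth box lies entirely inside that square, and combines this with an IoU threshold. The function names, the containment test, and the 0.7 threshold are all hypothetical choices made for illustration.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in image pixels


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def erf_box(cx: float, cy: float, erf_size: float) -> Box:
    """Assumption: the eRF is a square of side erf_size centred at (cx, cy)."""
    half = erf_size / 2.0
    return (cx - half, cy - half, cx + half, cy + half)


def inside(inner: Box, outer: Box) -> bool:
    """True when `inner` lies entirely within `outer`."""
    return (inner[0] >= outer[0] and inner[1] >= outer[1]
            and inner[2] <= outer[2] and inner[3] <= outer[3])


def is_positive(anchor: Box, gt: Box, cx: float, cy: float,
                erf_size: float, iou_thresh: float = 0.7) -> bool:
    """Hypothetical combined rule: the ground-truth box must fall inside
    the window's eRF AND overlap the anchor by at least iou_thresh."""
    return inside(gt, erf_box(cx, cy, erf_size)) and iou(anchor, gt) >= iou_thresh
```

Under these assumptions, an anchor that overlaps an object well is still rejected as a positive sample when the object extends beyond the eRF of the window it is attached to, which is the kind of mismatch between anchor and perceived visual information that the abstract points to.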
Pages: 22-32
Number of pages: 11