Enhancing representation learning by exploiting effective receptive fields for object detection

被引:7
|
作者
Wang, Qijin [1 ,2 ,3 ]
Zhang, Shengyu [1 ]
Qian, Yu [3 ]
Zhang, Guangcai [4 ]
Wang, Hongqiang [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] Anhui Xinhua Univ, Hefei 230088, Peoples R China
[4] Anhui Normal Univ, Coll Comp & Informat, Wuhu, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Region proposal network (RPN); Convolutional neural network (CNN); Effective receptive field; PROPOSAL;
D O I
10.1016/j.neucom.2022.01.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of state-of-the-art object detectors depend on multiple anchors/reference boxes in representation learning. However, such anchor-based representation does not completely match with the visual information perceived by the sliding windows, thus degrading the overall performance of object detection. In this paper, we present an effective receptive field (eRF)-dependent region proposal network (eRPN) for proposal generation, which enhances the anchor-based representation via eRFs. Specifically, we define an eRF for each sliding window on the feature map and only encode objects within the eRF for unbiasedly representation learning. The size of eRF depends on its backbone network. An eRF-based matching rule is devised and combined with the commonly used IoU rule for pertinent sample selection. We also design an eRF filter module, which can be appended to RPN for eliminating redundant low-quality region proposals in inference time. eRPN enhances representation learning from two perspectives: input information and sample balance, to make generating region proposals more robust. We evaluate eRPN by combining with two commonly used detection heads: Faster RCNN and Faster RCNN w FPN(FasterFPN). Experimental results on PASCAL VOC dataset and MS COCO dataset benchmarks demonstrate the effectiveness of the proposed method in learning representation for object detection. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:22 / 32
页数:11
相关论文
共 50 条
  • [31] Structural Sparse Representation for Object Detection
    FANG Wenhua
    CHEN Jun
    HU Ruimin
    Wuhan University Journal of Natural Sciences, 2017, 22 (04) : 318 - 322
  • [32] Circle Representation for Medical Object Detection
    Nguyen, Ethan H.
    Yang, Haichun
    Deng, Ruining
    Lu, Yuzhe
    Zhu, Zheyu
    Roland, Joseph T.
    Lu, Le
    Landman, Bennett A.
    Fogo, Agnes B.
    Huo, Yuankai
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (03) : 746 - 754
  • [33] Enhancing object detection in aerial images
    Pandey, Vishal
    Anand, Khushboo
    Kalra, Anmol
    Gupta, Anmol
    Roy, Partha Pratim
    Kim, Byung-Gyu
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (08) : 7920 - 7932
  • [34] REPRESENTATION RECONSTRUCTION HEAD FOR OBJECT DETECTION
    Miao, Shuyu
    Feng, Rui
    Zhang, Yuejie
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1516 - 1520
  • [35] Enhancing Object Detection With Fourier Series
    Liu, Jin
    Lu, Zhongyuan
    Cen, Yaorong
    Hu, Hui
    Shao, Zhenfeng
    Hong, Yong
    Jiang, Ming
    Xu, Miaozhong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 2581 - 2596
  • [36] Effective Rotate: Learning Rotation-Robust Prototype for Aerial Object Detection
    Wang, Chaowei
    Guo, Guangqian
    Liu, Chang
    Shao, Dian
    Gao, Shan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
  • [37] DFL-Net: Effective Object Detection via Distinguishable Feature Learning
    Xie, Jia
    Wan, Shouhong
    Jin, Peiquan
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT II, 2021, 12924 : 195 - 206
  • [38] Exploiting LSTM for Joint Object and Semantic Part Detection
    Yao, Qi
    Gong, Xiaojin
    COMPUTER VISION - ACCV 2018, PT V, 2019, 11365 : 498 - 512
  • [39] FMDL: Enhancing Open-World Object Detection with foundation models and dynamic learning
    Huang, Yangyang
    Hu, Jie
    Luo, Ronghua
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275
  • [40] Enhancing object detection in low-resolution images via frequency domain learning
    Gao, Shuaiqiang
    Chen, Yunliang
    Cui, Ningning
    Qin, Wenjian
    ARRAY, 2024, 22