Enhancing representation learning by exploiting effective receptive fields for object detection

Cited by: 7
Authors
Wang, Qijin [1, 2, 3]
Zhang, Shengyu [1 ]
Qian, Yu [3 ]
Zhang, Guangcai [4 ]
Wang, Hongqiang [1, 3]
Affiliations
[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] Anhui Xinhua Univ, Hefei 230088, Peoples R China
[4] Anhui Normal Univ, Coll Comp & Informat, Wuhu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Object detection; Region proposal network (RPN); Convolutional neural network (CNN); Effective receptive field; PROPOSAL;
DOI
10.1016/j.neucom.2022.01.020
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most state-of-the-art object detectors depend on multiple anchors (reference boxes) for representation learning. However, such an anchor-based representation does not fully match the visual information perceived by the sliding windows, which degrades overall detection performance. In this paper, we present an effective receptive field (eRF)-dependent region proposal network (eRPN) for proposal generation, which enhances the anchor-based representation via eRFs. Specifically, we define an eRF for each sliding window on the feature map and encode only the objects within the eRF, for unbiased representation learning. The size of the eRF depends on the backbone network. An eRF-based matching rule is devised and combined with the commonly used IoU rule for pertinent sample selection. We also design an eRF filter module, which can be appended to the RPN to eliminate redundant low-quality region proposals at inference time. eRPN enhances representation learning from two perspectives, input information and sample balance, to make region proposal generation more robust. We evaluate eRPN by combining it with two commonly used detection heads: Faster RCNN and Faster RCNN with FPN (FasterFPN). Experimental results on the PASCAL VOC and MS COCO benchmarks demonstrate the effectiveness of the proposed method in learning representations for object detection. © 2022 Elsevier B.V. All rights reserved.
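The abstract does not spell out the eRF-based matching rule, so the following is only a minimal sketch of one way an eRF constraint might be combined with the usual IoU rule when selecting positive samples. Everything here is an assumption made for illustration: the square eRF centred on the sliding-window location, the "ground truth must lie fully inside the eRF" test, the helper names (`erf_box`, `inside`, `is_positive`), and the 0.7 IoU threshold (a common RPN default, not a value taken from the paper).

```python
from typing import Tuple

# A box is (x1, y1, x2, y2) in image coordinates.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def erf_box(cx: float, cy: float, erf_size: float) -> Box:
    """Hypothetical eRF: a square of side erf_size centred on the
    sliding-window location (cx, cy). The real eRF size depends on
    the backbone network; here it is just a parameter."""
    half = erf_size / 2.0
    return (cx - half, cy - half, cx + half, cy + half)


def inside(inner: Box, outer: Box) -> bool:
    """True if inner lies entirely within outer."""
    return (inner[0] >= outer[0] and inner[1] >= outer[1]
            and inner[2] <= outer[2] and inner[3] <= outer[3])


def is_positive(anchor: Box, gt: Box, erf: Box,
                iou_thresh: float = 0.7) -> bool:
    """Assumed combined rule: the ground-truth box must fall inside
    the eRF AND overlap the anchor by at least iou_thresh."""
    return inside(gt, erf) and iou(anchor, gt) >= iou_thresh


if __name__ == "__main__":
    erf = erf_box(100.0, 100.0, 128.0)
    anchor = (50.0, 50.0, 150.0, 150.0)
    print(is_positive(anchor, (55.0, 55.0, 150.0, 150.0), erf))
    print(is_positive(anchor, (40.0, 40.0, 200.0, 200.0), erf))
```

Under these assumptions, an object that extends beyond the eRF is never assigned to that sliding window, whatever its overlap with the anchor, which is one reading of "encode only the objects within the eRF".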
Pages: 22-32
Number of pages: 11
Related papers
50 in total
  • [21] Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection
    Xu, Hongyu
    Lv, Xutao
    Wang, Xiaoyu
    Ren, Zhou
    Bodla, Navaneeth
    Chellappa, Rama
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (06) : 1914 - 1927
  • [22] TranSDet: Toward Effective Transfer Learning for Small-Object Detection
    Xu, Xinkai
    Zhang, Hailan
    Ma, Yan
    Liu, Kang
    Bao, Hong
    Qian, Xu
    REMOTE SENSING, 2023, 15 (14)
  • [23] An effective sign language learning with object detection based ROI segmentation
    Kim, Sunmok
    Ji, Yangho
    Lee, Ki-Baek
    2018 SECOND IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC), 2018, : 330 - 333
  • [24] From detection to understanding: A survey on representation learning for human-object interaction
    Luo, Tianlun
    Guan, Steven
    Yang, Rui
    Smith, Jeremy
    NEUROCOMPUTING, 2023, 543
  • [25] Generalized Focal Loss: Towards Efficient Representation Learning for Dense Object Detection
    Li, Xiang
    Lv, Chengqi
    Wang, Wenhai
    Li, Gang
    Yang, Lingfeng
    Yang, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3139 - 3153
  • [26] Cross-Modality Data Augmentation for Aerial Object Detection with Representation Learning
    Wei, Chiheng
    Bai, Lianfa
    Chen, Xiaoyu
    Han, Jing
    REMOTE SENSING, 2024, 16 (24)
  • [27] Cross-Transfer Learning for Enhancing Object Detection in Remote Sensing Images
    Musunuri, Yogendra Rao
    Kwon, Oh-Seol
    Kung, Sun-Yuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [28] Multilevel receptive field expansion network for small object detection
    Liu, Zhiwei
    Gan, Menghan
    Xiong, Li
    Mao, Xiaofeng
    Que, Yue
    IET IMAGE PROCESSING, 2023, 17 (08) : 2385 - 2398
  • [29] Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation
    Zheng, Zhaohui
    Wang, Ping
    Ren, Dongwei
    Liu, Wei
    Ye, Rongguang
    Hu, Qinghua
    Zuo, Wangmeng
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8574 - 8586
  • [30] Deep Learning-based Object Detection for Crop Monitoring in Soybean Fields
    Pratama, Muhammad Taufiq
    Kim, Sangwook
    Ozawa, Seiichi
    Ohkawa, Takenao
    Chona, Yuya
    Tsuji, Hiroyuki
    Murakami, Noriyuki
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,