Enhancing representation learning by exploiting effective receptive fields for object detection

被引:7
|
作者
Wang, Qijin [1 ,2 ,3 ]
Zhang, Shengyu [1 ]
Qian, Yu [3 ]
Zhang, Guangcai [4 ]
Wang, Hongqiang [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China
[2] Univ Sci & Technol China, Hefei, Peoples R China
[3] Anhui Xinhua Univ, Hefei 230088, Peoples R China
[4] Anhui Normal Univ, Coll Comp & Informat, Wuhu, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Region proposal network (RPN); Convolutional neural network (CNN); Effective receptive field; PROPOSAL;
D O I
10.1016/j.neucom.2022.01.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of state-of-the-art object detectors depend on multiple anchors/reference boxes in representation learning. However, such anchor-based representation does not completely match with the visual information perceived by the sliding windows, thus degrading the overall performance of object detection. In this paper, we present an effective receptive field (eRF)-dependent region proposal network (eRPN) for proposal generation, which enhances the anchor-based representation via eRFs. Specifically, we define an eRF for each sliding window on the feature map and only encode objects within the eRF for unbiasedly representation learning. The size of eRF depends on its backbone network. An eRF-based matching rule is devised and combined with the commonly used IoU rule for pertinent sample selection. We also design an eRF filter module, which can be appended to RPN for eliminating redundant low-quality region proposals in inference time. eRPN enhances representation learning from two perspectives: input information and sample balance, to make generating region proposals more robust. We evaluate eRPN by combining with two commonly used detection heads: Faster RCNN and Faster RCNN w FPN(FasterFPN). Experimental results on PASCAL VOC dataset and MS COCO dataset benchmarks demonstrate the effectiveness of the proposed method in learning representation for object detection. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:22 / 32
页数:11
相关论文
共 50 条
  • [41] Exploiting Web Images for Weakly Supervised Object Detection
    Tao, Qingyi
    Yang, Hao
    Cai, Jianfei
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (05) : 1135 - 1146
  • [42] Object detection via deeply exploiting depth information
    Hou, Saihui
    Wang, Zilei
    Wu, Feng
    NEUROCOMPUTING, 2018, 286 : 58 - 66
  • [43] Expectation Maximization Method for Effective Detection and Tracking of Object Using Machine Learning Technique for Secure Wireless Communication
    Alqahtani, Abdulrahman Saad
    WIRELESS PERSONAL COMMUNICATIONS, 2022, 127 (01) : 869 - 880
  • [44] Expectation Maximization Method for Effective Detection and Tracking of Object Using Machine Learning Technique for Secure Wireless Communication
    Abdulrahman Saad Alqahtani
    Wireless Personal Communications, 2022, 127 : 869 - 880
  • [45] Remote Sensing Object Detection Based on Receptive Field Expansion Block
    Dong, Xiaohu
    Fu, Ruigang
    Gao, Yinghui
    Qin, Yao
    Ye, Yuanxin
    Li, Biao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [46] Dynamic Receptive Field-Based Object Detection in Aerial Imaging
    Xie Xueli
    Li Chuanxiang
    Yang Xiaogang
    Xi Jianxiang
    Chen Tong
    ACTA OPTICA SINICA, 2020, 40 (04)
  • [47] Diverse receptive field network with context aggregation for fast object detection
    Xie, Shaorong
    Liu, Chang
    Gao, Jiantao
    Li, Xiaomao
    Luo, Jun
    Fan, Baojie
    Chen, Jiahong
    Pu, Huayan
    Peng, Yan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 70
  • [48] DRFnet: Dynamic receptive field network for object detection and image recognition
    Tan, Minjie
    Yuan, Xinyang
    Liang, Binbin
    Han, Songchen
    FRONTIERS IN NEUROROBOTICS, 2023, 16
  • [49] Local structured representation for generic object detection
    Zhang, Junge
    Huang, Kaiqi
    Tan, Tieniu
    Zhang, Zhaoxiang
    FRONTIERS OF COMPUTER SCIENCE, 2017, 11 (04) : 632 - 648
  • [50] Local structured representation for generic object detection
    Junge Zhang
    Kaiqi Huang
    Tieniu Tan
    Zhaoxiang Zhang
    Frontiers of Computer Science, 2017, 11 : 632 - 648