Enhancing representation learning by exploiting effective receptive fields for object detection

被引：7

作者：

Wang, Qijin ^{[1
,2
,3
]}

Zhang, Shengyu ^{[1
]}

Qian, Yu ^{[3
]}

Zhang, Guangcai ^{[4
]}

Wang, Hongqiang ^{[1
,3
]}

机构：

[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China

[2] Univ Sci & Technol China, Hefei, Peoples R China

[3] Anhui Xinhua Univ, Hefei 230088, Peoples R China

[4] Anhui Normal Univ, Coll Comp & Informat, Wuhu, Peoples R China

来源：

NEUROCOMPUTING | 2022年 / 481卷

基金：

中国国家自然科学基金;

关键词：

Object detection; Region proposal network (RPN); Convolutional neural network (CNN); Effective receptive field; PROPOSAL;

D O I：

10.1016/j.neucom.2022.01.020

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most of state-of-the-art object detectors depend on multiple anchors/reference boxes in representation learning. However, such anchor-based representation does not completely match with the visual information perceived by the sliding windows, thus degrading the overall performance of object detection. In this paper, we present an effective receptive field (eRF)-dependent region proposal network (eRPN) for proposal generation, which enhances the anchor-based representation via eRFs. Specifically, we define an eRF for each sliding window on the feature map and only encode objects within the eRF for unbiasedly representation learning. The size of eRF depends on its backbone network. An eRF-based matching rule is devised and combined with the commonly used IoU rule for pertinent sample selection. We also design an eRF filter module, which can be appended to RPN for eliminating redundant low-quality region proposals in inference time. eRPN enhances representation learning from two perspectives: input information and sample balance, to make generating region proposals more robust. We evaluate eRPN by combining with two commonly used detection heads: Faster RCNN and Faster RCNN w FPN(FasterFPN). Experimental results on PASCAL VOC dataset and MS COCO dataset benchmarks demonstrate the effectiveness of the proposed method in learning representation for object detection. (c) 2022 Elsevier B.V. All rights reserved.

引用

页码：22 / 32

页数：11

共 50 条

[21] Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection
Xu, Hongyu
Lv, Xutao
Wang, Xiaoyu
Ren, Zhou
Bodla, Navaneeth
Chellappa, Rama
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (06) : 1914 - 1927
[22] TranSDet: Toward Effective Transfer Learning for Small-Object Detection
Xu, Xinkai
Zhang, Hailan
Ma, Yan
Liu, Kang
Bao, Hong
Qian, Xu
REMOTE SENSING, 2023, 15 (14)
[23] An effective sign language learning with object detection based ROI segmentation
Kim, Sunmok
Ji, Yangho
Lee, Ki-Baek
2018 SECOND IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC), 2018, : 330 - 333
[24] From detection to understanding: A survey on representation learning for human-object interaction
Luo, Tianlun
Guan, Steven
Yang, Rui
Smith, Jeremy
NEUROCOMPUTING, 2023, 543
[25] Generalized Focal Loss: Towards Efficient Representation Learning for Dense Object Detection
Li, Xiang
Lv, Chengqi
Wang, Wenhai
Li, Gang
Yang, Lingfeng
Yang, Jian
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3139 - 3153
[26] Cross-Modality Data Augmentation for Aerial Object Detection with Representation Learning
Wei, Chiheng
Bai, Lianfa
Chen, Xiaoyu
Han, Jing
REMOTE SENSING, 2024, 16 (24)
[27] Cross-Transfer Learning for Enhancing Object Detection in Remote Sensing Images
Musunuri, Yogendra Rao
Kwon, Oh-Seol
Kung, Sun-Yuan
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
[28] Multilevel receptive field expansion network for small object detection
Liu, Zhiwei
Gan, Menghan
Xiong, Li
Mao, Xiaofeng
Que, Yue
IET IMAGE PROCESSING, 2023, 17 (08) : 2385 - 2398
[29] Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation
Zheng, Zhaohui
Wang, Ping
Ren, Dongwei
Liu, Wei
Ye, Rongguang
Hu, Qinghua
Zuo, Wangmeng
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8574 - 8586
[30] Deep Learning-based Object Detection for Crop Monitoring in Soybean Fields
Pratama, Muhammad Taufiq
Kim, Sangwook
Ozawa, Seiichi
Ohkawa, Takenao
Chona, Yuya
Tsuji, Hiroyuki
Murakami, Noriyuki
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,

← 1 2 3 4 5 →