Enhancing representation learning by exploiting effective receptive fields for object detection

被引：7

作者：

Wang, Qijin ^{[1
,2
,3
]}

Zhang, Shengyu ^{[1
]}

Qian, Yu ^{[3
]}

Zhang, Guangcai ^{[4
]}

Wang, Hongqiang ^{[1
,3
]}

机构：

[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China

[2] Univ Sci & Technol China, Hefei, Peoples R China

[3] Anhui Xinhua Univ, Hefei 230088, Peoples R China

[4] Anhui Normal Univ, Coll Comp & Informat, Wuhu, Peoples R China

来源：

NEUROCOMPUTING | 2022年 / 481卷

基金：

中国国家自然科学基金;

关键词：

Object detection; Region proposal network (RPN); Convolutional neural network (CNN); Effective receptive field; PROPOSAL;

D O I：

10.1016/j.neucom.2022.01.020

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most of state-of-the-art object detectors depend on multiple anchors/reference boxes in representation learning. However, such anchor-based representation does not completely match with the visual information perceived by the sliding windows, thus degrading the overall performance of object detection. In this paper, we present an effective receptive field (eRF)-dependent region proposal network (eRPN) for proposal generation, which enhances the anchor-based representation via eRFs. Specifically, we define an eRF for each sliding window on the feature map and only encode objects within the eRF for unbiasedly representation learning. The size of eRF depends on its backbone network. An eRF-based matching rule is devised and combined with the commonly used IoU rule for pertinent sample selection. We also design an eRF filter module, which can be appended to RPN for eliminating redundant low-quality region proposals in inference time. eRPN enhances representation learning from two perspectives: input information and sample balance, to make generating region proposals more robust. We evaluate eRPN by combining with two commonly used detection heads: Faster RCNN and Faster RCNN w FPN(FasterFPN). Experimental results on PASCAL VOC dataset and MS COCO dataset benchmarks demonstrate the effectiveness of the proposed method in learning representation for object detection. (c) 2022 Elsevier B.V. All rights reserved.

引用

页码：22 / 32

页数：11

共 50 条

[31] Structural Sparse Representation for Object Detection
FANG Wenhua
CHEN Jun
HU Ruimin
Wuhan University Journal of Natural Sciences, 2017, 22 (04) : 318 - 322
[32] Circle Representation for Medical Object Detection
Nguyen, Ethan H.
Yang, Haichun
Deng, Ruining
Lu, Yuzhe
Zhu, Zheyu
Roland, Joseph T.
Lu, Le
Landman, Bennett A.
Fogo, Agnes B.
Huo, Yuankai
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (03) : 746 - 754
[33] Enhancing object detection in aerial images
Pandey, Vishal
Anand, Khushboo
Kalra, Anmol
Gupta, Anmol
Roy, Partha Pratim
Kim, Byung-Gyu
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2022, 19 (08) : 7920 - 7932
[34] REPRESENTATION RECONSTRUCTION HEAD FOR OBJECT DETECTION
Miao, Shuyu
Feng, Rui
Zhang, Yuejie
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1516 - 1520
[35] Enhancing Object Detection With Fourier Series
Liu, Jin
Lu, Zhongyuan
Cen, Yaorong
Hu, Hui
Shao, Zhenfeng
Hong, Yong
Jiang, Ming
Xu, Miaozhong
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 2581 - 2596
[36] Effective Rotate: Learning Rotation-Robust Prototype for Aerial Object Detection
Wang, Chaowei
Guo, Guangqian
Liu, Chang
Shao, Dian
Gao, Shan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 14
[37] DFL-Net: Effective Object Detection via Distinguishable Feature Learning
Xie, Jia
Wan, Shouhong
Jin, Peiquan
DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT II, 2021, 12924 : 195 - 206
[38] Exploiting LSTM for Joint Object and Semantic Part Detection
Yao, Qi
Gong, Xiaojin
COMPUTER VISION - ACCV 2018, PT V, 2019, 11365 : 498 - 512
[39] FMDL: Enhancing Open-World Object Detection with foundation models and dynamic learning
Huang, Yangyang
Hu, Jie
Luo, Ronghua
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275
[40] Enhancing object detection in low-resolution images via frequency domain learning
Gao, Shuaiqiang
Chen, Yunliang
Cui, Ningning
Qin, Wenjian
ARRAY, 2024, 22

← 1 2 3 4 5 →