Anchor-free object detection in remote sensing images using a variable receptive field network

被引:4
作者
Fu, Shenshen [1 ]
He, Yifan [2 ]
Du, Xiaofeng [1 ]
Zhu, Yi [1 ]
机构
[1] Xiamen Univ Technol, Fac Comp Sci, Sch Comp & Informat Engn, 600 Polytech Rd, Xiamen 361024, Peoples R China
[2] Shenzhen Polytech, Inst Intelligence Sci & Engn, 7098 Liuxian Ave, Shenzhen 518055, Peoples R China
关键词
Anchor-free detector; Convolution neutral network; Deformable convolution; Object detection; One stage detector; Remote sensing;
D O I
10.1186/s13634-023-01013-2
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Object detection is one of the essential tasks in computer vision, with most detection methods relying on a limited number of sizes for anchor boxes. However, the boundaries of particular composite objects, such as ports, highways, and golf courses, are ambiguous in remote sensing images, and therefore, it is challenging for the anchor-based method to accommodate the substantial size variation of the objects. In addition, the dense placement of anchor boxes imbalances the positive and negative samples, which affects the end-to-end architecture of deep learning methods. Hence, this paper proposes a single-stage object detection model named Xnet to address this issue. The proposed method designs a deformable convolution backbone network used in the feature extraction stage. Compared to the standard convolution, it adds learnable parameters for dynamically analyzing the boundary and offset of the receptive field, rendering the model more adaptable to size variations within the same class. Moreover, this paper presents a novel anchor-free detector that classifies objects in feature images point-by-point, without relying on anchor boxes. Several experiments on the large remote sensing dataset DIOR challenging Xnet against other popular methods demonstrate that our method attains the best performance, surpassing by 4.7% on the mAP (mean average precision) metric.
引用
收藏
页数:19
相关论文
共 35 条
[1]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[2]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[3]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[4]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[5]   A light and faster regional convolutional neural network for object detection in optical remote sensing images [J].
Ding, Peng ;
Zhang, Ye ;
Deng, Wei-Jian ;
Jia, Ping ;
Kuijper, Arjan .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 141 :208-218
[6]   Sig-NMS-Based Faster R-CNN Combining Transfer Learning for Small Target Detection in VHR Optical Remote Sensing Imagery [J].
Dong, Ruchan ;
Xu, Dazhuan ;
Zhao, Jin ;
Jiao, Licheng ;
An, Jungang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (11) :8534-8545
[7]   Attention-Based Multi-Level Feature Fusion for Object Detection in Remote Sensing Images [J].
Dong, Xiaohu ;
Qin, Yao ;
Gao, Yinghui ;
Fu, Ruigang ;
Liu, Songlin ;
Ye, Yuanxin .
REMOTE SENSING, 2022, 14 (15)
[8]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[9]  
Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, 10.48550/arXiv.2107.08430, DOI 10.48550/ARXIV.2107.08430]
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587