Dual-Resolution and Deformable Multihead Network for Oriented Object Detection in Remote Sensing Images

被引:2
作者
Yu, Donghang [1 ,2 ,3 ]
Xu, Qing [1 ,2 ,3 ]
Liu, Xiangyun [1 ,2 ,3 ]
Guo, Haitao [1 ,2 ,3 ]
Lu, Jun [1 ,2 ,3 ]
Lin, Yuzhun [1 ,2 ,3 ]
Lv, Liang [1 ,2 ,3 ]
机构
[1] PLA Strateg Support Force Informat Engn Univ, Zhengzhou 450001, Peoples R China
[2] Collaborat Innovat Ctr Geoinformat Technol Smart C, Zhengzhou 450001, Peoples R China
[3] Minist Nat Resources, Key Lab Spatiotemporal Percept & Intelligent Proc, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Object detection; Remote sensing; Inference algorithms; Proposals; Prediction algorithms; Convolution; Box boundary-aware vectors; deformable feature fusion; multihead network; oriented object detection; remote sensing image;
D O I
10.1109/JSTARS.2022.3230797
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Compared with general object detection, the scale variations, arbitrary orientations, and complex backgrounds of objects in remote sensing images make it more challenging to detect oriented objects. Especially for oriented objects that have large aspect ratios, it is more difficult to accurately detect their boundary. Many methods show excellent performance on oriented object detection, most of which are anchor-based algorithms. To mitigate the performance gap between anchor-free algorithms and anchor-based algorithms, this article proposes an anchor-free algorithm called dual-resolution and deformable multihead network (DDMNet) for oriented object detection. Specifically, the dual-resolution network with bilateral fusion is adopted to extract high-resolution feature maps which contain both spatial details and multiscale contextual information. Then, the deformable convolution is incorporated into the network to alleviate the misalignment problem of oriented object detection. And a dilated feature fusion module is performed on the deformable feature maps to expand their receptive fields. Finally, box boundary-aware vectors instead of the angle are leveraged to represent the oriented bounding box and the multihead network is designed to get robust predictions. DDMNet is a single-stage oriented object detection method without using anchors and exhibits promising performance on the public challenging benchmarks. DDMNet obtains 90.49%, 93.25%, and 78.66% mean average precision on the HRSC2016, FGSD2021, and DOTA datasets. In particular, DDMNet achieves 79.86% at mAP(75) and 53.85% at mAP(85) on the HRSC2016 dataset, respectively, outperforming the current state-of-the-art methods.
引用
收藏
页码:930 / 945
页数:16
相关论文
共 69 条
[1]   A Refined Single-Stage Detector With Feature Enhancement and Alignment for Oriented Objects [J].
Chen, Si-Bao ;
Dai, Bei-Min ;
Tang, Jin ;
Luo, Bin ;
Wang, Wei-Qiang ;
Lv, Ke .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 :8898-8908
[2]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132
[3]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[4]   ACE: Anchor-Free Corner Evolution for Real-Time Arbitrarily-Oriented Object Detection [J].
Dai, Pengwen ;
Yao, Siyuan ;
Li, Zekun ;
Zhang, Sanyi ;
Cao, Xiaochun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :4076-4089
[5]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[6]   Point-Based Estimator for Arbitrary-Oriented Object Detection in Aerial Images [J].
Fu, Kun ;
Chang, Zhonghan ;
Zhang, Yue ;
Sun, Xian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (05) :4370-4387
[7]  
Ghorbanzadeh O, 2022, Arxiv, DOI [arXiv:2206.00515, DOI 10.48550/ARXIV.2206.00515, DOI 10.1109/TGRS.2022.3215209]
[8]   Transferable instance segmentation of dwellings in a refugee camp-integrating CNN and OBIA [J].
Ghorbanzadeh, Omid ;
Tiede, Dirk ;
Wendt, Lorenz ;
Sudmanns, Martin ;
Lang, Stefan .
EUROPEAN JOURNAL OF REMOTE SENSING, 2021, 54 (sup1) :127-140
[9]   Dual-det : a fast detector for oriented object detection in aerial images [J].
Guan, Qiuyu ;
Qu, Zhenshen ;
Zhao, Pengbo ;
Zeng, Ming ;
Liu, Junyu .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (24) :9542-9564
[10]   CGP Box: An effective direction representation strategy for oriented object detection in remote sensing images [J].
Guan, Qiuyu ;
Qu, Zhenshen ;
Zeng, Ming ;
Shen, Jianxiong ;
Du, Jingda .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2021, 42 (17) :6670-6691