Relational Part-Aware Learning for Complex Composite Object Detection in High-Resolution Remote Sensing Images

被引:4
作者
Yuan, Shuai [1 ]
Zhang, Lixian [2 ]
Dong, Runmin [3 ,4 ]
Xiong, Jie [5 ]
Zheng, Juepeng [6 ]
Fu, Haohuan [3 ,4 ,7 ]
Gong, Peng [1 ]
机构
[1] Univ Hong Kong, Dept Geog, Hong Kong, Peoples R China
[2] Natl Supercomp Ctr Shenzhen, High Performance Comp Dept, Shenzhen 518055, Peoples R China
[3] Tsinghua Univ, Dept Earth Syst Sci, Key Lab Earth Syst Modeling, Minist Educ, Beijing 100190, Peoples R China
[4] Tsinghua Univ, Xian Inst Surveying & Mapping, Joint Res Ctr Next Generat Smart Mapping, Beijing 100190, Peoples R China
[5] ESSCA Sch Management, Dept Strategy Entrepreneurship & Int Business, F-49000 Angers, France
[6] Sun Yat Sen Univ Zhuhai, Sch Artificial Intelligence, Zhuhai 510275, Peoples R China
[7] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Feature extraction; Transformers; Power generation; Semantics; Correlation; Remote sensing; Complex composite object detection; high-resolution remote sensing images (RSIs); inter-relationship; Transformer; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1109/TCYB.2024.3392474
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In high-resolution remote sensing images (RSIs), complex composite object detection (e.g., coal-fired power plant detection and harbor detection) is challenging due to multiple discrete parts with variable layouts leading to complex weak inter-relationship and blurred boundaries, instead of a clearly defined single object. To address this issue, this article proposes an end-to-end framework, i.e., relational part-aware network (REPAN), to explore the semantic correlation and extract discriminative features among multiple parts. Specifically, we first design a part region proposal network (P-RPN) to locate discriminative yet subtle regions. With butterfly units (BFUs) embedded, feature-scale confusion problems stemming from aliasing effects can be largely alleviated. Second, a feature relation Transformer (FRT) plumbs the depths of the spatial relationships by part-and-global joint learning, exploring correlations between various parts to enhance significant part representation. Finally, a contextual detector (CD) classifies and detects parts and the whole composite object through multirelation-aware features, where part information guides to locate the whole object. We collect three remote sensing object detection datasets with four categories to evaluate our method. Consistently surpassing the performance of state-of-the-art methods, the results of extensive experiments underscore the effectiveness and superiority of our proposed method.
引用
收藏
页码:6118 / 6131
页数:14
相关论文
共 61 条
[51]   Deep Learning Based Fossil-Fuel Power Plant Monitoring in High Resolution Remote Sensing Images: A Comparative Study [J].
Zhang, Haopeng ;
Deng, Qin .
REMOTE SENSING, 2019, 11 (09)
[52]   Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training [J].
Zhang, Hongkai ;
Chang, Hong ;
Ma, Bingpeng ;
Wang, Naiyan ;
Chen, Xilin .
COMPUTER VISION - ECCV 2020, PT XV, 2020, 12360 :260-275
[53]   Learning a Mixture of Granularity-Specific Experts for Fine-Grained Categorization [J].
Zhang, Lianbo ;
Huang, Shaoli ;
Liu, Wei ;
Tao, Dacheng .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :8330-8339
[54]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[55]   Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition [J].
Zheng, Heliang ;
Fu, Jianlong ;
Zha, Zheng-Jun ;
Luo, Jiebo ;
Mei, Tao .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :476-488
[56]   Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition [J].
Zheng, Heliang ;
Fu, Jianlong ;
Mei, Tao ;
Luo, Jiebo .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5219-5227
[57]   Surveying coconut trees using high-resolution satellite imagery in remote atolls of the Pacific Ocean [J].
Zheng, Juepeng ;
Yuan, Shuai ;
Wu, Wenzhao ;
Li, Weijia ;
Yu, Le ;
Fu, Haohuan ;
Coomes, David .
REMOTE SENSING OF ENVIRONMENT, 2023, 287
[58]   Growing status observation for oil palm trees using Unmanned Aerial Vehicle (UAV) images [J].
Zheng, Juepeng ;
Fu, Haohuan ;
Li, Weijia ;
Wu, Wenzhao ;
Yu, Le ;
Yuan, Shuai ;
Tao, Wai Yuk William ;
Pang, Tan Kian ;
Kanniah, Kasturi Devi .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 173 (173) :95-121
[59]  
Zhu HG, 2015, IEEE IMAGE PROC, P3735, DOI 10.1109/ICIP.2015.7351502
[60]   Interpretable and Accurate Fine-grained Recognition via Region Grouping [J].
Huang, Zixuan ;
Li, Yin .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8659-8669