Relational Part-Aware Learning for Complex Composite Object Detection in High-Resolution Remote Sensing Images

被引:4
作者
Yuan, Shuai [1 ]
Zhang, Lixian [2 ]
Dong, Runmin [3 ,4 ]
Xiong, Jie [5 ]
Zheng, Juepeng [6 ]
Fu, Haohuan [3 ,4 ,7 ]
Gong, Peng [1 ]
机构
[1] Univ Hong Kong, Dept Geog, Hong Kong, Peoples R China
[2] Natl Supercomp Ctr Shenzhen, High Performance Comp Dept, Shenzhen 518055, Peoples R China
[3] Tsinghua Univ, Dept Earth Syst Sci, Key Lab Earth Syst Modeling, Minist Educ, Beijing 100190, Peoples R China
[4] Tsinghua Univ, Xian Inst Surveying & Mapping, Joint Res Ctr Next Generat Smart Mapping, Beijing 100190, Peoples R China
[5] ESSCA Sch Management, Dept Strategy Entrepreneurship & Int Business, F-49000 Angers, France
[6] Sun Yat Sen Univ Zhuhai, Sch Artificial Intelligence, Zhuhai 510275, Peoples R China
[7] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Feature extraction; Transformers; Power generation; Semantics; Correlation; Remote sensing; Complex composite object detection; high-resolution remote sensing images (RSIs); inter-relationship; Transformer; CONVOLUTIONAL NEURAL-NETWORK;
D O I
10.1109/TCYB.2024.3392474
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In high-resolution remote sensing images (RSIs), complex composite object detection (e.g., coal-fired power plant detection and harbor detection) is challenging due to multiple discrete parts with variable layouts leading to complex weak inter-relationship and blurred boundaries, instead of a clearly defined single object. To address this issue, this article proposes an end-to-end framework, i.e., relational part-aware network (REPAN), to explore the semantic correlation and extract discriminative features among multiple parts. Specifically, we first design a part region proposal network (P-RPN) to locate discriminative yet subtle regions. With butterfly units (BFUs) embedded, feature-scale confusion problems stemming from aliasing effects can be largely alleviated. Second, a feature relation Transformer (FRT) plumbs the depths of the spatial relationships by part-and-global joint learning, exploring correlations between various parts to enhance significant part representation. Finally, a contextual detector (CD) classifies and detects parts and the whole composite object through multirelation-aware features, where part information guides to locate the whole object. We collect three remote sensing object detection datasets with four categories to evaluate our method. Consistently surpassing the performance of state-of-the-art methods, the results of extensive experiments underscore the effectiveness and superiority of our proposed method.
引用
收藏
页码:6118 / 6131
页数:14
相关论文
共 61 条
[1]   Remote Sensing Image Scene Classification Using Multiscale Feature Fusion Covariance Network With Octave Convolution [J].
Bai, Lin ;
Liu, Qingxin ;
Li, Cuiling ;
Ye, Zhen ;
Hui, Meng ;
Jia, Xiuping .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[2]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[3]  
Bottou L., 2012, Neural Networks: Tricks of the Trade, P421, DOI DOI 10.1007/978-3-642-35289-8_25
[4]  
Branson G., 2014, ARXIV
[5]   Airport Detection Using End-to-End Convolutional Neural Network with Hard Example Mining [J].
Cai, Bowen ;
Jiang, Zhiguo ;
Zhang, Haopeng ;
Zhao, Danpei ;
Yao, Yuan .
REMOTE SENSING, 2017, 9 (11)
[6]  
Cao D., 2024, P 37 ADV NEUR INF PR, P1
[7]   Deep learning for autonomous ship-oriented small ship detection [J].
Chen, Zhijun ;
Chen, Depeng ;
Zhang, Yishi ;
Cheng, Xiaozhao ;
Zhang, Mingyang ;
Wu, Chaozhong .
SAFETY SCIENCE, 2020, 130
[8]   SFRNet: Fine-Grained Oriented Object Recognition via Separate Feature Refinement [J].
Cheng, Gong ;
Li, Qingyang ;
Wang, Guangxing ;
Xie, Xingxing ;
Min, Lingtong ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[9]   Anchor-Free Oriented Proposal Generator for Object Detection [J].
Cheng, Gong ;
Wang, Jiabao ;
Li, Ke ;
Xie, Xingxing ;
Lang, Chunbo ;
Yao, Yanqing ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   Dual-Aligned Oriented Detector [J].
Cheng, Gong ;
Yao, Yanqing ;
Li, Shengyang ;
Li, Ke ;
Xie, Xingxing ;
Wang, Jiabao ;
Yao, Xiwen ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60