Renormalized Connection for Scale-Preferred Object Detection in Satellite Imagery

被引:0
作者
Zhang, Fan [1 ]
Li, Lingling [1 ]
Jiao, Licheng [1 ]
Liu, Xu [1 ]
Liu, Fang [1 ]
Yang, Shuyuan [1 ]
Hou, Biao [1 ]
机构
[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Task analysis; Feature extraction; Object detection; Detectors; Focusing; Satellite images; Training; Feature extraction (FE); knowledge discovery network (KDN); object detection; remote sensing; renormalized connection (RC); satellite image object detection; small object detection;
D O I
10.1109/TGRS.2024.3440881
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Satellite imagery, due to its long-range imaging, brings with it a variety of scale-preferred tasks, such as the detection of tiny/small objects, making the precise localization and detection of small objects of interest a challenging task. In this article, we design a knowledge discovery network (KDN) to implement the renormalization group theory in terms of efficient feature extraction (FE). Renormalized connection (RC) on the KDN enables "synergistic focusing" of multiscale features. Based on our observations of KDN, we abstract a class of RCs with different connection strengths, called $n21$ C, and generalize it to feature pyramid network (FPN)-based multibranch detectors. In a series of FPN experiments on the scale-preferred tasks, we found that the "divide-and-conquer" idea of FPN severely hampers the detector's learning in the right direction due to the large number of large-scale negative samples and interference from background noise. Moreover, these negative samples cannot be eliminated by the focal loss function. The RCs extends the multilevel feature's "divide-and-conquer" mechanism of the FPN-based detectors to a wide range of scale-preferred tasks, and enables synergistic effects of multilevel features on the specific learning goal. In addition, interference activations in two aspects are greatly reduced and the detector learns in a more correct direction. Extensive experiments of 17 well-designed detection architectures embedded with $n21$ Cs on five different levels of scale-preferred tasks validate the effectiveness and efficiency of the RCs. Especially the simplest linear form of RC-E421C performs well in all tasks, and it satisfies the scaling property of renormalization group theory. All experiments can be trained and tested on a graphics card with 8 GB of video memory, which greatly enhances the applicability of our methodology. We hope that our approach will transfer a large number of well-designed detectors from the computer vision community to the remote sensing community. Datasets and codes will be available at: https://github.com/rabbitme/
引用
收藏
页数:23
相关论文
共 50 条
[31]   Object Detection in Remote Sensing Imagery with Multi-scale Deformable Convolutional Networks [J].
Deng Z. ;
Sun H. ;
Lei L. ;
Zhou S. ;
Zou H. .
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2018, 47 (09) :1216-1227
[32]   Multi-scale object detection in remote sensing imagery with convolutional neural networks [J].
Deng, Zhipeng ;
Sun, Hao ;
Zhou, Shilin ;
Zhao, Juanping ;
Lei, Lin ;
Zou, Huanxin .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 :3-22
[33]   YoloOW: A Spatial Scale Adaptive Real-Time Object Detection Neural Network for Open Water Search and Rescue From UAV Aerial Imagery [J].
Xu, Jianhao ;
Fan, Xiangtao ;
Jian, Hongdeng ;
Xu, Chen ;
Bei, Weijia ;
Ge, Qifeng ;
Zhao, Teng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[34]   FLDCF: A Collaborative Framework for Forgery Localization and Detection in Satellite Imagery [J].
Sui, Jialu ;
Ma, Ding ;
Kuo, C. -C. Jay ;
Pun, Man-On .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[35]   Mutual-Feed Learning for Super-Resolution and Object Detection in Degraded Aerial Imagery [J].
Yang, Jinze ;
Fu, Kun ;
Wu, Youming ;
Diao, Wenhui ;
Dai, Wei ;
Sun, Xian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[36]   Self-Training-Based Unsupervised Domain Adaptation for Object Detection in Remote Sensing Imagery [J].
Luo, Sihao ;
Ma, Li ;
Yang, Xiaoquan ;
Luo, Dapeng ;
Du, Qian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[37]   Rotation Equivariant Feature Image Pyramid Network for Object Detection in Optical Remote Sensing Imagery [J].
Shamsolmoali, Pourya ;
Zareapoor, Masoumeh ;
Chanussot, Jocelyn ;
Zhou, Huiyu ;
Yang, Jie .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[38]   Intelligent Detection of Sports Fields Based on Very High-Resolution Stereo Satellite Imagery [J].
Shang, Dashuai ;
Zhang, Yongsheng ;
Dai, Chenguang ;
Zhang, Lei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
[39]   PCViT: A Pyramid Convolutional Vision Transformer Detector for Object Detection in Remote-Sensing Imagery [J].
Li, Jiaojiao ;
Tian, Penghao ;
Song, Rui ;
Xu, Haitao ;
Li, Yunsong ;
Du, Qian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-15
[40]   Towards Large-Scale Small Object Detection: Survey and Benchmarks [J].
Cheng, Gong ;
Yuan, Xiang ;
Yao, Xiwen ;
Yan, Kebing ;
Zeng, Qinghua ;
Xie, Xingxing ;
Han, Junwei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) :13467-13488