Learning geometric Jensen-Shannon divergence for tiny object detection in remote sensing images

被引:3
作者
Ni, Shuyan [1 ]
Lin, Cunbao [1 ]
Wang, Haining [2 ,3 ]
Li, Yang [1 ]
Liao, Yurong [1 ]
Li, Na [2 ,3 ]
机构
[1] Space Engn Univ, Dept Elect & Opt Engn, Beijing, Peoples R China
[2] Beihang Univ, Inst Artificial Intelligence, Beijing, Peoples R China
[3] Beihang Univ, Key Lab Minist Ind & Informat Technol, Aerosp Opt Microwave Integrated Precis Intelligent, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
tiny object detection; remote sensing images; Jensen-Shannon divergence; deep learning; Gaussian distribution;
D O I
10.3389/fnbot.2023.1273251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tiny objects in remote sensing images only have a few pixels, and the detection difficulty is much higher than that of regular objects. General object detectors lack effective extraction of tiny object features, and are sensitive to the Intersection-over-Union (IoU) calculation and the threshold setting in the prediction stage. Therefore, it is particularly important to design a tiny-object-specific detector that can avoid the above problems. This article proposes the network JSDNet by learning the geometric Jensen-Shannon (JS) divergence representation between Gaussian distributions. First, the Swin Transformer model is integrated into the feature extraction stage as the backbone to improve the feature extraction capability of JSDNet for tiny objects. Second, the anchor box and ground-truth are modeled as two two-dimensional (2D) Gaussian distributions, so that the tiny object is represented as a statistical distribution model. Then, in view of the sensitivity problem faced by the IoU calculation for tiny objects, the JSDM module is designed as a regression sub-network, and the geometric JS divergence between two Gaussian distributions is derived from the perspective of information geometry to guide the regression prediction of anchor boxes. Experiments on the AI-TOD and DOTA datasets show that JSDNet can achieve superior detection performance for tiny objects compared to state-of-the-art general object detectors.
引用
收藏
页数:11
相关论文
共 37 条
[1]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[2]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[3]   Context-Aware Block Net for Small Object Detection [J].
Cui, Lisha ;
Lv, Pei ;
Jiang, Xiaoheng ;
Gao, Zhimin ;
Zhou, Bing ;
Zhang, Luming ;
Shao, Ling ;
Xu, Mingliang .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (04) :2300-2313
[4]  
Deasy J., 2020, ADV NEUR IN
[5]   Extended Feature Pyramid Network for Small Object Detection [J].
Deng, Chunfang ;
Wang, Mengmeng ;
Liu, Liang ;
Liu, Yong ;
Jiang, Yunliang .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :1968-1979
[6]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[7]   TCANet: Triple Context-Aware Network for Weakly Supervised Object Detection in Remote Sensing Images [J].
Feng, Xiaoxu ;
Han, Junwei ;
Yao, Xiwen ;
Cheng, Gong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08) :6946-6955
[8]   OTA: Optimal Transport Assignment for Object Detection [J].
Ge, Zheng ;
Liu, Songtao ;
Liu, Zeming ;
Yoshie, Osamu ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :303-312
[9]   Multi-Vision Network for Accurate and Real-Time Small Object Detection in Optical Remote Sensing Images [J].
Han, Wenxuan ;
Kuerban, Alifu ;
Yang, Yuchun ;
Huang, Zitong ;
Liu, Binghui ;
Gao, Jie .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[10]   GAN-Based Synthetic Data Augmentation for Infrared Small Target Detection [J].
Kim, Jun-Hyung ;
Hwang, Youngbae .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60