Progress of research on deep learning algorithms for object detection in optical remote sensing images

被引:0
作者
Xu, Danqing [1 ]
Wu, Yiquan [1 ]
机构
[1] College of Electronic and Information Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing
基金
中国国家自然科学基金;
关键词
dataset; deep learning; evaluation index; incomplete supervised learning; object characteristics; object detection; object detection framework; object detection process; optical remote sensing image;
D O I
10.11834/jrs.20243166
中图分类号
学科分类号
摘要
Among all applications of optical remote sensing images, object detection has always been given more attention by researchers. Object detection has a wide application prospect in military and civilian fields. This study reviews the progress of research on object detection algorithms in optical remote sensing images on the basis of deep learning. The characteristics of remote sensing objects are different from those of conventional objects. First, remote sensing equipment has a long imaging distance, so it can cover a large range. The images may have objects with large scale and shape changes. Second, in remote sensing images, the background tends to occupy a large area. As a result, some objects are often submerged in the complex background, and detectors cannot distinguish these objects effectively. Last, in remote sensing images, the objects do not only have a small size and changeable direction. Sometimes, remote sensing objects are densely distributed, posing challenges to the detection of optical remote sensing objects. This study introduces the development process of optical remote sensing object detection algorithms from template matching, prior knowledge, and machine learning to deep learning. Then, the process of optical remote sensing object detection based on deep learning, including data preprocessing, feature extraction, detection, and postprocessing, is introduced in detail. Classical deep learning-based object detection algorithms, including the one-stage algorithms represented by YOLO and SSD and the two-stage algorithm represented by Faster RCNN, are summarized. Afterward, in accordance with the characteristics of optical remote sensing image objects, various improved algorithms for addressing the optical remote sensing image object detection problems of scale diversity, direction diversity, shape diversity, small size, feature similarity, background complexity, distribution density, and weak features are systematically summarized. Non-strong supervised learning-based optical remote sensing image object detection methods and other advanced algorithms, such as Transformer-based algorithms, transfer learning-based algorithms, knowledge graph-based algorithms, and prior knowledge-based algorithms, are also summarized. In addition, open-source optical remote sensing image datasets and the performance of object detection evaluation indexes are introduced. The Mean Average Precision (mAP) of advanced algorithms on the NWPU-VHR10 dataset can exceed 90%. On the DOTA dataset, the mAP of each advanced algorithm decreases considerably, and the advanced algorithms proposed in recent years continue to improve their performance on this dataset. Multiscale fusion with a feature pyramid network has become the mainstream method of advanced algorithms, which can detect multiscale objects effectively. Many improved algorithms have been proposed to solve the abovementioned problems in optical remote sensing image object detection, and good detection results have been achieved. However, the research on object detection in large-scale remote sensing images and similar object detection between classes remains lacking. Furthermore, this study proposes future development directions, such as improved deep learning networks, lightweight networks, weakly supervised learning, small-object detection, and improved rotary detection mechanisms. © 2024 Science Press. All rights reserved.
引用
收藏
页码:3045 / 3073
页数:28
相关论文
共 141 条
[1]  
Azimi S M, Vig E, Bahmanyar R, Korner M, Reinartz P., Towards multi-class object detection in unconstrained remote sensing imagery, 14th Asian Conference on Computer Vision (ACCV), pp. 150-165, (2019)
[2]  
Bai Z, Li G Y, Liu Z., Global-local-global context-aware network for salient object detection in optical remote sensing images, ISPRS Journal of Photogrammetry and Remote Sensing, 198, pp. 184-196, (2023)
[3]  
Bilen H, Vedaldi A., Weakly supervised deep detection networks, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2846-2854, (2016)
[4]  
Bochkovskiy A, Wang C Y, Liao H Y M., YOLOv4: optimal speed and accuracy of object detection, (2020)
[5]  
Carion N, Massa F, Synnaeve G, Usunier N, Kirillov A, Zagoruyko S., End-to-end object detection with transformers, Proceedings of the 16th European Conference on Computer Vision, pp. 213-229, (2020)
[6]  
Chen G W, Liu L, Guo J Y, Pan Z X, Hu W L., Semi-supervised airplane detection in remote sensing images using generative adversarial networks, Journal of University of Chinese Academy of Sciences, 37, 4, pp. 539-546, (2020)
[7]  
Chen H, Cheng L, Zhuang Q Z, Zhang K, Li N, Liu L, Duan Z X., Structure-aware weakly supervised network for building extraction from remote sensing images, IEEE Transactions on, (2022)
[8]  
Geoscience and Remote Sensing, 60
[9]  
Chen H, Zhang L B, Ma J, Zhang J., Target heat-map network: an end-to-end deep network for target detection in remote sensing images, Neurocomputing, 331, pp. 375-387, (2019)
[10]  
Chen J Y, Sun J G, Li Y Q, Hou C B., Object detection in remote sensing images based on deep transfer learning, Multimedia Tools and Applications, 81, 9, pp. 12093-12109, (2022)