Aircraft type recognition method by integrating target segmentation and key points detection

被引:0
|
作者
Liu S. [1 ,2 ]
Wang Q. [1 ]
Zhang L. [1 ]
Han X. [1 ]
Wang B. [1 ]
Liu Y. [3 ]
机构
[1] Chinese Academy of Surveying and Mapping, Beijing
[2] Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou
[3] Shenzhen Investigation & Research Institute Co., Ltd, Shenzhen
关键词
aircraft type recognition; conditional random field; key points detection; object detection; segmentation;
D O I
10.11834/jrs.20221737
中图分类号
学科分类号
摘要
Aircraft detection via deep learning is a popular field in remote sensing image analysis. However, given the limited perspectives of satellite imagery and high similarities in image appearance, aircraft type recognition remains a challenging task. The existing deep learning methods cannot be satisfactorily applied to fine-grained aircraft type recognition tasks, which require refined labels for datasets. With the aim of effectively recognizing aircraft types in remote sensing images, we propose an integrated target segmentation and key point detection method for aircraft type recognition. The proposed method combines an organic multitask deep neural network with a conditional random field and template matching algorithm to achieve high-precision recognition of aircraft types by pretraining, fine-tuning, and postprocessing. First, we performed target aircraft position and mask and keypoint recognition by deploying multitask learning and transfer learning technology. Second, to facilitate high-precision template matching in the later stage, we utilized an aircraft target mask refinement algorithm and a keypoint-based mask attitude adjustment algorithm to achieve boundary refinement of the recognition target and aircraft target mask attitude adjustment. Finally, on the basis of the aircraft type template library constructed in this study, we matched the refined aircraft mask information with the template library to identify the aircraft type. The proposed algorithm was applied to the MTARSI dataset and remote sensing images for verification. The results showed that the recognition accuracy of the 11 types of images was 89%. Aircraft with simple structures and unique shapes, such as B-2 and B-1, exhibited high recognition accuracy, whereas aircraft with complex structures and high similarity with other shapes, such as E-3 reconnaissance aircraft, exhibited low recognition accuracy. Subsequently, the algorithm was compared with traditional algorithms and end-to-end deep learning methods. Eleven types of aircraft were studied. The results showed that the accuracy of our method was 15.4% and 20.7% better than those of the other two methods. The use of target segmentation and keypoint information has achieved good results in model recognition on high-resolution remote sensing images. However, limitations remain in terms of the breadth of identifiable aircraft types; therefore, further research is needed to address this research gap. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1010 / 1024
页数:14
相关论文
共 35 条
  • [1] Ai J Q, Tian R T, Luo Q W, Jin J, Tang B., Multi-scale rotation-invariant haar-like feature integrated CNN-based ship detection algorithm of multiple-target environment in SAR imagery, IEEE Transactions on Geoscience and Remote Sensing, 57, 12, pp. 10070-10087, (2019)
  • [2] Cao Z, Simon T, Wei S E, Sheikh Y., Realtime multi-person 2D pose estimation using part affinity fields, Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1302-1310, (2017)
  • [3] Chen S Z, Wang H P, Xu F, Jin Y Q., Target classification using the deep convolutional networks for SAR images, IEEE Transactions on Geoscience and Remote Sensing, 54, 8, pp. 4806-4817, (2016)
  • [4] Chen Y L, Wang Z C, Peng Y X, Zhang Z Q, Yu G, Sun J., Cascaded pyramid network for multi-person pose estimation, Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7103-7112, (2018)
  • [5] Diao W H, Sun X, Dou F Z, Yan M L, Wang H Q, Fu K., Object recognition in remote sensing images using sparse deep belief networks, Remote Sensing Letters, 6, 10, pp. 745-754, (2015)
  • [6] Fang H S, Xie S Q, Tai Y W, Lu C W., RMPE: regional multi-person pose estimation, Proceedings of 2017 IEEE International Conference on Computer Vision, pp. 2353-2362, (2017)
  • [7] Fu K, Dai W, Zhang Y, Wang Z R, Yan M L, Sun X., MultiCAM: multiple class activation mapping for aircraft recognition in remote sensing images, Remote Sensing, 11, 5, (2019)
  • [8] Guo Q, Wang H P, Xu F., Scattering enhanced attention pyramid network for aircraft detection in SAR images, IEEE Transactions on Geoscience and Remote Sensing, 59, 9, pp. 7570-7587, (2021)
  • [9] He K M, Gkioxari G, Dollar P, Girshick R., Mask R-CNN, IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 2, pp. 386-397, (2020)
  • [10] He K M, Zhang X Y, Ren S Q, Sun J., Deep residual learning for image recognition, Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)