Instance segmentation for vehicle in UAV aerial images based on feature enhancement and calibration

被引：0

作者：

Yao Y. ^{[1
]}

Li X. ^{[1
]}

Xu Q. ^{[1
]}

Kong D. ^{[1
]}

机构：

[1] School of Instrument Science and Engineering, Southeast University, Nanjing

来源：

Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica | 2023年 / 44卷 / 24期

关键词：

attention mechanism; computer vision; feature enhancement; instance segmentation; UAV;

D O I：

10.7527/S1000-6893.2023.28397

中图分类号：

学科分类号：

摘要：

To overcome the challenges of dense vehicle distribution，occlusion，and large scale variation in instance segmentation of the vehicle in UAV aerial images，an instance segmentation method is proposed based on feature en⁃ hancement and calibration. Firstly，to make full use of the high-level semantic features extracted from the backbone and improve the prediction effect of vehicles of different sizes，a Multi-scale Semantic Enhancement Module（MSEM）is proposed，which is added to the highest level of the feature pyramid structure to fuse multi-scale features by densely connecting multiple dilated convolution blocks with different dilated rates and suppresses redundant information by us⁃ ing spatial attention，this module alleviates the loss of high-level semantic features and improves the multi-scale adapt⁃ ability of features. Secondly，to fully exploit the global context information of the features fused with the feature pyra⁃ mid to improve the prediction of small-scale vehicles and suppress the noise introduced by occlusion of local features，a Global-Local Feature Calibration Module（GLFCM）is proposed，which consists of a Global Feature Calibration Mod⁃ ule（GFCM）and a Local Feature Calibration Module（LFCM）. The GFCM applies channel attention and self-attention to achieve sufficient information interaction and calibration between feature levels，constructs dependencies between global feature maps and fully exploits global contextual information. The LFCM adopts the multi-head self-attention mechanism Transformer for the features at each level. The Transformer in the LFCM is linearized，which greatly re⁃ duces computation in comparison with the original one，and at the same time has a better suppression effect on the occlusion noise in local features. Experiments are conducted on the UVSD dataset. It is found that the proposed method improves the segmentation precision by 5. 1% for the one-stage network YOLACT，2. 9% for the two-stage network Mask R-CNN，and 1. 6% for the multi-stage network Cascade Mask R-CNN，which shows that the proposed method can significantly improve the instance segmentation precision for vehicles in UAV aerial images. © 2023 AAAS Press of Chinese Society of Aeronautics and Astronautics. All rights reserved.

引用

共 32 条

[1]

MARCATO J, MARQUES RAMOS A P，, Et al., A review on deep learning in UAV remote sens⁃ ing［J］, International Journal of Applied Earth Observa⁃ tion and Geoinformation, 102, (2021)

[2]

ZHANG H, Et al., A survey on instance segmentation：Recent advances and challenges［J］, Inter⁃ national Journal of Innovative Computing，Information and Control, 17, 3, pp. 1041-1053, (2021)

[3]

ZHU X X., Vehicle instance segmentation from aerial image and video using a multitask learning re⁃ sidual fully convolutional network［J］, IEEE Transac⁃ tions on Geoscience and Remote Sensing, 56, 11, pp. 6699-6711, (2018)

[4]

Deep learning approach for car detection in UAV imagery［J］, Remote Sensing, 9, 4, (2017)

[5]

ZISSERMAN A., Very deep convolu⁃ tional networks for large-scale image recognition［DB/ OL］, (2014)

[6]

LEFEVRE S., Segment-before-detect：Vehicle detection and classification through semantic segmentation of aerial images［J］, Remote Sens⁃ ing, 9, 4, (2017)

[7]

CIPOLLA R., SegNet：A deep convolutional encoder-decoder archi⁃ tecture for image segmentation［J］, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 12, pp. 2481-2495, (2017)

[8]

XIAO F Y，, Et al., YOLACT：Real-time instance segmentation［C］∥2019 IEEE/CVF International Conference on Computer Vision（ICCV）, pp. 9156-9165, (2020)

[9]

TIAN Z, CHEN H., Conditional convolu⁃ tions for instance segmentation［M］∥Computer Vision–ECCV 2020, pp. 282-298, (2020)

[10]

IGARSS 2022 - 2022 IEEE International Geo⁃ science and Remote Sensing Symposium, pp. 2187-2190, (2022)

← 1 2 3 4 →