SEMANTIC SEGMENTATION OF UAV IMAGES BASED ON U-NET IN URBAN AREA

被引:3
作者
Majidizadeh, A. [1 ]
Hasani, H. [1 ]
Jafari, M. [1 ]
机构
[1] Tafresh Univ, Dept Geodesy & Surveying Engn, Tafresh, Iran
来源
ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4 | 2023年
关键词
Semantic Segmentation; UAV; Deep Learning; Convolutional Neural Network; Encoder-Decoder Architecture; UNet; DETECTING CARS; DEEP; EXTRACTION; VEHICLE;
D O I
10.5194/isprs-annals-X-4-W1-2022-451-2023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation of aerial data has been one of the leading researches in the field of photogrammetry, remote sensing, and computer vision in recent years. Many applications, including airborne mapping of urban scenes, object positioning in aerial images, automatic extraction of buildings from remote sensing or high-resolution aerial images, etc., require accurate and efficient segmentation algorithms. According to the high potential of deep learning algorithms in the classification of complex scenes, this paper aims to train a deep learning model to evaluate the semantic segmentation accuracy of UAV-based images in urban areas. The proposed method implements a deep learning framework based on the U-Net encoder-decoder architecture, which extracts and classifies features through layers of convolution, max pooling, activation, and concatenation in an end-to-end process. The obtained results compare with two traditional machine learning models, Random Forest (RF) and Multi-Layer Perceptron (MLP). They rely on two steps that involve extracting features and classifying images. In this study, the experiments are performed on the UAVid2020 semantic segmentation dataset from the ISPRS database. Results show the effectiveness of the proposed deep learning framework, so that the U-Net architecture achieved the best results with 75.15% overall accuracy, compared to RF and MLP algorithms with 52.51% and 54.65% overall accuracy, respectively.
引用
收藏
页码:451 / 457
页数:7
相关论文
共 32 条
[1]   Deep Learning Approach for Car Detection in UAV Imagery [J].
Ammour, Nassim ;
Alhichri, Haikel ;
Bazi, Yakoub ;
Benjdira, Bilel ;
Alajlan, Naif ;
Zuair, Mansour .
REMOTE SENSING, 2017, 9 (04)
[2]   Drone Image Segmentation Using Machine and Deep Learning for Mapping Raised Bog Vegetation Communities [J].
Bhatnagar, Saheba ;
Gill, Laurence ;
Ghosh, Bidisha .
REMOTE SENSING, 2020, 12 (16)
[3]   Vehicle Detection in Satellite Images by Hybrid Deep Convolutional Neural Networks [J].
Chen, Xueyun ;
Xiang, Shiming ;
Liu, Cheng-Lin ;
Pan, Chun-Hong .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2014, 11 (10) :1797-1801
[4]   Neural Networks and Neuroscience-Inspired Computer Vision [J].
Cox, David Daniel ;
Dean, Thomas .
CURRENT BIOLOGY, 2014, 24 (18) :R921-R929
[5]   What is a good evaluation measure for semantic segmentation? [J].
Csurka, Gabriela ;
Larlus, Diane ;
Perronnin, Florent .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[6]   A survey on deep learning techniques for image and video semantic segmentation [J].
Garcia-Garcia, Alberto ;
Orts-Escolano, Sergio ;
Oprea, Sergiu ;
Villena-Martinez, Victor ;
Martinez-Gonzalez, Pablo ;
Garcia-Rodriguez, Jose .
APPLIED SOFT COMPUTING, 2018, 70 :41-65
[7]   Semantic segmentation of UAV aerial videos using convolutional neural networks [J].
Girisha, S. ;
Pai, Manohara M. M. ;
Verma, Ujjwal ;
Pai, Radhika M. .
2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, :21-27
[8]   Text-Attentional Convolutional Neural Network for Scene Text Detection [J].
He, Tong ;
Huang, Weilin ;
Qiao, Yu ;
Yao, Jian .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (06) :2529-2541
[9]   Computer Vision and Deep Learning Techniques for the Analysis of Drone-Acquired Forest Images, a Transfer Learning Study [J].
Kentsch, Sarah ;
Caceres, Maximo Larry Lopez ;
Serrano, Daniel ;
Roure, Ferran ;
Diez, Yago .
REMOTE SENSING, 2020, 12 (08)
[10]  
Khan SH, 2014, LECT NOTES COMPUT SC, V8689, P679, DOI 10.1007/978-3-319-10590-1_44