Building Area Estimation in Drone Aerial Images Based on Mask R-CNN

被引:17
作者
Chen, Jun [1 ,2 ]
Wang, Ganbei [1 ,2 ]
Luo, Linbo [3 ]
Gong, Wenping [4 ]
Cheng, Zhan [4 ]
机构
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Sch Mech Engn & Elect Informat, Wuhan 430074, Peoples R China
[4] China Univ Geosci, Fac Engn, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Drones; Buildings; Image segmentation; Satellites; Task analysis; Feature extraction; Remote sensing; Building area; drone aerial images; Mask R-CNN; transfer learning; NETWORK;
D O I
10.1109/LGRS.2020.2988326
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In rural areas where disasters occur frequently, the calculation of building areas is crucial in property assessment. In the segmentation algorithm, Mask R-CNN can distinguish the adjacent objects and extract the outline of an object. Based on this observation, we propose a novel method to calculate the building areas based on Mask R-CNN and adopt the concept of transfer learning to train our model, which can achieve good results with a small number of drone aerial images as training samples. The proposed method involves three main steps: 1) pretraining using open-source satellite remote sensing images; 2) fine-tuning with a small number of drone aerial images; and 3) testing with new images and area calculation based on the number of building pixels. The experiments show that the proposed method can achieve good results in terms of F1 score and intersection over union.
引用
收藏
页码:891 / 894
页数:4
相关论文
共 25 条
[1]  
[Anonymous], 2017, ARXIV170905932
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]  
Girshick R., 2017, P IEEE C COMP VIS PA, DOI [DOI 10.1109/CVPR.2017.106, 10.1109/CVPR.2017.106]
[5]   DensePose: Dense Human Pose Estimation In The Wild [J].
Guler, Riza Alp ;
Neverova, Natalia ;
Kokkinos, Lasonas .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7297-7306
[6]  
He K, 2016, European conference on computer vision, P630, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]
[7]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[8]   BUILDING EXTRACTION FROM MULTI-SOURCE REMOTE SENSING IMAGES VIA DEEP DECONVOLUTION NEURAL NETWORKS [J].
Huang, Zuming ;
Cheng, Guangliang ;
Wang, Hongzhen ;
Li, Haichang ;
Shi, Limin ;
Pan, Chunhong .
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, :1835-1838
[9]   TernausNetV2: Fully convolutional network for Instance Segmentation [J].
Iglovikov, Vladimir ;
Seferbekov, Selim ;
Buslaev, Alexander ;
Shvets, Alexey .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :228-232
[10]  
Li Y., 2016, ARXIV161107709