Comparative Study of Real-Time Semantic Segmentation Networks in Aerial Images During Flooding Events

被引:23
作者
Safavi, Farshad [1 ]
Rahnemoonfar, Maryam [2 ,3 ]
机构
[1] Univ Maryland Baltimore Cty, Comp Sci & Elect Engn, Baltimore, MD 21250 USA
[2] Lehigh Univ, Dept Comp Sci & Engn, Bethlehem, PA 18015 USA
[3] Lehigh Univ, Dept Civil & Environm Engn, Bethlehem, PA 18015 USA
关键词
Deep learning; Semantics; postdisaster assessment; real-time semantic segmentation; unmanned ariel vehicle (UAV) aerial imagery; DEEP; FRAMEWORK; BISENET;
D O I
10.1109/JSTARS.2022.3219724
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Real-time semantic segmentation of aerial imagery is essential for unmanned ariel vehicle applications, including military surveillance, land characterization, and disaster damage assessments. Recent real-time semantic segmentation neural networks promise low computation and inference time, appropriate for resource-limited platforms, such as edge devices. However, these methods are mainly trained on human-centric view datasets, such as Cityscapes and CamVid, unsuitable for aerial applications. Furthermore, we do not know the feasibility of these models under adversarial settings, such as flooding events. To solve these problems, we train the most recent real-time semantic segmentation architectures on the FloodNet dataset containing annotated aerial images captured after hurricane Harvey. This article comprehensively studies several lightweight architectures, including encoder-decoder and two-pathway architectures, evaluating their performance on aerial imagery datasets. Moreover, we benchmark the efficiency and accuracy of different models on the FloodNet dataset to examine the practicability of these models during emergency response for aerial image segmentation. Some lightweight models attain more than 60% test mIoU on the FloodNet dataset and yield qualitative results on images. This article highlights the strengths and weaknesses of current segmentation models for aerial imagery, requiring low computation and inference time. Our experiment has direct applications during catastrophic events, such as flooding events.
引用
收藏
页码:15 / 31
页数:17
相关论文
共 104 条
[1]  
Chen SA, 2018, Arxiv, DOI arXiv:1812.05581
[2]  
[Anonymous], 2014, Drowning
[3]  
[Anonymous], Isprs 2d semantic labeling contest
[4]   Bi-Directional ConvLSTM U-Net with Densley Connected Convolutions [J].
Azad, Reza ;
Asadi-Aghbolaghi, Maryam ;
Fathy, Mahmood ;
Escalera, Sergio .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :406-415
[5]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[6]   Eff-UNet: A Novel Architecture for Semantic Segmentation in Unstructured Environment [J].
Baheti, Bhakti ;
Innani, Shubham ;
Gajre, Suhas ;
Talbar, Sanjay .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :1473-1481
[7]   Comprehensive survey of deep learning in remote sensing: theories, tools, and challenges for the community [J].
Ball, John E. ;
Anderson, Derek T. ;
Chan, Chee Seng .
JOURNAL OF APPLIED REMOTE SENSING, 2017, 11
[8]   Squeeze U-Net: A Memory and Energy Efficient Image Segmentation Network [J].
Beheshti, Nazanin ;
Johnsson, Lennart .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :1495-1504
[9]  
Boykov YY, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, P105, DOI 10.1109/ICCV.2001.937505
[10]  
Cai Z., 2022, IEEE 19 INT S BIOM I, P1