Comparative Study Between Real-Time and Non-Real-Time Segmentation Models on Flooding Events

被引:13
作者
Safavi, Farshad [1 ]
Chowdhury, Tashnim [1 ]
Rahnemoonfar, Maryam [1 ]
机构
[1] Univ Maryland Baltimore Cty, Comp Vis & Remote Sensing Lab, Baltimore, MD 21228 USA
来源
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2021年
关键词
Computer Vision; Deep Learning; Convolutional Neural Network(CNN); Aerial Image Segmentation; Real-Time Semantic Segmentation;
D O I
10.1109/BigData52589.2021.9671314
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene understanding of aerial imagery is essential for proper emergency response during catastrophic events such as hurricanes, earthquakes, and floods. Unmanned Aerial Vehicles (UAVs) capture aerial images and analyze the context by passing images into a semantic segmentation model for monitoring damaged areas. However, the state-of-the-art semantic segmentation models are mainly trained and evaluated on ground-based datasets such as Cityscapes, MS-COCO, and CamVid, unsuitable for aerial image segmentations. For example, extracted features from objects in aerial perspective are distinct from objects on the ground view. Hence, neural networks cannot properly segment an aerial scene, especially on deformed or damaged objects during disasters. This research analyzes current semantic segmentation models to explore the feasibility of applying these models for emergency response during catastrophic events. We compare the performance of real-time semantic segmentation models with non-real-time counterparts constrained by aerial images under adversarial settings. Furthermore, we train several models on the FloodNet dataset, containing UAV images captured after Hurricane Harvey, and benchmark their execution on special classes such as flooded-buildings vs. non-flooded buildings or flooded-roads vs. non-flooded roads. In this research, real-time UNet-MobileNetV3 yields 59.3% test mIoU while non-real-time PSPNet [1] attains 79.7% test mIoU on the FloodNet, demonstrating the trade-off between accuracy and efficiency in the segmentation models.
引用
收藏
页码:4199 / 4207
页数:9
相关论文
共 34 条
[1]  
[Anonymous], IEEE T PATTERN ANAL
[2]  
[Anonymous], PROC CVPR IEEE
[3]  
[Anonymous], IEEE T PATTERN ANAL
[4]  
[Anonymous], 2015, IEEE C COMPUTER VISI
[5]  
Badrinarayanan V., 2016, Segnet: A deep convolutional encoder-decoder architecture for image segmentation
[6]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[7]  
Bourdev L, 2010, LECT NOTES COMPUT SC, V6316, P168, DOI 10.1007/978-3-642-15567-3_13
[8]  
Brox T., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2225, DOI 10.1109/CVPR.2011.5995659
[9]  
Chen L.C., 2018, ASIA-PAC J ATMOS SCI, P801, DOI DOI 10.1007/s13143-018-0064-5
[10]  
Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709