FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding

被引:144
作者
Rahnemoonfar, Maryam [1 ]
Chowdhury, Tashnim [1 ]
Sarkar, Argho [1 ]
Varshney, Debvrat [1 ]
Yari, Masoud [1 ]
Murphy, Robin Roberson [2 ]
机构
[1] Univ Maryland, Comp Vis & Remote Sensing Lab, Bina Lab, Baltimore, MD 21250 USA
[2] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
关键词
Image segmentation; Semantics; Task analysis; Satellites; Computer vision; Image resolution; Buildings; Artificial intelligence; deep learning; hurricane Harvey; image classification; machine learning; natural disaster dataset; remote sensing; semantic segmentation; unmanned aerial vehicle (UAV); visual question answering; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3090981
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual scene understanding is the core task in making any crucial decision in any computer vision system. Although popular computer vision datasets like Cityscapes, MS-COCO, PASCAL provide good benchmarks for several tasks (e.g. image classification, segmentation, object detection), these datasets are hardly suitable for post disaster damage assessments. On the other hand, existing natural disaster datasets include mainly satellite imagery which has low spatial resolution and a high revisit period. Therefore, they do not have a scope to provide quick and efficient damage assessment tasks. Unmanned Aerial Vehicle (UAV) can effortlessly access difficult places during any disaster and collect high resolution imagery that is required for aforementioned tasks of computer vision. To address these issues we present a high resolution UAV imagery, FloodNet, captured after the hurricane Harvey. This dataset demonstrates the post flooded damages of the affected areas. The images are labeled pixel-wise for semantic segmentation task and questions are produced for the task of visual question answering. FloodNet poses several challenges including detection of flooded roads and buildings and distinguishing between natural water and flooded water. With the advancement of deep learning algorithms, we can analyze the impact of any disaster which can make a precise understanding of the affected areas. In this paper, we compare and contrast the performances of baseline methods for image classification, semantic segmentation, and visual question answering on our dataset. FloodNet dataset can be downloaded from here: https://github.com/BinaLab/FloodNet-Supervised_v1.0.
引用
收藏
页码:89644 / 89654
页数:11
相关论文
共 78 条
[1]  
Abacha A. B., 2019, P CLEF WORK NOTES
[2]   Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering [J].
Anderson, Peter ;
He, Xiaodong ;
Buehler, Chris ;
Teney, Damien ;
Johnson, Mark ;
Gould, Stephen ;
Zhang, Lei .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6077-6086
[3]  
[Anonymous], 2017, P MEDIAEVAL
[4]  
[Anonymous], P 3 INT C LEARNING R
[5]  
[Anonymous], Simple baseline for visual question answering
[6]  
[Anonymous], 2018, ADV NEURAL INFORM PR
[7]   VQA: Visual Question Answering [J].
Antol, Stanislaw ;
Agrawal, Aishwarya ;
Lu, Jiasen ;
Mitchell, Margaret ;
Batra, Dhruv ;
Zitnick, C. Lawrence ;
Parikh, Devi .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2425-2433
[8]   A Convolutional Neural Network Approach for Assisting Avalanche Search and Rescue Operations with UAV Imagery [J].
Bejiga, Mesay Belete ;
Zeggada, Abdallah ;
Nouffidj, Abdelhamid ;
Melgani, Farid .
REMOTE SENSING, 2017, 9 (02)
[9]   MUTAN: Multimodal Tucker Fusion for Visual Question Answering [J].
Ben-younes, Hedi ;
Cadene, Remi ;
Cord, Matthieu ;
Thome, Nicolas .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2631-2639
[10]  
Benjamin B., 2018, P MEDIAEVAL SOPH ANT