Performance evaluation of NLP and CNN models for disaster detection using social media data

被引:0
作者
Islam, Md. Azharul [1 ]
Rabbi, Fazla [2 ]
Hossain, Niamat Ullah Ibne [2 ]
机构
[1] SUNY Buffalo, Sch Mech & Aerosp Engn, Buffalo, NY 14228 USA
[2] Arkansas State Univ, Coll Engn & Comp Sci, Engn Management Dept, Jonesboro, AR 72401 USA
关键词
Disaster response; Natural language processing (NLP); Convolutional Neural Networks (CNN); Text Classification; Image Classification; EVENT DETECTION;
D O I
10.1007/s13278-024-01374-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The use of social media data for disaster-type identification has been turning progressively important in recent years. With the extensive dependency on social networking sites, people can share real-time information and updates about disasters, making it a valuable source of information for disaster management organizations. The use of natural language processing (NLP) and computer vision techniques can help process and examine large amounts of social media data to gain valuable insights into the nature and extent of a disaster. In this study, NLP, and convolutional neural networks (CNN) models were applied to social media data for disaster-type recognition. The language models used were BERT-Base-Uncased, DistilBERT-Base-Uncased, Twitter-RoBERTa-Base, and FinBERT. Two convolutional neural network (CNN) models, Inception v3 and DenseNet were also applied. The models were evaluated on the CrisisMMD dataset. The outcome proved that the language models achieved a uniform accuracy of 94% across disaster-related tweet classification tasks, while DistilBERT-Base-Uncased demonstrated the fastest training and testing time which is important for prompt response systems. In terms of the CNN models, DenseNet outperformed Inception v3 just by a small margin of 1 or 2% in terms of accuracy, recall, precision, and F1 score. This entails that the DistilBERT-Base-Uncased and DenseNet model has the potential to be better suited for disaster-type recognition using social media data in terms of accuracy and time.
引用
收藏
页数:17
相关论文
共 76 条
[61]   Identifying disaster-related tweets and their semantic, spatial and temporal context using deep learning, natural language processing and spatial analysis: a case study of Hurricane Irma [J].
Sit, Muhammed Ali ;
Koylu, Caglar ;
Demir, Ibrahim .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2019, 12 (11) :1205-1229
[62]   Automated Disaster Monitoring From Social Media Posts Using AI-Based Location Intelligence and Sentiment Analysis [J].
Sufi, Fahim K. ;
Khalil, Ibrahim .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) :4614-4624
[63]   AI-Landslide: Software for acquiring hidden insights from global landslide data using Artificial Intelligence [J].
Sufi, Fahim K. .
SOFTWARE IMPACTS, 2021, 10
[64]  
Szegedy C, 2015, Arxiv, DOI arXiv:1512.00567
[65]   A Critical Cybersecurity Analysis and Future Research Directions for the Internet of Things: A Comprehensive Review [J].
Tariq, Usman ;
Ahmed, Irfan ;
Bashir, Ali Kashif ;
Shaukat, Kamran .
SENSORS, 2023, 23 (08)
[66]  
Vaswani A, 2017, ADV NEUR IN, V30
[67]  
Vigna F. D., 2015, PUblication MAnagement
[68]   Multi-filed data fusion through attention-based networks for readiness prediction in aircraft maintenance: natural language processing (NLP) approach [J].
Wang, Yibin ;
Jaradat, Raed ;
Wang, Haifeng ;
Ibne Hossain, Niamat Ullah .
INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, 2025, 20 (01) :54-64
[69]   Spatial, temporal, and content analysis of Twitter for wildfire hazards [J].
Wang, Zheye ;
Ye, Xinyue ;
Tsou, Ming-Hsiang .
NATURAL HAZARDS, 2016, 83 (01) :523-540
[70]  
Wolf T, 2020, Arxiv, DOI arXiv:1910.03771