Performance evaluation of NLP and CNN models for disaster detection using social media data

被引:0
作者
Islam, Md. Azharul [1 ]
Rabbi, Fazla [2 ]
Hossain, Niamat Ullah Ibne [2 ]
机构
[1] SUNY Buffalo, Sch Mech & Aerosp Engn, Buffalo, NY 14228 USA
[2] Arkansas State Univ, Coll Engn & Comp Sci, Engn Management Dept, Jonesboro, AR 72401 USA
关键词
Disaster response; Natural language processing (NLP); Convolutional Neural Networks (CNN); Text Classification; Image Classification; EVENT DETECTION;
D O I
10.1007/s13278-024-01374-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The use of social media data for disaster-type identification has been turning progressively important in recent years. With the extensive dependency on social networking sites, people can share real-time information and updates about disasters, making it a valuable source of information for disaster management organizations. The use of natural language processing (NLP) and computer vision techniques can help process and examine large amounts of social media data to gain valuable insights into the nature and extent of a disaster. In this study, NLP, and convolutional neural networks (CNN) models were applied to social media data for disaster-type recognition. The language models used were BERT-Base-Uncased, DistilBERT-Base-Uncased, Twitter-RoBERTa-Base, and FinBERT. Two convolutional neural network (CNN) models, Inception v3 and DenseNet were also applied. The models were evaluated on the CrisisMMD dataset. The outcome proved that the language models achieved a uniform accuracy of 94% across disaster-related tweet classification tasks, while DistilBERT-Base-Uncased demonstrated the fastest training and testing time which is important for prompt response systems. In terms of the CNN models, DenseNet outperformed Inception v3 just by a small margin of 1 or 2% in terms of accuracy, recall, precision, and F1 score. This entails that the DistilBERT-Base-Uncased and DenseNet model has the potential to be better suited for disaster-type recognition using social media data in terms of accuracy and time.
引用
收藏
页数:17
相关论文
共 76 条
[1]  
Aipe A., 2018, P 15 ISCRAM C
[2]  
Alam F, 2017, INT C ADV SOC NETW A
[3]   Processing Social Media Images by Combining Human and Machine Computing during Crises [J].
Alam, Firoj ;
Ofli, Ferda ;
Imran, Muhammad .
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2018, 34 (04) :311-327
[4]  
Amit SNKB, 2017, 2017 INTERNATIONAL ELECTRONICS SYMPOSIUM ON KNOWLEDGE CREATION AND INTELLIGENT COMPUTING (IES-KCIC), P239, DOI 10.1109/KCIC.2017.8228593
[5]  
Amit SNKB, 2016, INT GEOSCI REMOTE SE, P5189, DOI 10.1109/IGARSS.2016.7730352
[6]  
ASHKTORAB Z, 2014, ISCRAM, P269, DOI DOI 10.1145/1835449.1835643
[7]  
Ashktorab Z., 2014, P 11 INT INFORM SYST
[8]   MBi-GRUMCONV: A novel Multi Bi-GRU and Multi CNN-Based deep learning model for social media sentiment analysis [J].
Basarslan, Muhammet Sinan ;
Kayaalp, Fatih .
JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2023, 12 (01)
[9]  
Beigi G, 2016, STUD COMPUT INTELL, V639, P313, DOI 10.1007/978-3-319-30319-2_13
[10]  
Bischke B, 2017, MediaEval'17