Comparative analysis with topic modeling and word embedding methods after the Aegean Sea earthquake on Twitter

被引:1
作者
Eliguzel, Nazmiye [1 ]
Cetinkaya, Cihan [2 ]
Dereli, Turkay [3 ]
机构
[1] Gaziantep Islam Sci & Technol Univ, Dept Ind Engn, TR-27010 Gaziantep, Turkey
[2] Adana Alparslan Turkes Sci & Technol Univ, Dept Management Informat Syst, TR-01250 Adana, Turkey
[3] Hasan Kalyoncu Univ, Off President, Gaziantep, Turkey
关键词
Aegean Sea; Clustering; Earthquake; Topic modeling; Twitter; Word embedding;
D O I
10.1007/s12530-022-09450-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic detection from Twitter is a significant task that provides insight into real-time information. Recently, word embedding methods and topic modeling techniques have been utilized to find latent topics in various fields. Detecting topics leads to effective semantic structure and provides a better understanding of users. In the proposed study, different types of topic detection techniques are utilized, which are latent semantic analysis (LSA), Word2Vec, and latent Dirichlet allocation (LDA), and their performances are evaluated by the implementation of the K-means clustering technique on a real life application. In this case study, tweets were gathered after an earthquake with a magnitude of 6.6 on the Richter scale that took place on October 30, 2020, on the coast of the Aegean Sea (Izmir), Turkey. Tweets are clustered under fifteen hashtags separately, and the aforementioned techniques are applied to data-sets which vary in size. Therefore, the novelty of the proposed paper can be expressed as the comparison of different topic models and word embedding methods implemented for different sizes of documents in order to demonstrate the performance of these methods. While Word2Vec gives good results in small data-sets, LDA generally gives better results than Word2Vec and LSA in medium and large data-sets. Another aim of the proposed study is to provide information to decision makers for supporting victims and society. Therefore, the general situation of society is analyzed and society's attitude is demonstrated for decision-makers to take actionable activities such as psychological support, educational support, financial support, and political activities, etc.
引用
收藏
页码:245 / 261
页数:17
相关论文
共 46 条
[1]  
Ahanin Z., 2020, Int. J. Technol. Manage. Inform. Sys., V2, P102
[2]   Enhancing topic clustering for Arabic security news based on k-means and topic modelling [J].
Alharbi, Adel R. ;
Hijji, Mohammad ;
Aljaedi, Amer .
IET NETWORKS, 2021, 10 (06) :278-294
[3]   Traffic accident detection and condition analysis based on social networking data [J].
Ali, Farman ;
Ali, Amjad ;
Imran, Muhammad ;
Naqvi, Rizwan Ali ;
Siddiqi, Muhammad Hameed ;
Kwak, Kyung-Sup .
ACCIDENT ANALYSIS AND PREVENTION, 2021, 151
[4]   An intelligent healthcare monitoring framework using wearable sensors and social networking data [J].
Ali, Farman ;
El-Sappagh, Shaker ;
Islam, S. M. Riazul ;
Ali, Amjad ;
Attique, Muhammad ;
Imran, Muhammad ;
Kwak, Kyung-Sup .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 114 :23-43
[5]   Transportation sentiment analysis using word embedding and ontology-based topic modeling [J].
Ali, Farman ;
Kwak, Daehan ;
Khan, Pervez ;
El-Sappagh, Shaker ;
Ali, Amjad ;
Ullah, Sana ;
Kim, Kye Hyun ;
Kwak, Kyung-Sup .
KNOWLEDGE-BASED SYSTEMS, 2019, 174 :27-42
[6]   Fuzzy Ontology and LSTM-Based Text Mining: A Transportation Network Monitoring System for Assisting Travel [J].
Ali, Farman ;
El-Sappagh, Shaker ;
Kwak, Daehan .
SENSORS, 2019, 19 (02)
[7]   The interpretation of dream meaning: Resolving ambiguity using Latent Semantic Analysis in a small corpus of text [J].
Altszyler, Edgar ;
Ribeiro, Sidarta ;
Sigman, Mariano ;
Fernandez Slezak, Diego .
CONSCIOUSNESS AND COGNITION, 2017, 56 :178-187
[8]  
Ancheta Jeffrey Rosario, 2020, 2020 12th International Conference on Knowledge and Smart Technology (KST), P103, DOI 10.1109/KST48564.2020.9059411
[9]  
[Anonymous], 2015, 2015 AI ROBOT IRANOP, DOI DOI 10.1109/RIOS.2015.7270727
[10]  
Bastas Nikolaos, 2019, 2019 European Intelligence and Security Informatics Conference (EISIC). Proceedings, P54, DOI 10.1109/EISIC49498.2019.9108898