Machine Learning Methods to Predict Social Media Disaster Rumor Refuters

被引:23
|
作者
Wang, Shihang [1 ]
Li, Zongmin [1 ]
Wang, Yuhong [2 ]
Zhang, Qi [1 ]
机构
[1] Sichuan Univ, Business Sch, Chengdu 610064, Sichuan, Peoples R China
[2] Sichuan Normal Univ, Coll Movie & Media, Chengdu 610064, Sichuan, Peoples R China
基金
中国博士后科学基金;
关键词
rumor refutation; disaster-related; NLP; machine learning; XGBoost; group behavior; SPREADING MODEL; PROPAGATION;
D O I
10.3390/ijerph16081452
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This research provides a general methodology for distinguishing disaster-related anti-rumor spreaders from a non-ignorant population base, with strong connections in their social circle. Several important influencing factors are examined and illustrated. User information from the most recent posted microblog content of 3793 Sina Weibo users was collected. Natural language processing (NLP) was used for the sentiment and short text similarity analyses, and four machine learning techniques, i.e., logistic regression (LR), support vector machines (SVM), random forest (RF), and extreme gradient boosting (XGBoost) were compared on different rumor refuting microblogs; after which a valid and robust distinguishing XGBoost model was trained and validated to predict who would retweet disaster-related rumor refuting microblogs. Compared with traditional prediction variables that only access user information, the similarity and sentiment analyses of the most recent user microblog contents were found to significantly improve prediction precision and robustness. The number of user microblogs also proved to be a valuable reference for all samples during the prediction process. This prediction methodology could be possibly more useful for WeChat or Facebook as these have relatively stable closed-loop communication channels, which means that rumors are more likely to be refuted by acquaintances. Therefore, the methodology is going to be further optimized and validated on WeChat-like channels in the future. The novel rumor refuting approach presented in this research harnessed NLP for the user microblog content analysis and then used the analysis results of NLP as additional prediction variables to identify the anti-rumor spreaders. Therefore, compared to previous studies, this study presents a new and effective decision support for rumor countermeasures.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Using Machine Learning Methods to Predict Autism Syndrome
    Alhakami, Hosam
    Alajlani, Fatimah
    Alghamdi, Alshymaa
    Baz, Abdullah
    Alsubait, Tahani
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2020, 20 (04): : 221 - 228
  • [22] The Comparison of Machine-Learning Methods XGBoost and LightGBM to Predict Energy Development
    Nemeth, Martin
    Borkin, Dmitrii
    Michalconok, German
    COMPUTATIONAL STATISTICS AND MATHEMATICAL MODELING METHODS IN INTELLIGENT SYSTEMS, VOL. 2, 2019, 1047 : 208 - 215
  • [23] Suitability of Machine Learning Methods for Prediction of Popularity on Social Media in Comparison of Different Data Sets
    Hruska, Jan
    HRADEC ECONOMIC DAYS, VOL 11(1), 2021, 11 : 251 - 259
  • [24] Stability analysis of an SDILR model based on rumor recurrence on social media
    Yao, Yao
    Xiao, Xi
    Zhang, Chengping
    Dou, Changsheng
    Xia, Shutao
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 535
  • [25] Combining machine-learning topic models and spatiotemporal analysis of social media data for disaster footprint and damage assessment
    Resch, Bernd
    Uslander, Florian
    Havas, Clemens
    CARTOGRAPHY AND GEOGRAPHIC INFORMATION SCIENCE, 2018, 45 (04) : 362 - 376
  • [26] Machine Learning and Social Media Harvesting for Wildfire Prevention
    Laksito, Arif Dwi
    Kusrini, Kusrini
    Setyanto, Arief
    Johari, Muhammad Zuhdi Fikri
    Maruf, Zauvik Rizaldi
    Yuana, Kumara Ari
    Adninda, Gardyas Bidari
    Kartikakirana, Renindya Azizza
    Nucifera, Fitria
    Widayani, Wiwi
    Chandramouli, Krishna
    Ezquierdo, Ebroul
    2023 IEEE 13TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS, ICPRS, 2023,
  • [27] Machine learning algorithms for social media analysis: A survey
    Balaji, T. K.
    Annavarapu, Chandra Sekhara Rao
    Bablani, Annushree
    COMPUTER SCIENCE REVIEW, 2021, 40
  • [28] Ensemble Deep Learning on Time-Series Representation of Tweets for Rumor Detection in Social Media
    Kotteti, Chandra Mouli Madhav
    Dong, Xishuang
    Qian, Lijun
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 21
  • [29] Explainable machine learning methods to predict postpartum depression risk
    Shivaprasad, Susmita
    Chadaga, Krishnaraj
    Sampathila, Niranjana
    Prabhu, Srikanth
    Chadaga, P. Rajagopala
    Swathi, K. S.
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2024, 12 (01)
  • [30] Detecting suicidality on social media: Machine learning at rescue
    Rabani, Syed Tanzeel
    Khanday, Akib Mohi Ud Din
    Khan, Qamar Rayees
    Hajam, Umar Ayoub
    Imran, Ali Shariq
    Kastrati, Zenun
    EGYPTIAN INFORMATICS JOURNAL, 2023, 24 (02) : 291 - 302