Bilingual COVID-19 Fake News Detection Based on LDA Topic Modeling and BERT Transformer

被引:1
|
作者
Omrani, Pouria [1 ,3 ]
Ebrahimian, Zahra [2 ,3 ]
Toosi, Ramin [2 ,3 ]
Akhaee, Mohammad Ali [2 ]
机构
[1] K N Toosi Univ Technol, Fac Elect Engn, Tehran, Iran
[2] Univ Tehran, Sch Elect & Comp Engn, Coll Engn, Tehran, Iran
[3] Adak Vira Iranian Rahjoo Co, Tehran, Iran
来源
2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA | 2023年
关键词
BERT Transformer; Topic Modeling; Fake News Detection; COVID-19;
D O I
10.1109/IPRIA59240.2023.10147179
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The spread of fake news has become more prevalent given the popularity of social media and the various news that circulates on it. As a result, it is crucial to discern between real and fake news. During the COVID-19 pandemic, there have been numerous tweets, posts, and news about this illness in social media and electronic media worldwide. This research presents a bilingual model combining Latent Dirichlet Allocation (LDA) topic modeling and the BERT transformer to detect COVID-19 fake news in both Persian and English. First, the dataset is prepared in Persian and English, and then the proposed method is used to detect COVID-19 fake news on the prepared dataset. Finally, the proposed model is evaluated using various metrics such as accuracy, precision, recall, and the f1-score. As a result of this approach, we achieve 92.18% accuracy, which shows that adding topic information to the pre-trained contextual representations given by the BERT network, significantly improves the solving of instances that are domain-specific. Also, the results show that our proposed approach outperforms previous state-of-the-art methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Topic Modeling for Tracking COVID-19 Communication on Twitter
    Bogovic, Petar Kristijan
    Mestrovic, Ana
    Martincic-Ipsic, Sanda
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2022, 2022, 1665 : 248 - 258
  • [42] Multitask Fake News Detection in Arabic Language using AraELECTRA model: COVID-19 Case Study
    Sellami, Meriem
    Hadrouk, Ramzi
    Chelghoum, Sofiane
    Badache, Ramzi
    Kamel, Nadjet
    Lakhfif, Abdelazziz
    2024 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES FOR DISASTER MANAGEMENT, ICT-DM 2024, 2024,
  • [43] COVID-19 fake news detection: A hybrid CNN-BiLSTM-AM model
    Xia, Huosong
    Wang, Yuan
    Zhang, Justin Zuopeng
    Zheng, Leven J.
    Kamal, Muhammad Mustafa
    Arya, Varsha
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2023, 195
  • [44] News Coverage of Face Masks in Australia During the Early COVID-19 Pandemic: Topic Modeling Study
    Dasgupta, Pritam
    Amin, Janaki
    Paris, Cecile
    MacIntyre, C. Raina
    JMIR INFODEMIOLOGY, 2023, 3 (01):
  • [45] Comparing News Articles and Tweets About COVID-19 in Brazil: Sentiment Analysis and Topic Modeling Approach
    de Melo, Tiago
    Figueiredo, Carlos M. S.
    JMIR PUBLIC HEALTH AND SURVEILLANCE, 2021, 7 (02):
  • [46] Lessons Learned from Topic Modeling Analysis of COVID-19 News to Enrich Statistics Education in Korea
    Kang, Seokmin
    Kim, Sungyeun
    SUSTAINABILITY, 2022, 14 (06)
  • [47] Using Topic Modeling and Adversarial Neural Networks for Fake News Video Detection
    Choi, Hyewon
    Ko, Youngjoong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2950 - 2954
  • [48] COVID-19 Concerns in US: Topic Detection in Twitter
    Comito, Carmela
    IDEAS 2021: 25TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, 2021, : 103 - 110
  • [49] Sustainable Consumption in Consumer Behavior in the Time of COVID-19: Topic Modeling on Twitter Data Using LDA
    Brzustewicz, Pawel
    Singh, Anupam
    ENERGIES, 2021, 14 (18)
  • [50] Spatio-temporal approach for classification of COVID-19 pandemic fake news
    Agarwal, I. Y.
    Rana, D. P.
    Shaikh, M.
    Poudel, S.
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)