D-FEND: A Diffusion-Based Fake News Detection Framework for News Articles Related to COVID-19

被引:3
作者
Han, Soeun [1 ]
Ko, Yunyong [1 ]
Kim, Yushim [2 ]
Oh, Seong Soo [1 ]
Park, Heejin [1 ]
Kim, Sang-Wook [1 ]
机构
[1] Hanyang Univ, Seoul, South Korea
[2] Arizona State Univ, Tempe, AZ 85287 USA
来源
37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING | 2022年
基金
新加坡国家研究基金会;
关键词
fake news detection; diffusion-based detection; COVID-19; dataset; OUT CROSS-VALIDATION; KERNEL;
D O I
10.1145/3477314.3507134
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The social confusion caused by the recent pandemic of COVID-19 has been further facilitated by fake news diffused via social media on the Internet. For this reason, many studies have been proposed to detect fake news as early as possible. The content-based detection methods consider the difference between the contents of true and fake news articles. However, they suffer from the two serious limitations: (1) the publisher can manipulate the content of a news article easily, and (2) the content depends upon the language, with which the article is written. To overcome these limitations, the diffusion-based fake news detection methods have been proposed. The diffusion-based methods consider the difference among the diffusion patterns of true and fake news articles on social media. Despite its success, however, the lack of the diffusion information regarding to the COVID-19 related fake news prevents from studying the diffusion-based fake news detection methods. Therefore, for overcoming the limitation, we propose a diffusion-based fake news detection framework (D-FEND), which consists of four components: (C1) diffusion data collection, (C2) analysis of the data and feature extraction, (C3) model training, and (C4) inference. Our work contributes to the effort to mitigate the risk of infodemics during a pandemic by (1) building a new diffusion dataset, named CoAID+, (2) identifying and addressing the class imbalance problem of CoAID+, and (3) demonstrating that D-FEND successfully detects fake news articles with 88.89% model accuracy on average.
引用
收藏
页码:1771 / 1778
页数:8
相关论文
共 41 条
[1]   Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques [J].
Ahmed, Hadeer ;
Traore, Issa ;
Saad, Sherif .
INTELLIGENT, SECURE, AND DEPENDABLE SYSTEMS IN DISTRIBUTED AND CLOUD ENVIRONMENTS (ISDDC 2017), 2017, 10618 :127-138
[2]  
[Anonymous], BuzzFeedNews dataset taken
[3]  
Badaskar Sameer, 2008, P 3 INT JOINT C NATU
[4]  
Castillo C., 2011, P 20 INT C WORLD WID, P675
[5]  
Cawley GC, 2006, IEEE IJCNN, P1661
[6]   Efficient leave-one-out cross-validation of kernel Fisher discriminant classifiers [J].
Cawley, GC ;
Talbot, NLC .
PATTERN RECOGNITION, 2003, 36 (11) :2585-2592
[7]  
Chawla N. V., 2004, SIGKDD Explor. Newsl., V6, P1
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]  
Cui LM, 2020, Arxiv, DOI arXiv:2006.00885
[10]  
Elhadad Mohamed K., 2021, Advances in Networked-Based Information Systems. 23rd International Conference on Network-Based Information Systems (NBiS-2020). Advances in Intelligent Systems and Computing (AISC 1264), P163, DOI 10.1007/978-3-030-57811-4_16