Fake news detection: Taxonomy and comparative study

被引:28
作者
Farhangian, Faramarz [1 ]
Cruz, Rafael M. O. [1 ]
Cavalcanti, George D. C. [2 ]
机构
[1] Univ Quebec, Ecole Technol Super, Montreal, PQ, Canada
[2] Univ Fed Pernambuco, Ctr Informat, Recife, PE, Brazil
基金
加拿大自然科学与工程研究理事会;
关键词
Disinformation; Misinformation; Machine learning; Deep learning; Natural language processing; Fake news detection;
D O I
10.1016/j.inffus.2023.102140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The proliferation of social networks has presented a significant challenge in combating the pervasive issue of fake news within modern societies. Due to the large amount of information and news produced daily in text, audio, and video, the validation and verification of this information have become crucial tasks. Leveraging advancements in artificial intelligence, distinguishing between fake news and factual information through automatic fake news detection systems has become more feasible. Automatic fake news detection has been explored from diverse perspectives, employing various feature extraction and classification models. Nonetheless, empirical evaluations, categorization, and comparisons of existing techniques for handling this problem remain limited. In this paper, we revisit the definitions and perspectives of fake news and propose an updated taxonomy for the field based on multiple criteria: (1) Type of features used in fake news detection; (2) Fake news detection perspectives; (3) Feature representation methods; and (4) Classification approaches. Moreover, we conduct an extensive empirical study to evaluate several feature representation techniques and classification approaches based on accuracy and computational cost. Our experimental results demonstrate that the optimal feature extraction techniques vary depending on the characteristics of the dataset. Notably, context-dependent models based on transformer models consistently exhibit superior performance. Additionally, employing transformer models as feature extraction methods, rather than solely fine-tuning the network for the downstream task, improves overall performance. Through extensive error analysis, we identify that a combination of feature representation methods and classification algorithms, including classical ones, offer complementary aspects and should be considered for achieving better generalization performance while maintaining a relatively low computational cost. For further details, including source codes, figures, and datasets, please refer to our project's GitHub repository: [https://github.com/FFarhangian/Fake-news-detection-Comparative-Study].
引用
收藏
页数:24
相关论文
共 171 条
[1]  
Abadi M., 2015, TENSORFLOW LARGE SCA
[2]  
Abedalla A., 2019, P 2019 3 INT C ADV A, P24
[3]   Language-Independent Fake News Detection: English, Portuguese, and Spanish Mutual Features [J].
Abonizio, Hugo Queiroz ;
de Morais, Janaina Ignacio ;
Tavares, Gabriel Marques ;
Barbon Junior, Sylvio .
FUTURE INTERNET, 2020, 12 (05)
[4]  
Agarwal A, 2020, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), P1178, DOI [10.1109/iciccs48265.2020.9121030, 10.1109/ICICCS48265.2020.9121030]
[5]   Analysis of Classifiers for Fake News Detection [J].
Agarwala, Vasu ;
Sultanaa, H. Parveen ;
Malhotra, Srijan ;
Sarkar, Amitrajit .
2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 :377-383
[6]   Fake News Detection Using Machine Learning Ensemble Methods [J].
Ahmad, Iftikhar ;
Yousaf, Muhammad ;
Yousaf, Suhail ;
Ahmad, Muhammad Ovais .
COMPLEXITY, 2020, 2020
[7]   Detecting opinion spams and fake news using text classification [J].
Ahmed, Hadeer ;
Traore, Issa ;
Saad, Sherif .
SECURITY AND PRIVACY, 2018, 1 (01)
[8]   Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques [J].
Ahmed, Hadeer ;
Traore, Issa ;
Saad, Sherif .
INTELLIGENT, SECURE, AND DEPENDABLE SYSTEMS IN DISTRIBUTED AND CLOUD ENVIRONMENTS (ISDDC 2017), 2017, 10618 :127-138
[9]  
Ahn YC, 2019, INT JOINT CONF COMP, P289, DOI [10.1109/jcsse.2019.8864171, 10.1109/JCSSE.2019.8864171]
[10]   A Tool for Fake News Detection [J].
Al Asaad, Bashar ;
Erascu, Madalina .
2018 20TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2018), 2019, :379-386