Survey of machine learning techniques for Arabic fake news detection

Cited by: 3
Authors
Touahri, Ibtissam [1]
Mazroui, Azzeddine [2]
Affiliations
[1] Univ Moulay Ismail, Super Sch Technol, Dept Comp Sci, Meknes, Morocco
[2] Mohamed First Univ, Fac Sci, Dept Math & Comp Sci, Oujda, Morocco
Keywords
Arabic; Natural language processing; Fake news detection; Machine learning; Deep learning; STANCE DETECTION; TWEETS;
DOI
10.1007/s10462-024-10778-3
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Social media platforms have emerged as primary information sources, offering easy access to a wide audience. Consequently, a significant portion of the global population relies on these platforms for updates on current events. However, fraudulent actors exploit social networks to disseminate false information, either for financial gain or to manipulate public opinion. Recognizing the detrimental impact of fake news, researchers have turned their attention to automating its detection. In this paper, we provide a thorough review of fake news detection in Arabic, a low-resource language, to contextualize the current state of research in this domain. In our research methodology, we recall fake news terminology, provide examples for clarity, particularly in Arabic contexts, and explore its impact on public opinion. We discuss the challenges in fake news detection, outline the datasets used, and provide Arabic annotation samples for label assignment. We also highlight preprocessing steps that address Arabic language nuances, and we explore features from shared tasks and their implications. Lastly, we address open issues and propose future research directions such as dataset improvement, feature refinement, and increased awareness to combat fake news proliferation. We contend that incorporating our perspective into the examination of fake news aspects, along with suggesting enhancements, sets this survey apart from others currently available.
Pages: 33