ArabFake: A Multitask Deep Learning Framework for Arabic Fake News Detection, Categorization, and Risk Prediction

被引:0
作者
Shehata, Ahmed Maher Khafaga [1 ]
Al-Suqri, Mohammed Nasser [1 ]
Osman, Nour Eldin Mohamed Elshaiekh [1 ]
Hamad, Faten [1 ,2 ]
Alhusaini, Yousuf Nasser [3 ]
Mahfouz, Ahmed [3 ,4 ]
机构
[1] Sultan Qaboos Univ, Informat Studies Dept, Seeb 123, Oman
[2] Univ Jordan, Lib & Informat Sci Dept, Amman, Jordan
[3] Arab Open Univ, Fac Comp Studies, Muscat 121, Oman
[4] Minia Univ, Comp Sci Dept, Al Minya 1596, Egypt
关键词
Fake news; Social networking (online); COVID-19; Accuracy; Hate speech; Emojis; Linguistics; Feature extraction; Transformers; Data mining; Fake news detection; misinformation detection; Arabic language; OSNs;
D O I
10.1109/ACCESS.2024.3518204
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The spread of fake news among Arabic media including social media represents a great challenge to the integrity of information and the trust of the public in it. In this paper, we introduce a comprehensive deep-learning framework, named ArabFake, that goes beyond the binary classification on Arabic fake news detection. ArabFake, built over MARBERTv2 (a state-of-the-art model for multi-dialectal Arabic tweets), proficiently address the complexity of the Arabic language while performing three unified tasks which are fake news detection, content categorization and its risk assessment. The framework promotes efficiency and performance both by enabling multi-task learning through shared knowledge representation across tasks. In order to facilitate development and evaluation, we present the ArabFake Dataset consisting of 2,495 manually labelled news items with labels that are verified by experts regarding fake news categories and risk levels. ArabFake demonstrates robust performance, achieving an F1 score of 94.12% for fake news detection, 84.92% for categorization, and 88.91% for risk zone assessment, highlighting its reliability and effectiveness across multiple tasks. We improve interpretability and extract insight into manipulative techniques by integrating valence scoring as part of the framework that emphasizes misleading linguistic cues used to disseminate fake news within the produced image. The results show that ArabFake is a holistic Arabic fake news detection framework that has practical implications on news organizations and fact checking projects.
引用
收藏
页码:191345 / 191360
页数:16
相关论文
共 57 条
[1]   A Hybrid Multitask Learning Framework with a Fire Hawk Optimizer for Arabic Fake News Detection [J].
Abd Elaziz, Mohamed ;
Dahou, Abdelghani ;
Orabi, Dina Ahmed ;
Alshathri, Samah ;
Soliman, Eman M. ;
Ewees, Ahmed A. .
MATHEMATICS, 2023, 11 (02)
[2]  
Abdelali A., 2021, arXiv
[3]  
Abu Farha I, 2019, FOURTH ARABIC NATURAL LANGUAGE PROCESSING WORKSHOP (WANLP 2019), P192
[4]   A Combined CNN and LSTM Model for Arabic Sentiment Analysis [J].
Alayba, Abdulaziz M. ;
Palade, Vasile ;
England, Matthew ;
Iqbal, Rahat .
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, CD-MAKE 2018, 2018, 11015 :179-191
[5]  
AlKhamissi B., 2022, P 5 WORKSH OP SOURC, P186
[6]  
AlKhamissi B., 2022, arXiv
[7]  
AlKhamissi B, 2021, Arxiv, DOI arXiv:2103.01065
[8]   A Literature Review of Textual Hate Speech Detection Methods and Datasets [J].
Alkomah, Fatimah ;
Ma, Xiaogang .
INFORMATION, 2022, 13 (06)
[9]  
Altowayan AA, 2016, 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), P3820, DOI 10.1109/BigData.2016.7841054
[10]   Arab women feelings while seeking information during COVID-19 pandemic: applying PMT constructs [J].
Alwreikat, Asma ;
Shehata, Ahmed ;
Edakar, Metwaly Ali Mohamed .
ONLINE INFORMATION REVIEW, 2022, 46 (03) :525-546