ArabFake: A Multitask Deep Learning Framework for Arabic Fake News Detection, Categorization, and Risk Prediction

被引：0

作者：

Shehata, Ahmed Maher Khafaga ^{[1
]}

Al-Suqri, Mohammed Nasser ^{[1
]}

Osman, Nour Eldin Mohamed Elshaiekh ^{[1
]}

Hamad, Faten ^{[1
,2
]}

Alhusaini, Yousuf Nasser ^{[3
]}

Mahfouz, Ahmed ^{[3
,4
]}

机构：

[1] Sultan Qaboos Univ, Informat Studies Dept, Seeb 123, Oman

[2] Univ Jordan, Lib & Informat Sci Dept, Amman, Jordan

[3] Arab Open Univ, Fac Comp Studies, Muscat 121, Oman

[4] Minia Univ, Comp Sci Dept, Al Minya 1596, Egypt

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Fake news; Social networking (online); COVID-19; Accuracy; Hate speech; Emojis; Linguistics; Feature extraction; Transformers; Data mining; Fake news detection; misinformation detection; Arabic language; OSNs;

D O I：

10.1109/ACCESS.2024.3518204

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The spread of fake news among Arabic media including social media represents a great challenge to the integrity of information and the trust of the public in it. In this paper, we introduce a comprehensive deep-learning framework, named ArabFake, that goes beyond the binary classification on Arabic fake news detection. ArabFake, built over MARBERTv2 (a state-of-the-art model for multi-dialectal Arabic tweets), proficiently address the complexity of the Arabic language while performing three unified tasks which are fake news detection, content categorization and its risk assessment. The framework promotes efficiency and performance both by enabling multi-task learning through shared knowledge representation across tasks. In order to facilitate development and evaluation, we present the ArabFake Dataset consisting of 2,495 manually labelled news items with labels that are verified by experts regarding fake news categories and risk levels. ArabFake demonstrates robust performance, achieving an F1 score of 94.12% for fake news detection, 84.92% for categorization, and 88.91% for risk zone assessment, highlighting its reliability and effectiveness across multiple tasks. We improve interpretability and extract insight into manipulative techniques by integrating valence scoring as part of the framework that emphasizes misleading linguistic cues used to disseminate fake news within the produced image. The results show that ArabFake is a holistic Arabic fake news detection framework that has practical implications on news organizations and fact checking projects.

引用

页码：191345 / 191360

页数：16

共 57 条

[1] A Hybrid Multitask Learning Framework with a Fire Hawk Optimizer for Arabic Fake News Detection [J].