ArabFake: A Multitask Deep Learning Framework for Arabic Fake News Detection, Categorization, and Risk Prediction

被引:0
作者
Shehata, Ahmed Maher Khafaga [1 ]
Al-Suqri, Mohammed Nasser [1 ]
Osman, Nour Eldin Mohamed Elshaiekh [1 ]
Hamad, Faten [1 ,2 ]
Alhusaini, Yousuf Nasser [3 ]
Mahfouz, Ahmed [3 ,4 ]
机构
[1] Sultan Qaboos Univ, Informat Studies Dept, Seeb 123, Oman
[2] Univ Jordan, Lib & Informat Sci Dept, Amman, Jordan
[3] Arab Open Univ, Fac Comp Studies, Muscat 121, Oman
[4] Minia Univ, Comp Sci Dept, Al Minya 1596, Egypt
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Fake news; Social networking (online); COVID-19; Accuracy; Hate speech; Emojis; Linguistics; Feature extraction; Transformers; Data mining; Fake news detection; misinformation detection; Arabic language; OSNs;
D O I
10.1109/ACCESS.2024.3518204
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The spread of fake news among Arabic media including social media represents a great challenge to the integrity of information and the trust of the public in it. In this paper, we introduce a comprehensive deep-learning framework, named ArabFake, that goes beyond the binary classification on Arabic fake news detection. ArabFake, built over MARBERTv2 (a state-of-the-art model for multi-dialectal Arabic tweets), proficiently address the complexity of the Arabic language while performing three unified tasks which are fake news detection, content categorization and its risk assessment. The framework promotes efficiency and performance both by enabling multi-task learning through shared knowledge representation across tasks. In order to facilitate development and evaluation, we present the ArabFake Dataset consisting of 2,495 manually labelled news items with labels that are verified by experts regarding fake news categories and risk levels. ArabFake demonstrates robust performance, achieving an F1 score of 94.12% for fake news detection, 84.92% for categorization, and 88.91% for risk zone assessment, highlighting its reliability and effectiveness across multiple tasks. We improve interpretability and extract insight into manipulative techniques by integrating valence scoring as part of the framework that emphasizes misleading linguistic cues used to disseminate fake news within the produced image. The results show that ArabFake is a holistic Arabic fake news detection framework that has practical implications on news organizations and fact checking projects.
引用
收藏
页码:191345 / 191360
页数:16
相关论文
共 58 条
  • [1] Shehata A.M.K., The problem of misinformation and fake news, Mass Communications and the Influence of Information during Times of Crise Global., pp. 99-122, (2022)
  • [2] Saadany H., Mohamed E.A., Orasan C., Fake or real? A study of Arabic satirical fake news, Proc. 3rd Int.Workshop Rumours Deception Social Media (RDSM), pp. 70-80, (2020)
  • [3] Alwreikat A., Shehata A., Edakar M.A.M., Arab women feelings while seeking information during COVID-19 pandemic: Applying PMT constructs, Online Inf. Rev., 46, 3, pp. 525-546, (2022)
  • [4] Dang H.L., Social media, fake news, and the COVID-19 pandemic: Sketching the case of Southeast Asia, Austrian J. South-East Asian Stud., 14, 1, pp. 37-58, (2021)
  • [5] Schuldt L., Official truths in a war on fake news: Governmental factchecking in Malaysia, Singapore, and Thailand, J. Current Southeast Asian Affairs, 40, 2, pp. 340-371, (2021)
  • [6] Pennycook G., McPhetres J., Zhang Y., Lu J.G., Rand D.G., Fighting COVID-19 misinformation on social media: Experimental evidence for a scalable accuracy-nudge intervention, Psychol. Sci., 31, 7, pp. 770-780, (2020)
  • [7] Su Y., It doesn't take a village to fall for misinformation: Social media use, discussion heterogeneity preference, worry of the virus, faith in scientists, and COVID-19-related misinformation beliefs, Telematics Informat., 58, (2021)
  • [8] Sylvia Chou W.-Y., Gaysynsky A., Cappella J.N., Where we go from here: Health misinformation on social media, Amer. J. Public Health, 110, pp. S273-S275, (2020)
  • [9] Madani Y., Erritali M., Bouikhalene B., Using artificial intelligence techniques for detecting COVID-19 epidemic fake news in Moroccan tweets, Results Phys., 25, (2021)
  • [10] Amer A.Y.A., Siddiqui T., Detection of COVID-19 fake news text data using randomforest and decision tree classifiers, Int. J. Comput. Sci. Inf. Secur. (IJCSIS), 18, 12, pp. 1-24, (2020)