Arabic Fake News Detection: A Fact Checking Based Deep Learning Approach

被引:20
作者
Harrag, Fouzi [1 ]
Djahli, Mohamed Khalil [1 ]
机构
[1] Ferhat Abbas Univ, Coll Sci, Comp Sci Dept, Setif 19000, Algeria
关键词
Fake news detection; fact checking; social media; deep learning; Convolutional Neural Network; Arabic corpus; SOCIAL MEDIA;
D O I
10.1145/3501401
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fake news stories can polarize society, particularly during political events. They undermine confidence in the media in general. Current NLP systems are still lacking the ability to properly interpret and classify Arabic fake news. Given the high stakes involved, determining truth in social media has recently become an emerging research that is attracting tremendous attention. Our literature review indicates that applying the state-of-the-art approaches on news content address some challenges in detecting fake news' characteristics, which needs auxiliary information to make a clear determination. Moreover, the 'Social-context-based' and 'propagation-based' approaches can be either an alternative or complementary strategy to content-based approaches. The main goal of our research is to develop a model capable of automatically detecting truth given an Arabic news or claim. In particular, we propose a deep neural network approach that can classify fake and real news claims by exploiting 'Convolutional Neuron Networks'. Our approach attempts to solve the problem from the fact checking perspective, where the fact-checking task involves predicting whether a given news text claim is factually authentic or fake. We opt to use an Arabic balanced corpus to build our model because it unifies stance detection, stance rationale, relevant document retrieval and fact-checking. The model is trained on different well selected attributes. An extensive evaluation has been conducted to demonstrate the ability of the fact-checking task in detecting the Arabic fake news. Our model outperforms the performance of the state-of-the-art approaches when applied to the same Arabic dataset with the highest accuracy of 91%.
引用
收藏
页数:34
相关论文
共 66 条
  • [51] Rakhmanberdieva Nurzat, 2018, WORD REPR NAT LANG 1
  • [52] Rehling J., 2011, How Natural Language Processing Helps Uncover Social Media Sentiment
  • [53] Riedel B., 2017, arXiv preprint arXiv:1707.03264
  • [54] Saha S, 2018, COMPREHENSIVE GUIDE
  • [55] scikit-learn sklearn.preprocessing, 2019, MULTILABELBINARIZER
  • [56] Beyond News Contents: The Role of Social Context for Fake New Detection
    Shu, Kai
    Wang, Suhang
    Liu, Huan
    [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 312 - 320
  • [57] Spyder, 2018, SPYD OV
  • [58] Steinberg Luc., 2017, FAKE NEWS 10 TYPES M
  • [59] Subedi Nishan, 2018, FASTTEXT HOOD
  • [60] Conspiracy Theories: Evolved Functions and Psychological Mechanisms
    van Prooijen, Jan-Willem
    van Vugt, Mark
    [J]. PERSPECTIVES ON PSYCHOLOGICAL SCIENCE, 2018, 13 (06) : 770 - 788