Automated and Interpretable Fake News Detection With Explainable Artificial Intelligence

被引:1
作者
Giri, Moyank [1 ]
Eswaran, Sivaraman [2 ]
Honnavalli, Prasad [1 ]
Daniel, D. [3 ]
机构
[1] PES Univ, Res Ctr Informat Secur Forens & Cyber Resilience, Bangalore, Karnataka, India
[2] Curtin Univ, Dept Elect & Comp Engn, Miri, Sarawak, Malaysia
[3] CHRIST, Dept Comp Sci & Engn, Bangalore, Karnataka, India
关键词
Convolution Neural Network; Decision Tree; explainable AI; ensemble model; error level analysis; Na & iuml; ve Bayes classifier; Random Forest classifier;
D O I
10.1080/19361610.2024.2356431
中图分类号
DF [法律]; D9 [法律];
学科分类号
0301 ;
摘要
Fake news is a piece of misleading or forged information that affects society, business, governments, etc., hence is an imperative issue. The solution presented here to detect fake news involves purely using rigorous machine learning approaches in implementing a hybrid of simple yet accurate fake text detection models and fake image detection models to detect fake news. The solution considers the text and images of any news article, extracted using web scraping, where the text segment of a news article is analyzed using an ensemble model of the Na & iuml;ve Bayes, Random Forest, and Decision Tree classifier, which showed improved results than the individual models. The image segment of a news article is analyzed using only a Convolution Neural Network, which showed optimal accuracy similar to the text model. To better train the text models, data preprocessing and aggregation methods were used to combine various fake-real news datasets to have ample amounts of data. Similarly, the CASIA dataset was used to train the image model, over which Error Level Analysis was performed to detect fake images. model results are represented as confusion matrices and are measured using various performance metrics. Also, to explain predictions from the hybrid model, Explainable Artificial Intelligence is used.
引用
收藏
页码:628 / 648
页数:21
相关论文
共 17 条
  • [1] [Anonymous], 2005, AI 2004: Advances in artificial intelligence
  • [2] Beer Dylan, 2021, Integrated Science in Digital Age 2020. Lecture Notes in Networks and Systems (LNNS 136), P13, DOI 10.1007/978-3-030-49264-9_2
  • [3] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [4] Burkhardt J. M., 2017, Libr. Technol. Rep, V53, P5
  • [5] A Topic-Agnostic Approach for Identifying Fake News Pages
    Castelo, Sonia
    Almeida, Thais
    Elghafari, Anas
    Santos, Aecio
    Pham, Kien
    Nakamura, Eduardo
    Freire, Juliana
    [J]. COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2019 ), 2019, : 975 - 980
  • [6] Chen Y., 2015, Proceedings of the Association for Information Science and Technology, V52, P1, DOI [DOI 10.1002/PRA2.2015.145052010081, 10.1002/pra2.2015.145052010081]
  • [7] Dieber J., 2020, arXiv, DOI DOI 10.48550/ARXIV.2012.00093
  • [8] The current state of fake news: challenges and opportunities
    Figueira, Alvaro
    Oliveira, Luciana
    [J]. CENTERIS 2017 - INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS / PROJMAN 2017 - INTERNATIONAL CONFERENCE ON PROJECT MANAGEMENT / HCIST 2017 - INTERNATIONAL CONFERENCE ON HEALTH AND SOCIAL CARE INFORMATION SYSTEMS AND TECHNOLOGIES, CENTERI, 2017, 121 : 817 - 825
  • [9] Toward Automated Fact-Checking: Detecting Check-worthy Factual Claims by ClaimBuster
    Hassan, Naeemul
    Arslan, Fatma
    Li, Chengkai
    Tremayne, Mark
    [J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1803 - 1812
  • [10] Horne B., 2017, P INT AAAI C WEB SOC, DOI DOI 10.1609/ICWSM.V11I1.14976