Linguistic features based framework for automatic fake news detection

Cited by: 29
Authors
Garg, Sonal [1]
Sharma, Dilip Kumar [1]
Affiliations
[1] GLA Univ, Mathura, India
Keywords
Artificial Intelligence; Linguistic features; Machine-learning; Statistical Measure; Text classification; DECEPTION; CUES
DOI
10.1016/j.cie.2022.108432
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Social media platforms are now widely used for news consumption. Political groups use these platforms to attract users and sway their votes. Given the large volume of data on social media, it is essential to verify the authenticity of the content. Combating misinformation requires artificial intelligence techniques, including the development of embeddings and the deployment of machine-learning algorithms. This paper focuses on several categories of linguistic features, covering complexity features, readability indices, psycholinguistic features, and stylometric features, for effective fake news identification. The linguistic model computes language-driven features by learning the properties of news content. In this work, we selected twenty-six significant features and applied various machine-learning models. For feature extraction, three techniques are applied: term frequency-inverse document frequency (tf-idf), count vectorizer (CV), and hash vectorizer (HV). We then tested the models on training sets of different sizes to obtain and compare the accuracy of each model. We used four existing datasets for the experiments. The proposed framework achieved 90.8% accuracy on the Reuter dataset and a highest accuracy of 90% on the Buzzfeed dataset. On the Random Political and Mc_Intire datasets it achieved accuracies of 93.8% and 86.9%, respectively.
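The three feature-extraction techniques named in the abstract can be illustrated with a minimal, hand-rolled Python sketch. This is not the authors' implementation: the tokenizer, the unsmoothed `log(N / df)` weighting, the CRC32-based bucketing, the bucket count of 8, and the two toy headlines are all assumptions chosen for illustration, and library implementations typically use smoothed or normalized variants.

```python
import math
import re
import zlib
from collections import Counter


def tokenize(text):
    """Lowercase and keep alphabetic runs (an assumed, deliberately simple tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def count_vectors(docs):
    """Count vectorizer: one column per vocabulary term, holding raw term counts."""
    tokenized = [tokenize(d) for d in docs]
    vocab = sorted({t for toks in tokenized for t in toks})
    index = {term: i for i, term in enumerate(vocab)}
    rows = []
    for toks in tokenized:
        row = [0] * len(vocab)
        for term, n in Counter(toks).items():
            row[index[term]] = n
        rows.append(row)
    return vocab, rows


def tfidf_vectors(docs):
    """tf-idf: term count weighted by log(N / df), the plain textbook form."""
    vocab, counts = count_vectors(docs)
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = [sum(1 for row in counts if row[j] > 0) for j in range(len(vocab))]
    idf = [math.log(n_docs / d) for d in df]
    return vocab, [[c * w for c, w in zip(row, idf)] for row in counts]


def hash_vectors(docs, n_buckets=8):
    """Hash vectorizer: no stored vocabulary; a stable hash picks each token's bucket."""
    rows = []
    for d in docs:
        row = [0] * n_buckets
        for tok in tokenize(d):
            row[zlib.crc32(tok.encode("utf-8")) % n_buckets] += 1
        rows.append(row)
    return rows


# Toy headlines, invented for this example.
docs = [
    "Shocking secret the government hides",
    "Government publishes annual budget report",
]
vocab, cv = count_vectors(docs)
_, tfidf = tfidf_vectors(docs)
hv = hash_vectors(docs)
```

In this sketch a term that appears in every document, such as "government", receives a tf-idf weight of zero, while the hash vectors keep a fixed width regardless of vocabulary size at the cost of possible bucket collisions.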
Pages: 12