Albanian Fake News Detection

被引:10
|
作者
Canhasi, Ercan [1 ]
Shijaku, Rexhep [1 ]
Berisha, Erblin [1 ]
机构
[1] Univ Prizren, POB 1212, Prizren, Kosovo
关键词
Fake news; text categorization; natural language processing; machine learning; corpus construction; DECEPTION;
D O I
10.1145/3487288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed the vast increase of the phenomenon known as the fake news. Among the main reasons for this increase are the continuous growth of internet and social media usage and the real-time information dissemination opportunity offered by them. Deceiving, misleading content, such as the fake news, especially the type made by and for social media users, is becoming eminently hazardous. Hence, the fake news detection problem has become an important research topic. Despite the recent advances in fake news detection, the lack of fake news corpora for the under-resourced languages is compromising the development and the evaluation of existing approaches in these languages. To fill this huge gap, in this article, we investigate the issue of fake news detection for the Albanian language. In it, we present a new public dataset of labeled true and fake news in Albanian and perform an extensive analysis of machine learning methods for fake news detection. We performed a comprehensive feature engineering and feature selection experiments. In doing so, we explored the Albanian language-related feature categories such as the lexical, syntactic, lying-detection, and psycho-linguistic features. Each article was also modeled in four different ways: with the traditional bag-of-words (BoW) and with three distributed text representations using the state-of-the-art Word2Vec, FastText, and BERT methods. Additionally, we investigated the best combination of features and various types of classification methods. The conducted experiments and obtained results from evaluations are finally used to draw some conclusions. They shed light on the potentiality of the methods and the challenges that the Albanian fake news detection presents.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Supervised Learning for Fake News Detection
    Reis, Julio C. S.
    Correia, Andre
    Murai, Fabricio
    Veloso, Adriano
    Benevenuto, Fabricio
    IEEE INTELLIGENT SYSTEMS, 2019, 34 (02) : 76 - 81
  • [22] Sentiment Analysis for Fake News Detection
    Alonso, Miguel A.
    Vilares, David
    Gomez-Rodriguez, Carlos
    Vilares, Jesus
    ELECTRONICS, 2021, 10 (11)
  • [23] A study on the Detection of Fake news in Spanish
    Galvez, Alba Maribel Sanchez
    Albores, Francisco Javier
    Gonzalez, Ricardo Alvarez
    Conde, Said Gonzalez
    Galvez, Sully Sanchez
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2024, 15 (02): : 85 - 94
  • [24] Explore the Style for Fake News Detection
    Wilbert
    Yang, Hui-kuo
    Peng, Wen-chih
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (06) : 1349 - 1361
  • [25] Analysis of Classifiers for Fake News Detection
    Agarwala, Vasu
    Sultanaa, H. Parveen
    Malhotra, Srijan
    Sarkar, Amitrajit
    2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ADVANCED COMPUTING ICRTAC -DISRUP - TIV INNOVATION , 2019, 2019, 165 : 377 - 383
  • [26] Fake News Detection Automating the process
    不详
    CURRENT SCIENCE, 2019, 117 (11): : 1773 - 1773
  • [27] Fake News: The Truth of Fake News
    Coelho da Silva, Rafael Alexandre
    REVISTA INTERNACIONAL DE RELACIONES PUBLICAS, 2021, 11 (21): : 247 - 250
  • [28] Fake News Detection on English News Article's Title
    Castillo, Jaydine M.
    Fadera, Kyla Dann F.
    Ladao, Alexana Alian A.
    Go, Jeline G.
    Tamayo, Melecia B.
    Octaviano Jr, Manolito, V
    2021 1ST INTERNATIONAL CONFERENCE IN INFORMATION AND COMPUTING RESEARCH (ICORE 2021), 2021, : 151 - 156
  • [29] Fake Detect: A Deep Learning Ensemble Model for Fake News Detection
    Aslam, Nida
    Ullah Khan, Irfan
    Alotaibi, Farah Salem
    Aldaej, Lama Abdulaziz
    Aldubaikil, Asma Khaled
    COMPLEXITY, 2021, 2021
  • [30] Fake News Detection Based on Multimodal Inputs
    Liang, Zhiping
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (02): : 4519 - 4534