Albanian Fake News Detection

被引:10
|
作者
Canhasi, Ercan [1 ]
Shijaku, Rexhep [1 ]
Berisha, Erblin [1 ]
机构
[1] Univ Prizren, POB 1212, Prizren, Kosovo
关键词
Fake news; text categorization; natural language processing; machine learning; corpus construction; DECEPTION;
D O I
10.1145/3487288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed the vast increase of the phenomenon known as the fake news. Among the main reasons for this increase are the continuous growth of internet and social media usage and the real-time information dissemination opportunity offered by them. Deceiving, misleading content, such as the fake news, especially the type made by and for social media users, is becoming eminently hazardous. Hence, the fake news detection problem has become an important research topic. Despite the recent advances in fake news detection, the lack of fake news corpora for the under-resourced languages is compromising the development and the evaluation of existing approaches in these languages. To fill this huge gap, in this article, we investigate the issue of fake news detection for the Albanian language. In it, we present a new public dataset of labeled true and fake news in Albanian and perform an extensive analysis of machine learning methods for fake news detection. We performed a comprehensive feature engineering and feature selection experiments. In doing so, we explored the Albanian language-related feature categories such as the lexical, syntactic, lying-detection, and psycho-linguistic features. Each article was also modeled in four different ways: with the traditional bag-of-words (BoW) and with three distributed text representations using the state-of-the-art Word2Vec, FastText, and BERT methods. Additionally, we investigated the best combination of features and various types of classification methods. The conducted experiments and obtained results from evaluations are finally used to draw some conclusions. They shed light on the potentiality of the methods and the challenges that the Albanian fake news detection presents.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] AI and Fake News: A Conceptual Framework for Fake News Detection
    Ameli, Leila
    Chowdhury, Md Shah Alam
    Farid, Farnaz
    Bello, Abubakar
    Sabrina, Fariza
    Maurushat, Alana
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON CYBER SECURITY, CSW 2022, 2022, : 34 - 39
  • [2] Multimodal Fake News Detection
    Segura-Bedmar, Isabel
    Alonso-Bartolome, Santiago
    INFORMATION, 2022, 13 (06)
  • [3] A Tool for Fake News Detection
    Al Asaad, Bashar
    Erascu, Madalina
    2018 20TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2018), 2019, : 379 - 386
  • [4] Fake news detection on Twitter
    Sharma, Srishti
    Saraswat, Mala
    Dubey, Anil Kumar
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2022, 18 (5/6) : 388 - 412
  • [5] Feature analysis of fake news: improving fake news detection in social media
    Leung, Johnathan
    Vatsalan, Dinusha
    Arachchilage, Nalin
    Journal of Cyber Security Technology, 2023, 7 (04) : 224 - 241
  • [6] A hybrid model for fake news detection: Leveraging news content and user comments in fake news
    Albahar, Marwan
    IET INFORMATION SECURITY, 2021, 15 (02) : 169 - 177
  • [7] Automatic Fake News Detection for Romanian Online News
    Buzea, Marius Cristian
    Trausan-Matu, Stefan
    Rebedea, Traian
    INFORMATION, 2022, 13 (03)
  • [8] Fake News Detection with Generated Comments for News Articles
    Yanagi, Yuta
    Orihara, Ryohei
    Sei, Yuichi
    Tahara, Yasuyuki
    Ohsuga, Akihiko
    2020 IEEE 24TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS (INES 2020), 2020, : 85 - 89
  • [9] A comprehensive Benchmark for fake news detection
    Antonio Galli
    Elio Masciari
    Vincenzo Moscato
    Giancarlo Sperlí
    Journal of Intelligent Information Systems, 2022, 59 : 237 - 261
  • [10] Fake News Detection on Indian Sources
    Gogineni, Navyadhara
    Rachamallu, Yashashvini
    Mekala, Ruchitha
    Mamatha, H. R.
    THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND CAPSULE NETWORKS (ICIPCN 2022), 2022, 514 : 23 - 35