Albanian Fake News Detection

被引:10
|
作者
Canhasi, Ercan [1 ]
Shijaku, Rexhep [1 ]
Berisha, Erblin [1 ]
机构
[1] Univ Prizren, POB 1212, Prizren, Kosovo
关键词
Fake news; text categorization; natural language processing; machine learning; corpus construction; DECEPTION;
D O I
10.1145/3487288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed the vast increase of the phenomenon known as the fake news. Among the main reasons for this increase are the continuous growth of internet and social media usage and the real-time information dissemination opportunity offered by them. Deceiving, misleading content, such as the fake news, especially the type made by and for social media users, is becoming eminently hazardous. Hence, the fake news detection problem has become an important research topic. Despite the recent advances in fake news detection, the lack of fake news corpora for the under-resourced languages is compromising the development and the evaluation of existing approaches in these languages. To fill this huge gap, in this article, we investigate the issue of fake news detection for the Albanian language. In it, we present a new public dataset of labeled true and fake news in Albanian and perform an extensive analysis of machine learning methods for fake news detection. We performed a comprehensive feature engineering and feature selection experiments. In doing so, we explored the Albanian language-related feature categories such as the lexical, syntactic, lying-detection, and psycho-linguistic features. Each article was also modeled in four different ways: with the traditional bag-of-words (BoW) and with three distributed text representations using the state-of-the-art Word2Vec, FastText, and BERT methods. Additionally, we investigated the best combination of features and various types of classification methods. The conducted experiments and obtained results from evaluations are finally used to draw some conclusions. They shed light on the potentiality of the methods and the challenges that the Albanian fake news detection presents.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Fake News Detection Using Ethereum Blockchain
    Upadhyay, Akanksha
    Baranwal, Gaurav
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2021, 2022, 1534 : 142 - 152
  • [32] Fake News Detection: An Ensemble Learning Approach
    Agarwal, Arush
    Dixit, Akhil
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 1178 - 1183
  • [33] Curriculum Contrastive Learning for Fake News Detection
    Ma, Jiachen
    Liu, Yong
    Liu, Meng
    Han, Meng
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4309 - 4313
  • [34] Multimodal Approaches based on Fake News Detection
    Reddy, Bandi Sravani
    Siva Kumar, A.P.
    Proceedings of the 3rd International Conference on Artificial Intelligence and Smart Energy, ICAIS 2023, 2023, : 751 - 755
  • [35] Fake news detection by image montage recognition
    Steinebach M.
    Gotkowski K.
    Liu H.
    Journal of Cyber Security and Mobility, 2020, 9 (02): : 175 - 202
  • [36] Fake News Detection Utilizing Textual Cues
    Chouliara, Vasiliki
    Koukaras, Paraskevas
    Tjortjis, Christos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT I, 2023, 675 : 393 - 403
  • [37] A Performance Comparison of Fake News Detection Approaches
    Zhu, Haichao
    Sinnott, Richard O.
    2021 IEEE ASIA-PACIFIC CONFERENCE ON COMPUTER SCIENCE AND DATA ENGINEERING (CSDE), 2021,
  • [38] Fake News Detection Using Deep Learning
    Lee, Dong-Ho
    Kim, Yu-Ri
    Kim, Hyeong-Jun
    Park, Seung-Myun
    Yang, Yu-Jun
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2019, 15 (05): : 1119 - 1130
  • [39] Detection of Fake News Using Clustering Algorithms
    Lavanya, K.
    Yasaswini, L.
    Anusha, Ch. Naga
    Vyshnavi, K.
    Vyshnavi, M.
    SOFT COMPUTING FOR SECURITY APPLICATIONS, ICSCS 2022, 2023, 1428 : 655 - 664
  • [40] IFND: a benchmark dataset for fake news detection
    Dilip Kumar Sharma
    Sonal Garg
    Complex & Intelligent Systems, 2023, 9 : 2843 - 2863