Fake News Classification and Topic Modeling in Brazilian Portuguese

被引:7
作者
Paixao, Maik [1 ]
Lima, Rinaldo [1 ]
Espinasse, Bernard [2 ]
机构
[1] Univ Fed Rural Pernambuco, Dept Comp, Recife, PE, Brazil
[2] Aix Marseille Univ, LIS UMR CNRS, Marseille, France
来源
2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020) | 2020年
关键词
fake news detection; topic modeling; machine learning;
D O I
10.1109/WIIAT50758.2020.00063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
All over the world, people receive daily news on many subjects through web-based information sharing platforms such as social networks. However, some of such news are false (fake) with the potential to deceive them. Thus, the automatic detection of false news is a major issue and is gaining careful attention from the scientific community. In this paper, we present experimental analysis using both supervised and unsupervised learning on the Fake.Br corpus, a fake news dataset in Brazilian Portuguese. We propose a classification method for fake news detection based on distinct types of features, and deep learning supervised algorithms. Our best classification model achieved F1 scores up to 96% and was compared with other non-deep learning classifiers. Furthermore, we provide a complementary analysis of the same dataset by performing topic modeling based on both uni-grams and bi-grams.
引用
收藏
页码:427 / 432
页数:6
相关论文
共 50 条
[21]   Fake News Detection Using LDA Topic Modelling and K-Nearest Neighbor Classifier [J].
Casillo, Mario ;
Colace, Francesco ;
Gupta, Brij B. ;
Santaniello, Domenico ;
Valentino, Carmine .
COMPUTATIONAL DATA AND SOCIAL NETWORKS, CSONET 2021, 2021, 13116 :330-339
[22]   Detecting Incoming Fake News in News Streams via Efficient Topic-Based Correlation [J].
Wei, Xiaomei ;
Zhang, Yongcheng ;
Yang, Ruohan ;
Wang, Huan .
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 2, 2025, 14851 :446-455
[23]   Multi-view co-attention network for fake news detection by modeling topic-specific user and news source credibility [J].
Bazmi, Parisa ;
Asadpour, Masoud ;
Shakery, Azadeh .
INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (01)
[24]   Fake News Classification Based on Content Level Features [J].
Lai, Chun-Ming ;
Chen, Mei-Hua ;
Kristiani, Endah ;
Verma, Vinod Kumar ;
Yang, Chao-Tung .
APPLIED SCIENCES-BASEL, 2022, 12 (03)
[25]   Application of Machine Learning Techniques for Fake News Classification [J].
Silva, Kim ;
Paixao, Crysttian ;
Rodrigues, Paulo Canas .
MEASUREMENT-INTERDISCIPLINARY RESEARCH AND PERSPECTIVES, 2024,
[26]   An Experimental Evaluation of Data Classification Models for Credibility Based Fake News Detection [J].
Ramkissoon, Amit Neil ;
Mohammed, Shareeda .
20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2020), 2020, :93-100
[27]   Natural Language Processing with Optimal Deep Learning Based Fake News Classification [J].
Althubiti, Sara A. ;
Alenezi, Fayadh ;
Mansour, Romany F. .
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02) :3529-3544
[28]   Fake News Detection via Multi-Modal Topic Memory Network [J].
Ying, Long ;
Yu, Hui ;
Wang, Jinguang ;
Ji, Yongze ;
Qian, Shengsheng .
IEEE ACCESS, 2021, 9 :132818-132829
[29]   Semantic graph based topic modelling framework for multilingual fake news detection [J].
Mohawesh, Rami ;
Liu, Xiao ;
Arini, Hilya Mudrika ;
Wu, Yutao ;
Yin, Hui .
AI OPEN, 2023, 4 :33-41
[30]   Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results [J].
Monteiro, Rafael A. ;
Santos, Roney L. S. ;
Pardo, Thiago A. S. ;
de Almeida, Tiago A. ;
Ruiz, Evandro E. S. ;
Vale, Oto A. .
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 :324-334