Fake News Classification and Topic Modeling in Brazilian Portuguese

被引:7
作者
Paixao, Maik [1 ]
Lima, Rinaldo [1 ]
Espinasse, Bernard [2 ]
机构
[1] Univ Fed Rural Pernambuco, Dept Comp, Recife, PE, Brazil
[2] Aix Marseille Univ, LIS UMR CNRS, Marseille, France
来源
2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020) | 2020年
关键词
fake news detection; topic modeling; machine learning;
D O I
10.1109/WIIAT50758.2020.00063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
All over the world, people receive daily news on many subjects through web-based information sharing platforms such as social networks. However, some of such news are false (fake) with the potential to deceive them. Thus, the automatic detection of false news is a major issue and is gaining careful attention from the scientific community. In this paper, we present experimental analysis using both supervised and unsupervised learning on the Fake.Br corpus, a fake news dataset in Brazilian Portuguese. We propose a classification method for fake news detection based on distinct types of features, and deep learning supervised algorithms. Our best classification model achieved F1 scores up to 96% and was compared with other non-deep learning classifiers. Furthermore, we provide a complementary analysis of the same dataset by performing topic modeling based on both uni-grams and bi-grams.
引用
收藏
页码:427 / 432
页数:6
相关论文
共 50 条
  • [1] Using Topic Modeling in Classification of Brazilian Lawsuits
    Aguiar, Andre
    Silveira, Raquel
    Furtado, Vasco
    Pinheiro, Vladia
    Monteiro Neto, Joao A.
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 233 - 242
  • [2] Towards automatic fake news classification
    Ghosh S.
    Shah C.
    2018, John Wiley and Sons Inc (55) : 805 - 807
  • [3] Using Topic Modeling and Adversarial Neural Networks for Fake News Video Detection
    Choi, Hyewon
    Ko, Youngjoong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2950 - 2954
  • [4] Bilingual COVID-19 Fake News Detection Based on LDA Topic Modeling and BERT Transformer
    Omrani, Pouria
    Ebrahimian, Zahra
    Toosi, Ramin
    Akhaee, Mohammad Ali
    2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,
  • [5] Illusion of Truth: Analysing and Classifying COVID-19 Fake News in Brazilian Portuguese Language
    Endo, Patricia Takako
    Santos, Guto Leoni
    de Lima Xavier, Maria Eduarda
    Nascimento Campos, Gleyson Rhuan
    de Lima, Luciana Conceicao
    Silva, Ivanovitch
    Egli, Antonia
    Lynn, Theo
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (02)
  • [6] Towards automatically filtering fake news in Portuguese
    Silva, Renato M.
    Santos, Roney L. S.
    Almeida, Tiago A.
    Pardo, Thiago A. S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 146
  • [7] Automatic social media news classification: a topic modeling approach
    Amador, Daniel
    Gamboa-Venegas, Carlos
    Garcia, Ernesto
    Segura-Castillo, Andres
    TECNOLOGIA EN MARCHA, 2022, 35
  • [8] A Fake News Detection and Credibility Ranking Platform for Portuguese Online News
    Lima, Ines Rito
    Pinto, Marcia
    Amorim, Ivone
    Marreiros, Goreti
    Ulisses, Alexandre
    INFORMATION SYSTEMS AND TECHNOLOGIES, WORLDCIST 2022, VOL 1, 2022, 468 : 531 - 541
  • [9] Fakepedia Corpus: A Flexible Fake News Corpus in Portuguese
    Charles, Anderson Cordeiro
    Ruback, Livia
    Oliveira, Jonice
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 37 - 45
  • [10] A Topic-Agnostic Approach for Identifying Fake News Pages
    Castelo, Sonia
    Almeida, Thais
    Elghafari, Anas
    Santos, Aecio
    Pham, Kien
    Nakamura, Eduardo
    Freire, Juliana
    COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2019 ), 2019, : 975 - 980