Enhancing Misinformation Detection in Spanish Language with Deep Learning: BERT and RoBERTa Transformer Models

被引:0
|
作者
Blanco-Fernandez, Yolanda [1 ]
Otero-Vizoso, Javier [2 ]
Gil-Solla, Alberto [1 ]
Garcia-Duque, Jorge [2 ]
机构
[1] Univ Vigo, AtlanTTic Res Ctr Telecommun Technol, Vigo 36310, Spain
[2] Univ Vigo, Escuela Ingn Telecomunicac, Vigo, Spain
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 21期
关键词
fake news; Spanish; curated synthetic dataset; fine-tuning; Transformer-based models; BERT; RoBERTa; FAKE NEWS;
D O I
10.3390/app14219729
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper presents an approach to identifying political fake news in Spanish using Transformer architectures. Current methodologies often overlook political news due to the lack of quality datasets, especially in Spanish. To address this, we created a synthetic dataset of 57,231 Spanish political news articles, gathered via automated web scraping and enhanced with generative large language models. This dataset is used for fine-tuning and benchmarking Transformer models like BERT and RoBERTa for fake news detection. Our fine-tuned models showed outstanding performance on this dataset, with accuracy ranging from 97.4% to 98.6%. However, testing with a smaller, independent hand-curated dataset, including statements from political leaders during Spain's July 2023 electoral debates, revealed a performance drop to 71%. Although this suggests that the model needs additional refinements to handle the complexity and variability of real-world political discourse, achieving over 70% accuracy seems a promising result in the under-explored domain of Spanish political fake news detection.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Exploring transformer models for sentiment classification: A comparison of BERT, RoBERTa, ALBERT, DistilBERT, and XLNet
    Areshey, Ali
    Mathkour, Hassan
    EXPERT SYSTEMS, 2024, 41 (11)
  • [2] Misinformation Detection Using Deep Learning
    Tsikerdekis, Michail
    Zeadally, Sherali
    IT PROFESSIONAL, 2023, 25 (05) : 57 - 63
  • [3] Deep active learning for misinformation detection using geometric deep learning
    Barnabo, Giorgio
    Siciliano, Federico
    Castillo, Carlos
    Leonardi, Stefano
    Nakov, Preslav
    Da San Martino, Giovanni
    Silvestri, Fabrizio
    ONLINE SOCIAL NETWORKS AND MEDIA, 2023, 33
  • [4] Robust Encrypted Inference in Deep Learning: A Pathway to Secure Misinformation Detection
    Ali, Hassan
    Javed, Rana Tallal
    Qayyum, Adnan
    Alghadhban, Amer
    Alazmi, Meshari
    Alzamil, Ahmad
    Al-Utaibi, Khalid
    Qadir, Junaid
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2025, 22 (02) : 1627 - 1639
  • [5] FakeBERT: Fake news detection in social media with a BERT-based deep learning approach
    Rohit Kumar Kaliyar
    Anurag Goswami
    Pratik Narang
    Multimedia Tools and Applications, 2021, 80 : 11765 - 11788
  • [6] FakeBERT: Fake news detection in social media with a BERT-based deep learning approach
    Kaliyar, Rohit Kumar
    Goswami, Anurag
    Narang, Pratik
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 11765 - 11788
  • [7] An Attack Detection Framework Based on BERT and Deep Learning
    Seyyar, Yunus Emre
    Yavuz, Ali Gokhan
    Unver, Halil Murat
    IEEE ACCESS, 2022, 10 : 68633 - 68644
  • [8] Few-Shot Learning for Misinformation Detection Based on Contrastive Models
    Zheng, Peng
    Chen, Hao
    Hu, Shu
    Zhu, Bin
    Hu, Jinrong
    Lin, Ching-Sheng
    Wu, Xi
    Lyu, Siwei
    Huang, Guo
    Wang, Xin
    ELECTRONICS, 2024, 13 (04)
  • [9] Transformer-Based Language Models for Software Vulnerability Detection
    Thapa, Chandra
    Jang, Seung Ick
    Ahmed, Muhammad Ejaz
    Camtepe, Seyit
    Pieprzyk, Josef
    Nepal, Surya
    PROCEEDINGS OF THE 38TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2022, 2022, : 481 - 496
  • [10] Email spam detection by deep learning models using novel feature selection technique and BERT
    Nasreen, Ghazala
    Khan, Muhammad Murad
    Younus, Muhammad
    Zafar, Bushra
    Hanif, Muhammad Kashif
    EGYPTIAN INFORMATICS JOURNAL, 2024, 26