Enhancing Misinformation Detection in Spanish Language with Deep Learning: BERT and RoBERTa Transformer Models

Cited by: 0

Authors:

Blanco-Fernandez, Yolanda [1]

Otero-Vizoso, Javier [2]

Gil-Solla, Alberto [1]

Garcia-Duque, Jorge [2]

Affiliations:

[1] Univ Vigo, AtlanTTic Res Ctr Telecommun Technol, Vigo 36310, Spain

[2] Univ Vigo, Escuela Ingn Telecomunicac, Vigo, Spain

Source:

APPLIED SCIENCES-BASEL | 2024 / Volume 14 / Issue 21

Keywords:

fake news; Spanish; curated synthetic dataset; fine-tuning; Transformer-based models; BERT; RoBERTa

DOI:

10.3390/app14219729

Chinese Library Classification (CLC) number:

O6 [Chemistry]

Subject classification code:

0703

Abstract:

This paper presents an approach to identifying political fake news in Spanish using Transformer architectures. Current methodologies often overlook political news due to the lack of quality datasets, especially in Spanish. To address this, we created a synthetic dataset of 57,231 Spanish political news articles, gathered via automated web scraping and enhanced with generative large language models. This dataset is used for fine-tuning and benchmarking Transformer models like BERT and RoBERTa for fake news detection. Our fine-tuned models showed outstanding performance on this dataset, with accuracy ranging from 97.4% to 98.6%. However, testing with a smaller, independent hand-curated dataset, including statements from political leaders during Spain's July 2023 electoral debates, revealed a performance drop to 71%. Although this suggests that the model needs additional refinements to handle the complexity and variability of real-world political discourse, achieving over 70% accuracy seems a promising result in the under-explored domain of Spanish political fake news detection.


Pages: 27
