Rumour detection on benchmark twitter datasets using graph neural networks with data augmentation

Cited by: 0
Authors
Patel, Shaswat [1]
Bansal, Prince [1]
Kaur, Preeti [1]
Affiliations
[1] Netaji Subhas Univ Technol, Dept Comp Engn, Azad Hind Fauj Marg, New Delhi 110078, India
Keywords
Rumour detection; Oversampling; Data augmentation; Graph neural network; BERT
DOI
10.1007/s13278-024-01328-4
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Social media has become a significant source of both essential facts and alarming falsehoods, including rumours. The lack of an automated rumour detection mechanism has allowed rumour spreading to increase significantly, with widespread and severe social repercussions. To address this challenge, we present a method for building an automatic rumour detection system that focuses on the fundamental problem of class imbalance in rumour detection. Our method selectively oversamples underrepresented classes, using contextualised data augmentation techniques to generate synthetic samples until the dataset is uniformly distributed. Furthermore, we model the non-linear conversations inside a thread using two novel graph neural networks (GNNs), which improves the system's capacity to understand complex links between posts. Our method also employs a distinctive feature selection mechanism to further enhance tweet representations based on the state-of-the-art BERTweet model. A thorough evaluation of our methodology on three publicly accessible datasets yielded the following results: (1) Our GNN models outperformed most state-of-the-art classifiers in F1-score by more than 20%, emphasising the importance of our approach to developing sophisticated rumour detection systems. (2) Our oversampling method improves the F1-score by 9%, highlighting the practical implications of resolving class imbalance. (3) Our technique delivers further performance gains through non-random selection criteria for data augmentation; selecting relevant tweets highlights the significance of our novel augmentation strategy. (4) Our approach captures rumours in their early stages more effectively than previous classifiers, establishing a baseline for future work.
The innovative aspects of our proposed method lie in its ability to address class imbalance effectively, outperform existing classifiers, and drastically reduce the propagation of rumours and false information on social media platforms. Our study paves the way for developments in rumour detection by offering a comprehensive solution, ultimately helping to ensure the veracity of information circulating online. We are confident that our findings will influence the broader field of rumour detection systems and provide fresh directions for further study.
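The oversampling idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `augment` function is a hypothetical stand-in (a simple adjacent-token swap) for the contextualised augmentation the paper describes, the source tweets are picked at random rather than by the paper's non-random selection criteria, and the labels and sample texts are invented for illustration only.

```python
import random
from collections import Counter


def augment(text, rng):
    """Hypothetical placeholder for contextual augmentation:
    swaps two adjacent tokens to produce a synthetic variant."""
    tokens = text.split()
    if len(tokens) < 2:
        return text
    i = rng.randrange(len(tokens) - 1)
    tokens[i], tokens[i + 1] = tokens[i + 1], tokens[i]
    return " ".join(tokens)


def oversample(samples, seed=0):
    """Grow every minority class to the size of the largest class.

    samples: list of (text, label) pairs.
    Returns the original samples followed by synthetic ones.
    """
    rng = random.Random(seed)
    by_label = {}
    for text, label in samples:
        by_label.setdefault(label, []).append(text)

    # Target size is the size of the largest class.
    target = max(len(texts) for texts in by_label.values())

    balanced = list(samples)
    for label, texts in by_label.items():
        for _ in range(target - len(texts)):
            # Assumption: random source choice; the paper uses
            # non-random selection criteria instead.
            source = rng.choice(texts)
            balanced.append((augment(source, rng), label))
    return balanced


# Invented toy data: 4 non-rumour, 2 unverified, 1 rumour.
data = [
    ("the bridge is open again today", "non-rumour"),
    ("officials confirm the road closure", "non-rumour"),
    ("trains are running on schedule", "non-rumour"),
    ("the museum reopens next week", "non-rumour"),
    ("reports of a fire near the station", "unverified"),
    ("someone said the mayor resigned", "unverified"),
    ("aliens landed in the city park", "rumour"),
]

balanced = oversample(data)
counts = Counter(label for _, label in balanced)
print(counts)
```

After the call, every class holds as many samples as the largest original class, and the original samples are kept unchanged at the front of the list. In practice the placeholder `augment` would be replaced by a context-aware generator, and oversampling would be applied to the training split only.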
Pages: 16
Related papers
50 in total
  • [31] Detection of Fake Twitter Accounts with Multiple Classifier and Data Augmentation Technique
    Sevi, Mehmet
    Aydin, Ilhan
    2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019
  • [32] Generalising electrocardiogram detection and delineation: training convolutional neural networks with synthetic data augmentation
    Jimenez-Perez, Guillermo
    Acosta, Juan
    Alcaine, Alejandro
    Camara, Oscar
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2024, 11
  • [33] Product failure prediction with missing data using graph neural networks
    Kang, Seokho
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (12) : 7225 - 7234
  • [34] Multimodal Continuous Emotion Recognition with Data Augmentation Using Recurrent Neural Networks
    Huang, Jian
    Li, Ya
    Tao, Jianhua
    Lian, Zheng
    Niu, Mingyue
    Yang, Minghao
    PROCEEDINGS OF THE 2018 AUDIO/VISUAL EMOTION CHALLENGE AND WORKSHOP (AVEC'18), 2018, : 57 - 64
  • [35] Underwater Image Classification Using Deep Convolutional Neural Networks and Data Augmentation
    Xu, Yifeng
    Zhang, Yang
    Wang, Huigang
    Liu, Xing
    2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2017
  • [36] A Data Augmentation Methodology to Improve Age Estimation using Convolutional Neural Networks
    Oliveira, Italo de Pontes
    Peixoto Medeiros, Joao Lucas
    de Sousa, Vinicius Fernandes
    Teixeira Junior, Adalberto Gomes
    Pereira, Eanes Torres
    Gomes, Herman Martins
    2016 29TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2016, : 88 - 95
  • [37] BN-GRAPH: Data Augmentation Based Bangla Text Sentiment Analysis in Deep Graph Neural Network
    Nitish Ranjan Bhowmik
    M. Rubaiyat Hossain Mondal
    Mohammad Arifuzzaman
    SN Computer Science, 6 (4)
  • [38] A novel data augmentation method to enhance deep neural networks for detection of atrial fibrillation
    Cao, Ping
    Li, Xinyi
    Mao, Kedong
    Lu, Fei
    Ning, Gangmin
    Fang, Luping
    Pan, Qing
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 56
  • [39] A dual graph neural networks model using sequence embedding as graph nodes for vulnerability detection
    Ling, Miaogui
    Tang, Mingwei
    Bian, Deng
    Lv, Shixuan
    Tang, Qi
    INFORMATION AND SOFTWARE TECHNOLOGY, 2025, 177
  • [40] Fatigue Damage Diagnostics of Composites Using Data Fusion and Data Augmentation With Deep Neural Networks
    Dabetwar, Shweta
    Ekwaro-Osire, Stephen
    Dias, Joao Paulo
    JOURNAL OF NONDESTRUCTIVE EVALUATION, DIAGNOSTICS AND PROGNOSTICS OF ENGINEERING SYSTEMS, 2022, 5 (02)