Rumour detection on benchmark twitter datasets using graph neural networks with data augmentation

Cited by: 0
Authors
Patel, Shaswat [1]
Bansal, Prince [1]
Kaur, Preeti [1]
Affiliations
[1] Netaji Subhas Univ Technol, Dept Comp Engn, Azad Hind Fauj Marg, New Delhi 110078, India
Keywords
Rumour detection; Oversampling; Data augmentation; Graph neural network; BERT
DOI
10.1007/s13278-024-01328-4
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Social media has become a significant source of both essential facts and alarming falsehoods, including rumours. Rumour spreading has increased significantly in the absence of an automatic rumour detection mechanism, causing widespread and severe social repercussions. To address this challenge, we present a ground-breaking method for developing an automatic rumour detection system, focusing on the fundamental problem of class imbalance in rumour detection. Our method selectively uses oversampling to obtain a uniformly distributed dataset, leveraging contextualised data augmentation techniques to generate synthetic samples for underrepresented classes. Furthermore, we recreate the non-linear dialogues inside a thread using two novel graph neural networks (GNNs), which improves the system's capacity to understand complex links between posts. Our method also employs a distinctive feature selection mechanism to further enhance Twitter representations based on the state-of-the-art BERTweet model. A thorough analysis of our methodology on three publicly accessible datasets yielded compelling results: (1) Our GNN models outperformed most state-of-the-art classifiers in F1-score by more than 20%, emphasizing the importance of our approach to developing sophisticated rumour detection systems. (2) Our oversampling method improves the F1-score by 9%, highlighting the practical implications of resolving class imbalance. (3) Our technique delivers further performance gains by using non-random selection criteria for data augmentation; selecting relevant tweets highlights the significance of our novel augmentation strategy. (4) Notably, our approach captures rumours in their early stages more effectively than previous classifiers, establishing a baseline for future work.
The innovative aspects of our proposed method lie in its ability to solve class imbalance effectively, outperform existing classifiers, and drastically reduce the propagation of rumours and false information on social media platforms. Our study paves the way for developments in rumour detection by offering a comprehensive solution, ultimately helping to ensure the veracity of information flowing online. We are confident that our findings will influence the broader field of rumour detection systems and provide fresh directions for further study.
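The oversampling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `augment` function is a hypothetical stand-in that swaps two adjacent words (the paper uses contextualised augmentation), the source tweet is chosen at random (the paper reports gains from non-random selection criteria), and the labels and example tweets are invented for illustration.

```python
import random
from collections import Counter, defaultdict


def augment(text, rng):
    """Hypothetical placeholder for contextual augmentation: swap two adjacent words."""
    words = text.split()
    if len(words) < 2:
        return text
    i = rng.randrange(len(words) - 1)
    words[i], words[i + 1] = words[i + 1], words[i]
    return " ".join(words)


def oversample_to_uniform(samples, seed=0):
    """Pad every class with synthetic samples until it matches the largest class.

    samples: list of (text, label) pairs. The original samples are kept, in order,
    at the front of the returned list.
    """
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for text, label in samples:
        by_label[label].append(text)

    target = max(len(texts) for texts in by_label.values())
    balanced = list(samples)
    for label, texts in by_label.items():
        for _ in range(target - len(texts)):
            # Assumption: random source choice; the paper uses non-random selection.
            source = rng.choice(texts)
            balanced.append((augment(source, rng), label))
    return balanced


# Invented toy data: 4 non-rumour, 1 rumour, 2 unverified.
data = [
    ("the bridge is open again today", "non-rumour"),
    ("officials confirmed the schedule this morning", "non-rumour"),
    ("the match starts at seven tonight", "non-rumour"),
    ("the library reopened after repairs", "non-rumour"),
    ("they say the airport was shut down", "rumour"),
    ("unclear whether the road is closed", "unverified"),
    ("no word yet on the power outage", "unverified"),
]

balanced = oversample_to_uniform(data)
print(Counter(label for _, label in balanced))
```

In this sketch every class ends up with as many samples as the largest class; in practice, oversampling would typically be applied to the training split only, so that synthetic tweets do not leak into evaluation.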
Pages: 16