Entity-centric multi-domain transformer for improving generalization in fake news detection

Cited by: 5
Authors
Bazmi, Parisa [1]
Asadpour, Masoud [1]
Shakery, Azadeh [1, 2]
Maazallahi, Abbas [1]
Affiliations
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
Keywords
Cross-domain; Domain generalization; Entity abstraction; Fake news detection; Knowledge entities; Mixture-of-experts; Multi-domain
DOI
10.1016/j.ipm.2024.103807
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Fake news has become a significant concern in recent times, particularly during the COVID-19 pandemic, as spreading false information can pose significant public health risks. Although many models have been suggested to detect fake news, they are often limited in their ability to extend to newly emerging domains because they are designed for a single domain. Previous studies on multi-domain fake news detection have focused on developing models that perform well on multiple domains, but these models often fail to generalize to new unseen domains, which limits their effectiveness. To overcome this limitation, we propose the Entity-centric Multi-domain Transformer (EMT) model. EMT uses entities in the news as key components in learning domain-invariant and domain-specific news representations, which addresses the challenges of domain shift and incomplete domain labeling in multi-domain fake news detection. It incorporates entity background information from external knowledge sources to enhance fine-grained news domain representation. EMT consists of a Domain-Invariant (DI) encoder, a Domain-Specific (DS) encoder, and a Cross-Domain Transformer (CT) that facilitates investigation of domain relationships and knowledge interaction with input news, enabling effective generalization. We evaluate EMT's performance in multi-domain fake news detection across three settings: supervised multi-domain, zero-shot on a new unseen domain, and limited samples from a new domain. EMT demonstrates greater stability than state-of-the-art models when dealing with domain changes and varying training data. Specifically, in the zero-shot setting on new unseen domains, EMT achieves an F1 score of approximately 72%. The results highlight the effectiveness of EMT's entity-centric approach and its potential for real-world applications, as it adapts to various training settings and outperforms existing models in handling limited labeled data and adapting to previously unseen domains.
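As a rough illustration of the idea behind combining domain-invariant and domain-specific representations, the sketch below mixes several per-domain "expert" vectors with softmax gate weights and concatenates the result with a domain-invariant vector. This is not the authors' implementation: the function names (`softmax`, `mix_experts`, `combine`), the toy vectors, and the use of concatenation are all assumptions made for illustration, and the abstract does not specify how the DI encoder, DS encoder, and CT are actually wired together.

```python
import math


def softmax(scores):
    """Turn raw gate scores into weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mix_experts(expert_outputs, gate_scores):
    """Weighted sum of per-domain expert vectors (hypothetical helper)."""
    weights = softmax(gate_scores)
    dim = len(expert_outputs[0])
    return [
        sum(w * out[i] for w, out in zip(weights, expert_outputs))
        for i in range(dim)
    ]


def combine(domain_invariant, expert_outputs, gate_scores):
    """Concatenate a domain-invariant vector with the mixed
    domain-specific vector (concatenation is an assumption here)."""
    return list(domain_invariant) + mix_experts(expert_outputs, gate_scores)


# Toy inputs: one domain-invariant vector and three domain experts.
di = [0.2, -0.1]
experts = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
gate = [2.0, 0.0, 0.0]  # the gate favors the first domain

features = combine(di, experts, gate)
print(features)
```

Because the gate weights are soft rather than one-hot, a news item that does not fit any single training domain can still draw on several experts at once, which is one plausible reason a mixture of this kind helps when domain labels are incomplete.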
Pages: 17