Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks

Cited by: 12
Authors
Wu, Jiaying [1]
Guo, Jiafeng [2]
Hooi, Bryan [1]
Affiliations
[1] Natl Univ Singapore, Singapore, Singapore
[2] Univ Chinese Acad Sci, Inst Comp Technol CAS, Beijing, Peoples R China
Source
PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024 | 2024
Funding
National Research Foundation, Singapore
Keywords
Fake News; Large Language Models; Adversarial Robustness;
DOI
10.1145/3637528.3671977
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
It is commonly perceived that fake news and real news exhibit distinct writing styles, such as the use of sensationalist versus objective language. However, we emphasize that style-related features can also be exploited for style-based attacks. Notably, the advent of powerful Large Language Models (LLMs) has empowered malicious actors to mimic the style of trustworthy news sources, doing so swiftly, cost-effectively, and at scale. Our analysis reveals that LLM-camouflaged fake news content significantly undermines the effectiveness of state-of-the-art text-based detectors (up to 38% decrease in F1 score), implying a severe vulnerability to stylistic variations. To address this, we introduce SheepDog, a style-robust fake news detector that prioritizes content over style in determining news veracity. SheepDog achieves this resilience through (1) LLM-empowered news reframings that inject style diversity into the training process by customizing articles to match different styles; (2) a style-agnostic training scheme that ensures consistent veracity predictions across style-diverse reframings; and (3) content-focused veracity attributions that distill content-centric guidelines from LLMs for debunking fake news, offering supplementary cues and potential interpretability that assist veracity prediction. Extensive experiments on three real-world benchmarks demonstrate SheepDog's style robustness and adaptability to various backbones.
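The abstract does not spell out the training objective, so the following is only a minimal sketch of what a style-agnostic consistency objective of the kind described in component (2) could look like. It assumes a binary real/fake classifier that outputs logits, combines a supervised cross-entropy term on the original article with a KL-divergence penalty between the prediction on the original and the predictions on its style reframings. The function names, the choice of KL divergence, and the `consistency_weight` parameter are illustrative assumptions, not the authors' formulation.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert raw logits into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs: Sequence[float], label: int) -> float:
    """Negative log-likelihood of the true label."""
    return -math.log(max(probs[label], 1e-12))


def kl_divergence(p: Sequence[float], q: Sequence[float]) -> float:
    """KL(p || q) between two discrete distributions over the same classes."""
    return sum(
        pi * math.log(max(pi, 1e-12) / max(qi, 1e-12))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def style_agnostic_loss(
    original_logits: Sequence[float],
    reframed_logits: Sequence[Sequence[float]],
    label: int,
    consistency_weight: float = 1.0,  # hypothetical hyperparameter
) -> float:
    """Supervised loss on the original article plus a penalty for
    predictions that change when only the writing style changes.

    Assumption: consistency is measured as the mean KL divergence from the
    original prediction to each reframing's prediction.
    """
    p_orig = softmax(original_logits)
    supervised = cross_entropy(p_orig, label)
    if not reframed_logits:
        return supervised
    consistency = sum(
        kl_divergence(p_orig, softmax(r)) for r in reframed_logits
    ) / len(reframed_logits)
    return supervised + consistency_weight * consistency
```

Under this sketch, a detector whose prediction is unchanged across reframings pays only the supervised term, while one whose prediction flips when the style changes incurs an additional penalty. For instance, `style_agnostic_loss([2.0, 0.0], [[0.0, 2.0]], label=0)` is strictly larger than `style_agnostic_loss([2.0, 0.0], [[2.0, 0.0]], label=0)`.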
Pages: 3367-3378
Page count: 12