Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks

被引：12

作者：

Wu, Jiaying ^{[1
]}

Guo, Jiafeng ^{[2
]}

Hooi, Bryan ^{[1
]}

机构：

[1] Natl Univ Singapore, Singapore, Singapore

[2] Univ Chinese Acad Sci, Inst Comp Technol CAS, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024 | 2024年

基金：

新加坡国家研究基金会;

关键词：

Fake News; Large Language Models; Adversarial Robustness;

D O I：

10.1145/3637528.3671977

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

It is commonly perceived that fake news and real news exhibit distinct writing styles, such as the use of sensationalist versus objective language. However, we emphasize that style-related features can also be exploited for style-based attacks. Notably, the advent of powerful Large Language Models (LLMs) has empowered malicious actors to mimic the style of trustworthy news sources, doing so swiftly, cost-effectively, and at scale. Our analysis reveals that LLM-camouflaged fake news content significantly undermines the effectiveness of state-of-the-art text-based detectors (up to 38% decrease in F1 Score), implying a severe vulnerability to stylistic variations. To address this, we introduce SheepDog, a style-robust fake news detector that prioritizes content over style in determining news veracity. SheepDog achieves this resilience through (1) LLM-empowered news reframings that inject style diversity into the training process by customizing articles to match different styles; (2) a style-agnostic training scheme that ensures consistent veracity predictions across style-diverse reframings; and (3) content-focused veracity attributions that distill content-centric guidelines from LLMs for debunking fake news, offering supplementary cues and potential intepretability that assist veracity prediction. Extensive experiments on three real-world benchmarks demonstrate SheepDog's style robustness and adaptability to various backbones.(1)

引用

页码：3367 / 3378

页数：12

共 69 条

[1]

Ajao O, 2019, INT CONF ACOUST SPEE, P2507, DOI [10.1109/ICASSP.2019.8683170, 10.1109/icassp.2019.8683170]

[2] Social Media and Fake News in the 2016 Election [J].

Allcott, Hunt ;

Gentzkow, Matthew .

JOURNAL OF ECONOMIC PERSPECTIVES, 2017, 31 (02) :211-235

[3] Undeutsch hypothesis and Criteria Based Content Analysis: A meta-analytic review [J].

Amado, Barbara G. ;

Arce, Ramon ;

Farina, Francisca .

EUROPEAN JOURNAL OF PSYCHOLOGY APPLIED TO LEGAL CONTEXT, 2015, 7 (01) :3-12

[4]

[Anonymous], NeurIPS

[5]

Asai Akari, 2024, ICLR

[6] Defining and Measuring News Media Quality: Comparing the Content Perspective and the Audience Perspective [J].

Bachmann, Philipp ;

Eisenegger, Mark ;

Ingenhoff, Diana .

INTERNATIONAL JOURNAL OF PRESS-POLITICS, 2022, 27 (01) :9-37

[7]

Bowman Samuel R., 2015, EMNLP, P632

[8]

Brown TB., 2020, Advances in neural information processing systems, V33, P1877, DOI [10.48550/arXiv.2005.14165, 10.48550/ARXIV.2005.14165, DOI 10.48550/ARXIV.2005.14165]

[9]

Chen C, 2023, ARXIV

[10]

Chen Chao, 2024, ICLR

← 1 2 3 4 5 6 7 →