Building and benchmarking the motivated deception corpus: Improving the quality of deceptive text through gaming

被引：1

作者：

Barsever, Dan ^{[1
]}

Steyvers, Mark ^{[1
]}

Neftci, Emre ^{[1
]}

机构：

[1] Univ Calif Irvine, Dept Cognit Sci, Irvine, CA 92697 USA

来源：

BEHAVIOR RESEARCH METHODS | 2023年 / 55卷 / 08期

基金：

美国国家科学基金会;

关键词：

Deception; Text; Machine learning; Neural networks; Corpus; BERT; Natural language processing; Truth; Lie;

D O I：

10.3758/s13428-022-02028-7

中图分类号：

B841 [心理学研究方法];

学科分类号：

040201 ;

摘要：

When one studies fake news or false reviews, the first step to take is to find a corpus of text samples to work with. However, most deceptive corpora suffer from an intrinsic problem: there is little incentive for the providers of the deception to put their best effort, which risks lowering the quality and realism of the deception. The corpus described in this project, the Motivated Deception Corpus, aims to rectify this problem by gamifying the process of deceptive text collection. By having subjects play the game Two Truths and a Lie, and by rewarding those subjects that successfully fool their peers, we collect samples in such a way that the process itself improves the quality of the text. We have amassed a large corpus of deceptive text that is strongly incentivized to be convincing, and thus more reflective of real deceptive text. We provide results from several configurations of neural network prediction models to establish machine learning benchmarks on the data. This new corpus is demonstratively more challenging to classify with the current state of the art than previous corpora.

引用

页码：4478 / 4488

页数：11

共 3 条

[1] Building and benchmarking the motivated deception corpus: Improving the quality of deceptive text through gaming
Dan Barsever
Mark Steyvers
Emre Neftci
Behavior Research Methods, 2023, 55 : 4478 - 4488
[2] Improving Text Emotion Detection Through Comprehensive Dataset Quality Analysis
Langure, Alejandro de Leon
Zareei, Mahdi
IEEE ACCESS, 2024, 12 : 166512 - 166536
[3] Improving Radiology Report Generation Quality and Diversity through Reinforcement Learning and Text Augmentation
Parres, Daniel
Albiol, Alberto
Paredes, Roberto
BIOENGINEERING-BASEL, 2024, 11 (04):

← 1 →