Building and benchmarking the motivated deception corpus: Improving the quality of deceptive text through gaming

被引:1
|
作者
Barsever, Dan [1 ]
Steyvers, Mark [1 ]
Neftci, Emre [1 ]
机构
[1] Univ Calif Irvine, Dept Cognit Sci, Irvine, CA 92697 USA
基金
美国国家科学基金会;
关键词
Deception; Text; Machine learning; Neural networks; Corpus; BERT; Natural language processing; Truth; Lie;
D O I
10.3758/s13428-022-02028-7
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
When one studies fake news or false reviews, the first step to take is to find a corpus of text samples to work with. However, most deceptive corpora suffer from an intrinsic problem: there is little incentive for the providers of the deception to put their best effort, which risks lowering the quality and realism of the deception. The corpus described in this project, the Motivated Deception Corpus, aims to rectify this problem by gamifying the process of deceptive text collection. By having subjects play the game Two Truths and a Lie, and by rewarding those subjects that successfully fool their peers, we collect samples in such a way that the process itself improves the quality of the text. We have amassed a large corpus of deceptive text that is strongly incentivized to be convincing, and thus more reflective of real deceptive text. We provide results from several configurations of neural network prediction models to establish machine learning benchmarks on the data. This new corpus is demonstratively more challenging to classify with the current state of the art than previous corpora.
引用
收藏
页码:4478 / 4488
页数:11
相关论文
共 3 条
  • [1] Building and benchmarking the motivated deception corpus: Improving the quality of deceptive text through gaming
    Dan Barsever
    Mark Steyvers
    Emre Neftci
    Behavior Research Methods, 2023, 55 : 4478 - 4488
  • [2] Improving Text Emotion Detection Through Comprehensive Dataset Quality Analysis
    Langure, Alejandro de Leon
    Zareei, Mahdi
    IEEE ACCESS, 2024, 12 : 166512 - 166536
  • [3] Improving Radiology Report Generation Quality and Diversity through Reinforcement Learning and Text Augmentation
    Parres, Daniel
    Albiol, Alberto
    Paredes, Roberto
    BIOENGINEERING-BASEL, 2024, 11 (04):