MultiFC: A Real-World Multi-Domain Dataset for Evidence-Based Fact Checking of Claims

被引:0
作者
Augenstein, Isabelle [1 ]
Lioma, Christina [1 ]
Wang, Dongsheng [1 ]
Lima, Lucas Chaves [1 ]
Hansen, Casper [1 ]
Hansen, Christian [1 ]
Simonsen, Jakob Grue [1 ]
机构
[1] Univ Copenhagen, Dept Comp Sci, Copenhagen, Denmark
来源
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We contribute the largest publicly available dataset of naturally occurring factual claims for the purpose of automatic claim verification. It is collected from 26 fact checking websites in English, paired with textual sources and rich metadata, and labelled for veracity by human expert journalists. We present an in-depth analysis of the dataset, highlighting characteristics and challenges. Further, we present results for automatic veracity prediction, both with established baselines and with a novel method for joint ranking of evidence pages and predicting veracity that outperforms all baselines. Significant performance increases are achieved by encoding evidence, and by modelling metadata. Our best-performing model achieves a Macro F1 of 49.2%, showing that this is a challenging testbed for claim veracity prediction.
引用
收藏
页码:4685 / 4697
页数:13
相关论文
共 39 条
[1]  
Augenstein I., 2016, P 10 INT WORKSHOP SE, P389, DOI DOI 10.18653/V1/S16-1063
[2]  
Augenstein Isabelle., 2016, P 2016 C EMPIRICAL M, DOI DOI 10.18653/V1/D16-1084
[3]  
Augenstein Isabelle, 2018, NAACL HLT, P1896
[4]  
Bachenko J., 2008, INT C COMPUTATIONAL, P41, DOI DOI 10.3115/1599081.1599087
[5]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[6]  
Baly R., 2018, P 2018 C N AM CHAPTE, V2, P21, DOI DOI 10.18653/V1/N18-2004
[7]  
Barrn-Cedeo Alberto, 2018, CEUR WORKSHOP P, V2125
[8]  
Caruana R., 1993, P 10 INT C INT C MAC, P48, DOI [DOI 10.1016/B978-1-55860-307-3.50012-5, 10.1016/b978-1-55860-307-3.50012-5]
[9]  
Chen Sihao, 2019, P NAACL
[10]  
Ciampaglia Giovanni Luca, 2015, PLoS One, V10, DOI [DOI 10.1371/JOURNAL.PONE.0128193, 10.1371/journal.pone.0128193]