Fact-checking: relevance assessment of references in the Polish political domain

被引:3
作者
Sawczyn, Albert [1 ]
Binkowski, Jakub [1 ]
Janiak, Denis [1 ]
Augustyniak, Lukasz [1 ]
Kajdanowicz, Tomasz [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Wybrzeze Stanislawa Wyspianskiego 27, PL-50370 Wroclaw, Poland
来源
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021) | 2021年 / 192卷
关键词
natural language processing; fact-checking; fake news; relevance assessment; information retrieval; deep learning; language modelling; transformers;
D O I
10.1016/j.procs.2021.08.132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The prevalence of fake news could be observed in circumstances of emotion-causing events, like elections or pandemics. In fear of the potential impact, many fact-checking organisations were established. However, fact-checking requires a large amount of human labor, and hence there is a strong demand for complete automation of this process. Nevertheless, this milestone has not been achieved yet, even for English. The problem grows for the less popular languages that suffer from a scarcity of available resources. To address this problem for the Polish language domain, we propose a solution for automating one of the fact-checking stages - relevance assessment, which is crucial when searching for evidence. Leveraging recent advancements in natural language processing, we have acquired relevant data and developed classifiers of evidence relevance with respect to claims in Polish. Our approach can assess the evidence relevance with a performance at a level of a 0.778 F1 -score. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:1285 / 1293
页数:9
相关论文
共 35 条
[1]   Evaluation of IR strategies for Polish [J].
Akasereh, Mitra ;
Malak, Piotr ;
Pawlowski, Adam .
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8686 :384-391
[2]  
Barbaresi A., 2020, P 15 C NAT LANG PROC, P267, DOI [10.5281/ zenodo.3459599, DOI 10.5281/ZENODO.3459599]
[3]  
Bender Emily, 2019, GRADIENT, V14, P34
[4]   Influence of fake news in Twitter during the 2016 US presidential election [J].
Bovet, Alexandre ;
Makse, Hernan A. .
NATURE COMMUNICATIONS, 2019, 10 (1)
[5]  
Boyd, 2020, SPACY IND STRENGTH N, DOI DOI 10.5281/ZENODO.1212303
[6]   Reading Wikipedia to Answer Open-Domain Questions [J].
Chen, Danqi ;
Fisch, Adam ;
Weston, Jason ;
Bordes, Antoine .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1870-1879
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]  
Graves L., 2016, REUTERS I DIGITAL NE
[9]   Exposure to untrustworthy websites in the 2016 US election [J].
Guess, Andrew M. ;
Nyhan, Brendan ;
Reifler, Jason .
NATURE HUMAN BEHAVIOUR, 2020, 4 (05) :472-480
[10]   A Deep Relevance Matching Model for Ad-hoc Retrieval [J].
Guo, Jiafeng ;
Fan, Yixing ;
Ai, Qingyao ;
Croft, W. Bruce .
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, :55-64