Annotation and linguistic analysis of claim types for fact-checking

被引:0
|
作者
Deck, Oliver [1 ]
Huesuenbeyi, Z. Melce [1 ]
Uhling, Leonie [1 ]
Scheffler, Tatjana [1 ]
机构
[1] Ruhr Univ Bochum, Dept German Language & Literature, Bochum, Germany
来源
LINGUISTICS VANGUARD | 2025年
关键词
fact-checking; check-worthiness; claims; corpus; fake news;
D O I
10.1515/lingvan-2024-0067
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Among the news items circulating in social media, only some contain factual statements, and factual claims can be differentiated by their check-worthiness. We describe the check-worthiness annotation of a novel corpus of claims obtained from real-world submissions to a German fact-checking organization: the German Crowd Claims (GCC) corpus. We iteratively adapted existing annotation guidelines, introducing the novel category of incident/event and a third level of annotation for statements. Exploratory analysis of 35 linguistic surface-level features highlights sentence length as the strongest predictor of check-worthiness, but remains inconclusive for more specific annotation. We therefore investigated the performance of transformer-based models for check-worthiness detection on the GCC corpus, in which classification accuracy was increased by translating the dataset into English, augmenting the dataset by adding additional data from a related task, and enriching the semantics by including related ontology embeddings.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] ANALYSIS OF THE FACT-CHECKING INITIATIVES IN SPAIN
    Cardenas Rica, Maria Luisa
    REVISTA INCLUSIONES, 2019, 6 : 62 - 82
  • [2] Communicating Fact to Combat Fake: Analysis of Fact-Checking Websites
    Pal, Anjan
    Loke, Cliff
    2019 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS (ITCC 2019), 2019, : 66 - 73
  • [3] Fact-checking in China: normative and strategic transparency of Chinese journalists in fact-checking reports
    Zhang, Haiyue
    ASIAN JOURNAL OF COMMUNICATION, 2025, 35 (02) : 81 - 99
  • [4] Fact-checking in Iberoamerica. A sex/gender analysis
    Torres, Maria Francisca Montiel
    Rodriguez, Laura Teruel
    DOXA COMUNICACION, 2024, (38): : 119 - 148
  • [5] Fact-checking with explanations
    Groza, Adrian
    Katona, Aron
    2022 24TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC, 2022, : 150 - 157
  • [6] An emerging genre of contemporary fact-checking
    Junestrom, Amalia
    JOURNAL OF DOCUMENTATION, 2021, 77 (02) : 501 - 517
  • [7] Towards Fact-Checking through Crowdsourcing
    Pinto, Marcos Rodrigues
    de Lima, Yuri Oliveira
    Barbosa, Carlos Eduardo
    de Souza, Jano Moreira
    PROCEEDINGS OF THE 2019 IEEE 23RD INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2019, : 494 - 499
  • [8] Facilitating automated fact-checking: a machine learning based weighted ensemble technique for claim detection
    Rahman, Md. Rashadur
    Karim, Rezaul
    Arefin, Mohammad Shamsul
    Dhar, Pranab Kumar
    Hossain, Gahangir
    Shimamura, Tetsuya
    DISCOVER APPLIED SCIENCES, 2025, 7 (01)
  • [9] Check-worthy claim detection across topics for automated fact-checking
    Abumansour, Amani S.
    Zubiaga, Arkaitz
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [10] A Survey on Automated Fact-Checking
    Guo, Zhijiang
    Schlichtkrull, Michael
    Vlachos, Andreas
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 178 - 206