tieval: An Evaluation Framework for Temporal Information Extraction Systems

Cited by: 0
Authors
Sousa, Hugo [1]
Campos, Ricardo [2, 3]
Jorge, Alipio [1]
Affiliations
[1] Univ Porto, INESC TEC, Porto, Portugal
[2] INESC TEC, Porto, Portugal
[3] Polytech Inst Tomar, Ci2 Smart Cities Res Ctr, Tomar, Portugal
Source
PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023 | 2023
Keywords
temporal information extraction; evaluation; python package;
DOI
10.1145/3539618.3591892
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Temporal information extraction (TIE) has attracted a great deal of interest over the last two decades. Such endeavors have led to the development of a significant number of datasets. Despite its benefits, having access to a large volume of corpora makes it difficult to benchmark TIE systems. On the one hand, different datasets have different annotation schemes, which hinders the comparison between competitors across different corpora. On the other hand, the fact that each corpus is disseminated in a different format requires a considerable engineering effort for a researcher/practitioner to develop parsers for all of them. These constraints force researchers to select a limited number of datasets to evaluate their systems, which consequently limits the comparability of the systems. Yet another obstacle to the comparability of TIE systems is the evaluation metric employed. While most research works adopt traditional metrics such as precision, recall, and F1, a few others prefer temporal awareness, a metric tailored to be more comprehensive in the evaluation of temporal systems. Although the reason for the absence of temporal awareness in the evaluation of most systems is not clear, one of the factors that certainly weighs on this decision is the need to implement the temporal closure algorithm, which is neither straightforward to implement nor easily available. All in all, these problems have limited the fair comparison between approaches and, consequently, the development of TIE systems. To mitigate these problems, we have developed tieval, a Python library that provides a concise interface for importing different corpora and is equipped with domain-specific operations that facilitate system evaluation. In this paper, we present the first public release of tieval and highlight its most relevant features. The library is available as open source, under the MIT License, on PyPI and GitHub.
Pages: 2871-2879
Number of pages: 9