Estimating Time Models for News Article Excerpts
Cited by: 0
Authors:
Mishra, Arunav [1]
Berberich, Klaus [1,2]
Affiliations:
[1] Max Planck Inst Informat, Saarbrucken, Germany
[2] Htw Saar, Saarbrucken, Germany
Keywords:
excerpt-time model;
temporal scoping;
distribution propagation;
temporal content analysis;
probabilistic models;
sparsity reduction;
DOI:
10.1145/2983323.2983802
Chinese Library Classification (CLC) number:
TP [Automation Technology, Computer Technology];
Discipline classification code:
0812;
Abstract:
It is often difficult to ground text to precise time intervals due to the inherent uncertainty arising from either missing or multiple expressions at year, month, and day time granularities. We address the problem of estimating an excerpt-time model capturing the temporal scope of a given news article excerpt as a probability distribution over chronons. For this, we propose a semi-supervised distribution propagation framework that leverages redundancy in the data to improve the quality of estimated time models. Our method generates an event graph with excerpts as nodes and models various inter-excerpt relations as edges. It then propagates empirical excerpt-time models estimated for temporally annotated excerpts to those that are strongly related but lack annotations. In our experiments, we first generate a test query set by randomly sampling 100 Wikipedia events as queries. For each query, using a standard text retrieval model, we then obtain the top-10 documents with an average of 150 excerpts. From these, each temporally annotated excerpt is treated as a gold standard. The evaluation measures are first computed for each gold standard excerpt for a single query, by comparing the model estimated by our method with the empirical model derived from the original expressions. Final scores are reported by averaging over all the test queries. Experiments on the English Gigaword corpus show that our method estimates significantly better time models than several baselines taken from the literature.
Pages: 781-790
Number of pages: 10