WikipEvent: Leveraging Wikipedia Edit History for Event Detection

Cited by: 0
Authors
Tran, Tuan [1]
Ceroni, Andrea [1]
Georgescu, Mihai [1]
Naini, Kaweh Djafari [1]
Fisichella, Marco [1]
Affiliations
[1] L3S Res Ctr, Hannover, Germany
Source
WEB INFORMATION SYSTEMS ENGINEERING, PT II | 2014 / Vol. 8787
Keywords
Event Detection; Temporal Retrieval; Wikipedia; Clustering;
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline Classification Code
0812;
Abstract
Much of the existing work in information extraction assumes that relationships in fixed knowledge bases are static. However, in collaborative environments such as Wikipedia, information and structures are highly dynamic over time. In this work, we introduce a new method to extract complex event structures from Wikipedia. We propose a new model that represents events as involving multiple entities and that generalizes to an arbitrary language. The evolution of an event is captured effectively by analyzing the user edit history in Wikipedia. Our work provides a foundation for a novel class of evolution-aware, entity-based enrichment algorithms, and considerably increases the quality of entity accessibility and temporal retrieval for Wikipedia. We formalize this problem and introduce an efficient end-to-end platform as a solution. We conduct comprehensive experiments on a real dataset of 1.8 million Wikipedia articles to show the effectiveness of our proposed solution. Our results demonstrate that we achieve a precision of 70% when evaluated using manually annotated data. Finally, we make a comparative analysis of our work with the well-established Current Event Portal of Wikipedia and find that our system, WikipEvent, using the Co-References method, can be used in a complementary way to deliver new and additional information about events.
Pages: 90-108
Number of pages: 19