Event Detection in Wikipedia Edit History Improved by Documents Web Based Automatic Assessment

被引:4
|
作者
Fisichella, Marco [1 ]
Ceroni, Andrea [2 ]
机构
[1] Leibniz Univ Hannover, Res Ctr L3S, D-30167 Hannover, Germany
[2] Joblift, D-10437 Berlin, Germany
关键词
Wikipedia; user edits; event detection; event validation; temporal retrieval; clustering;
D O I
10.3390/bdcc5030034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A majority of current work in events extraction assumes the static nature of relationships in constant expertise knowledge bases. However, in collaborative environments, such as Wikipedia, information and systems are extraordinarily dynamic over time. In this work, we introduce a new approach for extracting complex structures of events from Wikipedia. We advocate a new model to represent events by engaging more than one entities that are generalizable to an arbitrary language. The evolution of an event is captured successfully primarily based on analyzing the user edits records in Wikipedia. Our work presents a basis for a singular class of evolution-aware entity-primarily based enrichment algorithms and will extensively increase the quality of entity accessibility and temporal retrieval for Wikipedia. We formalize this problem case and conduct comprehensive experiments on a real dataset of 1.8 million Wikipedia articles in order to show the effectiveness of our proposed answer. Furthermore, we suggest a new event validation automatic method relying on a supervised model to predict the presence of events in a non-annotated corpus. As the extra document source for event validation, we chose the Web due to its ease of accessibility and wide event coverage. Our outcomes display that we are capable of acquiring 70% precision evaluated on a manually annotated corpus. Ultimately, we conduct a comparison of our strategy versus the Current Event Portal of Wikipedia and discover that our proposed WikipEvent along with the usage of Co-References technique may be utilized to provide new and more data on events.
引用
收藏
页数:23
相关论文
共 20 条
  • [1] WikipEvent: Leveraging wikipedia edit history for event detection
    Tran, Tuan
    Ceroni, Andrea
    Georgescu, Mihai
    Naini, Kaweh Djafari
    Fisichella, Marco
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8787 : 90 - 108
  • [2] WikipEvent: Leveraging Wikipedia Edit History for Event Detection
    Tran, Tuan
    Ceroni, Andrea
    Georgescu, Mihai
    Naini, Kaweh Djafari
    Fisichella, Marco
    WEB INFORMATION SYSTEMS ENGINEERING, PT II, 2014, 8787 : 90 - 108
  • [3] WHAD: Wikipedia historical attributes dataHistorical structured data extraction and vandalism detection from the Wikipedia edit history
    Enrique Alfonseca
    Guillermo Garrido
    Jean-Yves Delort
    Anselmo Peñas
    Language Resources and Evaluation, 2013, 47 : 1163 - 1190
  • [4] WHAD: Wikipedia historical attributes data Historical structured data extraction and vandalism detection from the Wikipedia edit history
    Alfonseca, Enrique
    Garrido, Guillermo
    Delort, Jean-Yves
    Penas, Anselmo
    LANGUAGE RESOURCES AND EVALUATION, 2013, 47 (04) : 1163 - 1190
  • [5] SEDTWik: Segmentation-based Event Detection from Tweets using Wikipedia
    Morabia, Keval M.
    Murthy, Neti Lalita Bhanu
    Malapati, Aruna
    Samant, Surender S.
    NAACL HLT 2019: THE 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2019, : 77 - 85
  • [6] Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia
    Dalip, Daniel Hasan
    Goncalves, Marcos Andre
    Cristo, Marco
    Calado, Pavel
    JCDL 09: PROCEEDINGS OF THE 2009 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, 2009, : 295 - 304
  • [7] History-based Article Quality Assessment on Wikipedia
    Zhang, Shiyue
    Hu, Zheng
    Zhang, Chunhong
    Yu, Ke
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 1 - 8
  • [8] WIKITAG: WIKIPEDIA-BASED KNOWLEDGE EMBEDDINGS TOWARDS IMPROVED ACOUSTIC EVENT CLASSIFICATION
    Zhang, Qin
    Tang, Qingming
    Kao, Chieh-Chi
    Sun, Ming
    Liu, Yang
    Wang, Chao
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 136 - 140
  • [9] A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia
    Wang, Ping
    Li, Xiaodan
    Wu, Renli
    JOURNAL OF INFORMATION SCIENCE, 2021, 47 (02) : 176 - 191
  • [10] WebKey: a graph-based method for event detection in web news
    Elham Rasouli
    Sajjad Zarifzadeh
    Amir Jahangard Rafsanjani
    Journal of Intelligent Information Systems, 2020, 54 : 585 - 604