Temporal knowledge extraction from large-scale text corpus

被引:9
作者
Liu, Yu [1 ]
Hua, Wen [1 ]
Zhou, Xiaofang [1 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2021年 / 24卷 / 01期
关键词
Temporal knowledge harvesting; Temporal patterns; Temporal facts; Knowledge base; BASE;
D O I
10.1007/s11280-020-00836-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge, in practice, is time-variant and many relations are only valid for a certain period of time. This phenomenon highlights the importance of harvesting temporal-aware knowledge, i.e., the relational facts coupled with their valid temporal interval. Inspired by pattern-based information extraction systems, we resort to temporal patterns to extract time-aware knowledge from free text. However, pattern design is extremely laborious and time consuming even for a single relation, and free text is usually ambiguous which makes temporal instance extraction extremely difficult. Therefore, in this work, we study the problem of temporal knowledge extraction with two steps: (1) temporal pattern extraction by automatically analysing a large-scale text corpus with a small number of seed temporal facts, (2) temporal instance extraction by applying the identified temporal patterns. For pattern extraction, we introduce various techniques, including corpus annotation, pattern generation, scoring and clustering, to improve both accuracy and coverage of the extracted patterns. For instance extraction, we propose a double-check strategy to improve the accuracy and a set of node-extension rules to improve the coverage. We conduct extensive experiments on real world datasets and compared with state-of-the-art systems. Experimental results verify the effectiveness of our proposed methods for temporal knowledge harvesting.
引用
收藏
页码:135 / 156
页数:22
相关论文
共 50 条
[41]   A Process for Extracting Knowledge Base for Chatbots from Text Corpora [J].
Krassmann, Aliane Loureiro ;
Flach, Joao Marcos ;
Cestari da Silva Grando, Anita Raquel ;
Rockenbach Tarouco, Liane Margarida ;
Bercht, Magda .
PROCEEDINGS OF 2019 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON), 2019, :322-329
[42]   A comprehensive all-in-one CRISPR toolbox for large-scale screens in plants [J].
Cheng, Yanhao ;
Li, Gen ;
Qi, Aileen ;
Mandlik, Rushil ;
Pan, Changtian ;
Wang, Doris ;
Ge, Sophia ;
Qi, Yiping .
PLANT CELL, 2025, 37 (04)
[43]   The association between supply chain structure and transparency: A large-scale empirical study [J].
Gualandris, Jury ;
Longoni, Annachiara ;
Luzzini, Davide ;
Pagell, Mark .
JOURNAL OF OPERATIONS MANAGEMENT, 2021, 67 (07) :803-827
[44]   Enabling large-scale genome editing at repetitive elements by reducing DNA nicking [J].
Smith, Cory J. ;
Castanon, Oscar ;
Said, Khaled ;
Volf, Verena ;
Khoshakhlagh, Parastoo ;
Hornick, Amanda ;
Ferreira, Raphael ;
Wu, Chun-Ting ;
Guell, Marc ;
Garg, Shilpa ;
Ng, Alex H. M. ;
Myllykallio, Hannu ;
Church, George M. .
NUCLEIC ACIDS RESEARCH, 2020, 48 (09) :5183-5195
[45]   Health-2000: An integrated large-scale expert system for the hospital of the future [J].
Boyom, SF ;
Kwankam, SY ;
Asoh, DA ;
Asaah, C ;
Kengne, F .
METHODS OF INFORMATION IN MEDICINE, 1997, 36 (02) :92-94
[46]   Diagnosing large-scale stellar magnetic fields using PCA on spectropolarimetric data [J].
Lehmann, L. T. ;
Donati, J-F .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2022, 514 (02) :2333-2345
[47]   A large-scale forest landscape model incorporating multi-scale processes and utilizing forest inventory data [J].
Wang, Wen J. ;
He, Hong S. ;
Spetich, Martin A. ;
Shifley, Stephen R. ;
Thompson, Frank R., III ;
Larsen, David R. ;
Fraser, Jacob S. ;
Yang, Jian .
ECOSPHERE, 2013, 4 (09)
[48]   Canonicalization of Open Knowledge Bases with Side Information from the Source Text [J].
Lin, Xueling ;
Chen, Lei .
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, :950-961
[49]   MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition [J].
Guo, Yandong ;
Zhang, Lei ;
Hu, Yuxiao ;
He, Xiaodong ;
Gao, Jianfeng .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :87-102
[50]   Springtime coupled modes of regional wind in the Iberian Peninsula and large-scale variability patterns [J].
Martin, M. L. ;
Valero, F. ;
Morata, A. ;
Luna, M. Y. ;
Pascual, A. ;
Santos-Munoz, D. .
INTERNATIONAL JOURNAL OF CLIMATOLOGY, 2011, 31 (06) :880-895