Deep Learning-Based Knowledge Graph Generation for COVID-19

Cited by: 18
Authors
Kim, Taejin [1]
Yun, Yeoil [1]
Kim, Namgyu [1]
Affiliations
[1] Kookmin Univ, Grad Sch Business IT, Seoul 02707, South Korea
Funding
National Research Foundation of Singapore;
Keywords
deep learning; knowledge graph; text analytics; pre-trained language model; BERT; EXTRACTION; CHATBOT; ENTITY;
DOI
10.3390/su13042276
Chinese Library Classification
X [Environmental Science, Safety Science];
Discipline classification code
08; 0830;
Abstract
Many attempts have been made to construct new domain-specific knowledge graphs using the existing knowledge bases of various domains. However, traditional "dictionary-based" or "supervised" knowledge graph building methods rely on predefined, human-annotated resources of entities and their relationships. Creating such resources is costly in both time and effort. As a result, methods that depend on human annotation cannot adapt quickly when domain-specific information is added or updated very frequently, as with the recent coronavirus disease 2019 (COVID-19) pandemic. Therefore, in this study, we propose an Open Information Extraction (OpenIE) system based on unsupervised learning that requires no pre-built dataset. The proposed method obtains knowledge from a large collection of text documents about COVID-19, rather than from a general knowledge base, and adds it to the existing knowledge graph. First, we constructed a COVID-19 entity dictionary, and then we scraped a large text dataset related to COVID-19. Next, we constructed a COVID-19 perspective language model by fine-tuning the Bidirectional Encoder Representations from Transformers (BERT) pre-trained language model. Finally, we defined a new COVID-19-specific knowledge base by extracting the connecting words between COVID-19 entities from COVID-19 sentences using the BERT self-attention weights. Experimental results demonstrated that the proposed Co-BERT model outperforms the original BERT in terms of mask prediction accuracy and the Metric for Evaluation of Translation with Explicit ORdering (METEOR) score.
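The final step of the abstract, picking a connecting word between two entities from self-attention weights, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `connecting_word`, the scoring rule (summing the attention that the two entity tokens pay to each token lying between them), and the toy attention matrix are all assumptions made for illustration. In practice the weights would come from a fine-tuned BERT model rather than being written by hand.

```python
from typing import List, Optional, Tuple


def connecting_word(
    tokens: List[str],
    attention: List[List[float]],
    head_idx: int,
    tail_idx: int,
) -> Optional[Tuple[str, str, str]]:
    """Return a (head, relation, tail) triple, or None if no token lies between.

    attention[i][j] is assumed to be the weight token i assigns to token j.
    The scoring rule below is a hypothetical choice, not the paper's.
    """
    lo, hi = sorted((head_idx, tail_idx))
    candidates = list(range(lo + 1, hi))  # tokens strictly between the entities
    if not candidates:
        return None
    # Score each candidate by the attention it receives from both entities.
    best = max(
        candidates,
        key=lambda i: attention[head_idx][i] + attention[tail_idx][i],
    )
    return (tokens[head_idx], tokens[best], tokens[tail_idx])


# Toy sentence with hand-written (made-up) attention weights.
tokens = ["covid-19", "often", "causes", "fever"]
attention = [
    [0.2, 0.1, 0.5, 0.2],
    [0.3, 0.3, 0.2, 0.2],
    [0.4, 0.1, 0.1, 0.4],
    [0.3, 0.1, 0.4, 0.2],
]

triple = connecting_word(tokens, attention, head_idx=0, tail_idx=3)
print(triple)
```

Each resulting triple would correspond to one candidate edge in the knowledge graph, with the two entities as nodes and the connecting word as the relation label.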
Pages: 1 / 20
Page count: 19