Completing Scientific Facts in Knowledge Graphs of Research Concepts

被引:13
作者
Borrego, Agustin [1 ]
Dessi, Danilo [2 ]
Hernandez, Inma [1 ]
Osborne, Francesco [3 ,4 ]
Recupero, Diego Reforgiato [2 ]
Ruiz, David [1 ]
Buscaldi, Davide [5 ]
Motta, Enrico [3 ]
机构
[1] Univ Seville, Dept Comp Languages & Syst, Seville 41004, Spain
[2] Univ Cagliari, Dept Math & Comp Sci, I-09124 Cagliari, Italy
[3] Open Univ, Knowledge Media Inst, Milton Keynes MK7 6AA, Bucks, England
[4] Univ Milano Bicocca, Dept Business & Law, I-20126 Milan, Italy
[5] Sorbonne Paris North Univ, LIPN, CNRS, UMR 7030, F-93430 Villetaneuse, France
关键词
Machine learning; Feature extraction; Semantic Web; Task analysis; Context modeling; Computational modeling; Benchmark testing; Knowledge based systems; Knowledge graphs; science of science; knowledge graph completion; triple classification; machine learning; semantic web;
D O I
10.1109/ACCESS.2022.3220241
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the last few years, we have witnessed the emergence of several knowledge graphs that explicitly describe research knowledge with the aim of enabling intelligent systems for supporting and accelerating the scientific process. These resources typically characterize a set of entities in this space (e.g., tasks, methods, evaluation techniques, proteins, chemicals), their relations, and the relevant actors (e.g., researchers, organizations) and documents (e.g., articles, books). However, they are usually very partial representations of the actual research knowledge and may miss several relevant facts. In this paper, we introduce SciCheck, a new triple classification approach for completing scientific statements in knowledge graphs. SciCheck was evaluated against other state-of-the-art approaches on seven benchmarks, yielding excellent results. Finally, we provide a real-world use case and applied SciCheck to the Artificial Intelligence Knowledge Graph (AI-KG), a large-scale automatically-generated open knowledge graph including 1.2M statements extracted from the 333K most cited articles in the field of Artificial Intelligence, and generated a new version of this knowledge graph with 300K additional triples.
引用
收藏
页码:125867 / 125880
页数:14
相关论文
共 74 条
[1]   Friends and neighbors on the Web [J].
Adamic, LA ;
Adar, E .
SOCIAL NETWORKS, 2003, 25 (03) :211-230
[2]  
Alahakoon T., 2011, P 4 WORKSH SOC NETW, P1, DOI DOI 10.1145/1989656.1989657
[3]  
Angeli G, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P344
[4]   A neural network for semantic labelling of structured information [J].
Ayala, Daniel ;
Borrego, Agustin ;
Hernandez, Inma ;
Ruiz, David .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 143
[5]   AYNEC: All You Need for Evaluating Completion Techniques in Knowledge Graphs [J].
Ayala, Daniel ;
Borrego, Agustin ;
Hernandez, Inma ;
Rivero, Carlos R. ;
Ruiz, David .
SEMANTIC WEB, ESWC 2019, 2019, 11503 :397-411
[6]   The Impact of Negative Triple Generation Strategies and Anomalies on Knowledge Graph Completion [J].
Bansal, Iti ;
Tiwari, Sudhanshu ;
Rivero, Carlos R. .
CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, :45-54
[7]   Bio2RDF: Towards a mashup to build bioinformatics knowledge systems [J].
Belleau, Francois ;
Nolin, Marc-Alexandre ;
Tourigny, Nicole ;
Rigault, Philippe ;
Morissette, Jean .
JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (05) :706-716
[8]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[9]  
Bollacker K., 2008, SIGMOD C, P1247, DOI [10.1145/1376616.1376746, DOI 10.1145/1376616.1376746]
[10]  
Bordes A., 2013, P C ADV NEUR INF PRO