Shall I Work with Them? A Knowledge Graph-Based Approach for Predicting Future Research Collaborations

被引:10
作者
Kanakaris, Nikos [1 ]
Giarelis, Nikolaos [1 ]
Siachos, Ilias [1 ]
Karacapilidis, Nikos [1 ]
机构
[1] Univ Patras, Ind Management & Informat Syst Lab, MEAD, Rion 26504, Greece
关键词
knowledge graph; link prediction; natural language processing; document representation; future research collaborations; graph kernels; word embeddings; LINK PREDICTION; NEIGHBORS;
D O I
10.3390/e23060664
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We consider the prediction of future research collaborations as a link prediction problem applied on a scientific knowledge graph. To the best of our knowledge, this is the first work on the prediction of future research collaborations that combines structural and textual information of a scientific knowledge graph through a purposeful integration of graph algorithms and natural language processing techniques. Our work: (i) investigates whether the integration of unstructured textual data into a single knowledge graph affects the performance of a link prediction model, (ii) studies the effect of previously proposed graph kernels based approaches on the performance of an ML model, as far as the link prediction problem is concerned, and (iii) proposes a three-phase pipeline that enables the exploitation of structural and textual information, as well as of pre-trained word embeddings. We benchmark the proposed approach against classical link prediction algorithms using accuracy, recall, and precision as our performance metrics. Finally, we empirically test our approach through various feature combinations with respect to the link prediction problem. Our experimentations with the new COVID-19 Open Research Dataset demonstrate a significant improvement of the abovementioned performance metrics in the prediction of future research collaborations.
引用
收藏
页数:18
相关论文
共 56 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Friends and neighbors on the Web
    Adamic, LA
    Adar, E
    [J]. SOCIAL NETWORKS, 2003, 25 (03) : 211 - 230
  • [3] Statistical mechanics of complex networks
    Albert, R
    Barabási, AL
    [J]. REVIEWS OF MODERN PHYSICS, 2002, 74 (01) : 47 - 97
  • [4] Almeida, 2019, WORD EMBEDDINGS SURV
  • [5] Andreas J, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P822
  • [6] [Anonymous], 2008, WSDM
  • [7] [Anonymous], 2018, MACHINE LEARNING TEX
  • [8] [Anonymous], 2017, P 31 INT C NEUR INF
  • [9] [Anonymous], 2018, CSIROS DATA61 STELLA
  • [10] [Anonymous], 2013, Graph-based clustering and data visualization algorithms