SimG: A Semantic based Graph Similarity Search Engine

被引:1
作者
Yan, Haijiang [1 ]
Wang, Yuxiang [1 ]
Xu, Xiaoliang [1 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Hangzhou, Peoples R China
来源
2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD) | 2019年
基金
中国国家自然科学基金;
关键词
knowledge graph; RDF; semantic search engine;
D O I
10.1109/CBD.2019.00030
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
RDF knowledge graphs have received more attention in recent years. Many graph similarity search approaches are proposed to help users get the desired knowledge. However, few of them consider storage architecture and semantic similarity search together. We believe that a complete query engine requires not only an efficient search approach, but also a reliable storage architecture to support. In this paper, we design a semantic-based graph similarity search engine SimG with carefully designed architecture and data structures over an RDF Knowledge graph, which can quickly get the best k answers even given a simplified queries (e.g., Basic Graph Pattern (BGP)) by users. The outstanding features of SimG are as follows: (1) To improve RDF data management efficiency and reduce storage overhead, we first divide the RDF knowledge graph into multiple topic graphs based on the type similarity model. Then we store these topic graphs in a distributed manner(e.g., all topic-graphs are managed and maintained independently). (2) In order to manage topic graphs efficiently, we utilize the adjacency list as the fundamental data structure to store each topic graph. Moreover, we design a skip list based index to accelerate the data accessing. On the top of this topic graph storage, we implement several basic APIs such as access, add, delete and update to support the following semantic query approach. (3) The semantic similarity query module is deployed on the topic graph storage, which returns top-k answers by considering the semantic feature based on the APIs implemented above. Finally, extensive experiments on our query algorithm and storage architecture confirm the effectiveness and efficiency of SimG.
引用
收藏
页码:114 / 120
页数:7
相关论文
共 18 条
  • [1] SW-Store: a vertically partitioned DBMS for Semantic Web data management
    Abadi, Daniel J.
    Marcus, Adam
    Madden, Samuel R.
    Hollenbach, Kate
    [J]. VLDB JOURNAL, 2009, 18 (02) : 385 - 406
  • [2] Bollacker K.D., 2008, ACM C MAN DAT
  • [3] Bordes A, 2013, ADV NEURAL INFORM PR, V26
  • [4] SQuID: Semantic Similarity-Aware Query Intent Discovery
    Fariha, Anna
    Sarwar, Sheikh Muhammad
    Meliou, Alexandra
    [J]. SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 1745 - 1748
  • [5] Fletcher G., 2008, WORKSH LOG DAT ROM I, P1
  • [6] Ji GL, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P687
  • [7] Querying Web-Scale Information Networks Through Bounding Matching Scores
    Jin, Jiahui
    Khemmarat, Samamon
    Gao, Lixin
    Luo, Junzhou
    [J]. PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 527 - 537
  • [8] DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia
    Lehmann, Jens
    Isele, Robert
    Jakob, Max
    Jentzsch, Anja
    Kontokostas, Dimitris
    Mendes, Pablo N.
    Hellmann, Sebastian
    Morsey, Mohamed
    van Kleef, Patrick
    Auer, Soeren
    Bizer, Christian
    [J]. SEMANTIC WEB, 2015, 6 (02) : 167 - 195
  • [9] Lin YK, 2015, AAAI CONF ARTIF INTE, P2181
  • [10] LIU P, 2019, CONCURR COMP-PRACT E, DOI DOI 10.1002/APJ.2342