Efficient keyword search on graph data for finding diverse and relevant answers

被引:3
作者
Park, Chang-Sup [1 ]
机构
[1] Dongduk Womens Univ, Dept Comp Sci, Seoul, South Korea
关键词
Keyword search; Graph data; Top-k query; Diverse results; REDUNDANCY;
D O I
10.1108/IJWIS-09-2022-0157
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
PurposeThis paper studies a keyword search over graph-structured data used in various fields such as semantic web, linked open data and social networks. This study aims to propose an efficient keyword search algorithm on graph data to find top-k answers that are most relevant to the query and have diverse content nodes for the input keywords. Design/methodology/approachBased on an aggregative measure of diversity of an answer set, this study proposes an approach to searching the top-k diverse answers to a query on graph data, which finds a set of most relevant answer trees whose average dissimilarity should be no lower than a given threshold. This study defines a diversity constraint that must be satisfied for a subset of answer trees to be included in the solution. Then, an enumeration algorithm and a heuristic search algorithm are proposed to find an optimal solution efficiently based on the diversity constraint and an A* heuristic. This study also provides strategies for improving the performance of the heuristic search method. FindingsThe results of experiments using a real data set demonstrate that the proposed search algorithm can find top-k diverse and relevant answers to a query on large-scale graph data efficiently and outperforms the previous methods. Originality/valueThis study proposes a new keyword search method for graph data that finds an optimal solution with diverse and relevant answers to the query. It can provide users with query results that satisfy their various information needs on large graph data.
引用
收藏
页码:19 / 41
页数:23
相关论文
共 28 条
[1]  
[Anonymous], 2008, P ACM SIGMOD08 VANC
[2]   Keyword searching and browsing in Databases using BANKS [J].
Bhalotia, G ;
Hulgeri, A ;
Nakhe, C ;
Chakrabarti, S ;
Sudarshan, S .
18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, :431-440
[3]   Robust keyword search in large attributed graphs [J].
Bryson, Spencer ;
Davoudi, Heidar ;
Golab, Lukasz ;
Kargar, Mehdi ;
Lytvyn, Yuliya ;
Mierzejewski, Piotr ;
Szlichta, Jaroslaw ;
Zihayat, Morteza .
INFORMATION RETRIEVAL JOURNAL, 2020, 23 (05) :502-524
[4]   Spatial keyword search: a survey [J].
Chen, Lisi ;
Shang, Shuo ;
Yang, Chengcheng ;
Li, Jing .
GEOINFORMATICA, 2020, 24 (01) :85-106
[5]  
Daiv BB, 2008, PROC VLDB ENDOW, V1, P1189
[6]  
Ding BL, 2007, PROC INT CONF DATA, P811
[7]  
Golenberg K., 2008, SIGMOD Conference, P927, DOI DOI 10.1145/1376616.1376708
[8]  
He Hao, 2007, P 2007 ACM SIGMOD IN, P305, DOI DOI 10.1145/1247480.1247516
[9]  
Kacholia Varun., 2005, VLDB, P505
[10]   Efficient Duplication Free and Minimal Keyword Search in Graphs [J].
Kargar, Mehdi ;
An, Aijun ;
Yu, Xiaohui .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (07) :1657-1669