Efficient Skyline Keyword-Based Tree Retrieval on Attributed Graphs

被引:0
|
作者
Wu, Dingming [1 ]
Zhang, Zhaofen [1 ]
Jensen, Christian S. [2 ]
Lu, Kezhong [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Aalborg Univ, Dept Comp Sci, DK-9220 Aalborg, Denmark
关键词
Filtering algorithms; Europe; Semantics; Indexes; Social networking (online); Keyword search; Query processing; Skyline query; keyword search; attributed graph; query processing; SEARCH; COMPUTATION; SKYGRAPH;
D O I
10.1109/TKDE.2024.3388988
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attributed graphs are graphs, where the vertices have attributes. Such graphs encompass, e.g., social network graph, citation graphs, and knowledge graphs, which have numerous real-world applications. Keyword-based search is a prominent and user-friendly way of querying attributed graphs. One widely used approach to keyword search adopts tree-based query semantics that relies on scoring functions that aggregate distances from a root to keyword-matched vertices. However, it is non-trivial to design scoring functions that capture different users' keyword preferences. This study defines and solves the skyline KTree retrieval problem that combines keyword querying with skyline functionality on attributed graphs. The result of a skyline KTree query is independent of scoring functions. Hence, no matter which keywords are preferred, users can always find their favorite KTrees in a result. To enable efficient skyline KTree retrieval, we propose algorithmFilterRefine that first identifies candidate results and then uses them for search space pruning. Computing distances between keywords and vertices is expensive and dominates the computational cost ofFilterRefine. Inspired by subspace skyline query techniques, we convert the skyline KTree retrieval problem into a multi-dimensional subspace skyline problem and propose algorithm MultiDiSkylineOpt. This algorithm is able to reuse skylines in subspaces and uses bounds on all dimensions to accelerate distance computation. Experimental results on real datasets show that a baseline algorithm cannot report results within a 500 second cut-off time, while the proposed algorithms are able to compute results in reasonable time. In particular, MultiDiSkylineOpt is able to efficiently retrieve skyline KTrees on large graphs with millions of nodes and hundreds of millions of edges.
引用
收藏
页码:6056 / 6070
页数:15
相关论文
共 50 条
  • [1] Keyword-Based Betweenness Centrality Maximization in Attributed Graphs
    Wu, Xiao
    Wu, Yanping
    Wang, Xiaoyang
    Yang, Zhengyi
    Zhang, Wenjie
    Zhang, Ying
    DATABASES THEORY AND APPLICATIONS, ADC 2024, 2025, 15449 : 209 - 223
  • [2] Efficient Keyword-Based Searching Strategies for Linked Databases
    Srinivas, V. N. V. S. R.
    Sony Krishna, R.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2016, 16 (07): : 131 - 136
  • [3] Efficient Computation of Semantically Cohesive Subgraphs for Keyword-Based Knowledge Graph Exploration
    Shi, Yuxuan
    Cheng, Gong
    Trung-Kien Tran
    Kharlamov, Evgeny
    Shen, Yulin
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 1410 - 1421
  • [4] Search Text to Retrieve Graphs: A Scalable RDF Keyword-Based Search System
    Dosso, Dennis
    Silvello, Gianmaria
    IEEE ACCESS, 2020, 8 : 14089 - 14111
  • [5] Search Text to Retrieve Graphs: a Scalable RDF Keyword-Based Search System
    Dosso D.
    Silvello G.
    IEEE Access, 2020, 8 : 14089 - 14111
  • [6] Towards Keyword-Based Geo-Social Group Query Services
    Zhu, Huaijie
    Liu, Wei
    Yin, Jian
    Xu, Jianliang
    Lee, Wang-Chien
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (01) : 670 - 683
  • [7] Robust keyword search in large attributed graphs
    Spencer Bryson
    Heidar Davoudi
    Lukasz Golab
    Mehdi Kargar
    Yuliya Lytvyn
    Piotr Mierzejewski
    Jaroslaw Szlichta
    Morteza Zihayat
    Information Retrieval Journal, 2020, 23 : 502 - 524
  • [8] Robust keyword search in large attributed graphs
    Bryson, Spencer
    Davoudi, Heidar
    Golab, Lukasz
    Kargar, Mehdi
    Lytvyn, Yuliya
    Mierzejewski, Piotr
    Szlichta, Jaroslaw
    Zihayat, Morteza
    INFORMATION RETRIEVAL JOURNAL, 2020, 23 (05): : 502 - 524
  • [9] Keyword-Based Diverse Image Retrieval With Variational Multiple Instance Graph
    Zeng, Yawen
    Wang, Yiru
    Liao, Dongliang
    Li, Gongfu
    Huang, Weijie
    Xu, Jin
    Cao, Da
    Man, Hong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 10528 - 10537
  • [10] Automatic Inclusion of Semantics over Keyword-Based Linked Data Retrieval
    Rahoman, Md-Mizanur
    Ichise, Ryutaro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (11): : 2852 - 2862