Fast Top-K Path-Based Relevance Query on Massive Graphs

被引:9
|
作者
Khemmarat, Samamon [1 ]
Gao, Lixin [1 ]
机构
[1] Univ Massachusetts, Dept Elect & Comp Engn, Amherst, MA 01003 USA
基金
美国国家科学基金会;
关键词
Top-k query processing; graph algorithms; path-based relevance metrics; PAGERANK; FRAMEWORK;
D O I
10.1109/TKDE.2015.2509973
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Obtaining the items highly-relevant to a given set of query items is a key task for various applications, such as recommendation and relationship prediction. A family of path-based relevance metrics, which quantify item relevance based on the paths in an item graph, have been shown to be effective in capturing the relevance in many applications. Despite their effectiveness, path-based relevance normally requires time-consuming iterative computation. We propose an approach to obtain the top-k most relevant items for a given query item set quickly. Our approach uses novel score bounds to detect the emergence of the top-k items during the computation. The approach is designed for a distributed environment, which makes it scale for massive graphs having billions of nodes. Our experimental results show that the proposed approach can provide the results up to two order of magnitudes faster than previously proposed approaches and can scale well with both the size of input and the number of machines used in the computation.
引用
收藏
页码:1189 / 1202
页数:14
相关论文
共 50 条
  • [21] Top-k Subgraph Query Based on Frequent Structure in Large-Scale Dynamic Graphs
    Shan, Xiaohuan
    Wang, Guangxiang
    Ding, Linlin
    Song, Baoyan
    Xu, Yan
    IEEE ACCESS, 2018, 6 : 78471 - 78482
  • [22] TKAP: Efficiently processing top-k query on massive data by adaptive pruning
    Han, Xixian
    Liu, Xianmin
    Li, Jianzhong
    Gao, Hong
    KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 47 (02) : 301 - 328
  • [23] TKAP: Efficiently processing top-k query on massive data by adaptive pruning
    Xixian Han
    Xianmin Liu
    Jianzhong Li
    Hong Gao
    Knowledge and Information Systems, 2016, 47 : 301 - 328
  • [24] Answering top-K query combined keywords and structural queries on RDF graphs
    Peng, Peng
    Zou, Lei
    Qin, Zheng
    INFORMATION SYSTEMS, 2017, 67 : 19 - 35
  • [25] Top-K Possible Shortest Path Query over a Large Uncertain Graph
    Zou, Lei
    Peng, Peng
    Zhao, Dongyan
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2011, 2011, 6997 : 72 - 86
  • [26] Fast Core-based Top-k Frequent Pattern Discovery in Knowledge Graphs
    Zeng, Jian
    Hou, Leong U.
    Yan, Xiao
    Han, Mingji
    Tang, Bo
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 936 - 947
  • [27] An Efficient Top-k Query Scheme Based on Multilayer Grouping
    Cui, Zongmin
    Gao, Yu
    Zhou, Caixue
    Gao, Guangyong
    Mei, Zhuolin
    Wu, Zongda
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (05): : 1339 - 1345
  • [28] A Top-k Query Algorithm for Big Data Based on MapReduce
    Lin, Xueyan
    PROCEEDINGS OF 2015 6TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE, 2015, : 982 - 985
  • [29] A sampling-based estimator for top-k selection query
    Chen, CM
    Ling, YB
    18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 617 - 627
  • [30] Top-k medical images query based on association graph
    Li, Pengyuan
    Pan, Haiwei
    Li, Qing
    Han, Qilong
    Xie, Xiaoqin
    Zhang, Zhiqiang
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (09): : 2033 - 2045