Fast Top-K Path-Based Relevance Query on Massive Graphs

被引:9
|
作者
Khemmarat, Samamon [1 ]
Gao, Lixin [1 ]
机构
[1] Univ Massachusetts, Dept Elect & Comp Engn, Amherst, MA 01003 USA
基金
美国国家科学基金会;
关键词
Top-k query processing; graph algorithms; path-based relevance metrics; PAGERANK; FRAMEWORK;
D O I
10.1109/TKDE.2015.2509973
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Obtaining the items highly-relevant to a given set of query items is a key task for various applications, such as recommendation and relationship prediction. A family of path-based relevance metrics, which quantify item relevance based on the paths in an item graph, have been shown to be effective in capturing the relevance in many applications. Despite their effectiveness, path-based relevance normally requires time-consuming iterative computation. We propose an approach to obtain the top-k most relevant items for a given query item set quickly. Our approach uses novel score bounds to detect the emergence of the top-k items during the computation. The approach is designed for a distributed environment, which makes it scale for massive graphs having billions of nodes. Our experimental results show that the proposed approach can provide the results up to two order of magnitudes faster than previously proposed approaches and can scale well with both the size of input and the number of machines used in the computation.
引用
收藏
页码:1189 / 1202
页数:14
相关论文
共 50 条
  • [31] Efficient and Exact Local Search for Random Walk Based Top-K Proximity Query in Large Graphs
    Wu, Yubao
    Jin, Ruoming
    Zhang, Xiang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (05) : 1160 - 1174
  • [32] A Candidate Filtering Mechanism for Fast Top-K Query Processing on Modern CPUs
    Dimopoulos, Constantinos
    Nepomnyachiy, Sergey
    Suel, Torsten
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 723 - 732
  • [33] Waves: a fast multi-tier top-k query processing algorithm
    Daoud, Caio Moura
    de Moura, Edleno Silva
    Fernandes, David
    da Silva, Altigran Soares
    Rossi, Cristian
    Carvalho, Andre
    INFORMATION RETRIEVAL JOURNAL, 2017, 20 (03): : 292 - 316
  • [34] Waves: a fast multi-tier top-k query processing algorithm
    Caio Moura Daoud
    Edleno Silva de Moura
    David Fernandes
    Altigran Soares da Silva
    Cristian Rossi
    Andre Carvalho
    Information Retrieval Journal, 2017, 20 : 292 - 316
  • [35] FastTopK: A Fast Top-K Trajectory Similarity Query Processing Algorithm for GPUs
    Mustafa, Hamza
    Leal, Eleazar
    Gruenwald, Le
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 542 - 547
  • [36] Efficient and Fast Distributed Top-k Query Protocol in Wireless Sensor Networks
    Tang, Shao-Jie
    Mao, Xufei
    Li, Xiang-Yang
    2011 19TH IEEE INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2011,
  • [37] A Distributed Index for Efficient Parallel Top-k Keyword Search on Massive Graphs
    Zhong, Ming
    Liu, Mengchi
    PROCEEDINGS OF THE TWELFTH INTERNATIONAL WORKSHOP ON WEB INFORMATION AND DATA MANAGEMENT, 2012, : 27 - 32
  • [38] Categorical top-k spatial influence query
    Yang, Jianye
    Zhang, Wenjie
    Zhang, Ying
    Wang, Xiaoyang
    Lin, Xuemin
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2017, 20 (02): : 175 - 203
  • [39] Top-k Query Processing with Conditional Skips
    Bortnikov, Edward
    Carmel, David
    Golan-Gueta, Guy
    WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 653 - 661
  • [40] Processing Spatial Keyword Query as a Top-k Aggregation Query
    Zhang, Dongxiang
    Chan, Chee-Yong
    Tan, Kian-Lee
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 355 - 364