Language-aware Indexing for Conjunctive Path Queries

被引:1
|
作者
Sasaki, Yuya [1 ,2 ]
Fletcher, George [3 ]
Makoto, Onizuka [1 ]
机构
[1] Osaka Univ, Suita, Osaka, Japan
[2] JST PRESTO, Kawaguchi, Saitama, Japan
[3] Eindhoven Univ Technol, Eindhoven, Netherlands
来源
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022) | 2022年
关键词
Graph databases; index; bisimulation; SUBGRAPH ISOMORPHISM; XML;
D O I
10.1109/ICDE53745.2022.00054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conjunctive path queries (CPQ) are one of the most frequently used queries for complex graph analysis. However, current graph indexes are not tailored to fully support the power of query languages to express CPQs. Consequently, current methods do not take advantage of significant pruning opportunities during CPQ evaluation, resulting in poor query processing performance. We propose the CPQ-aware path index CPQx, the first path index tailored to the expressivity of CPQ. CPQx is built on the partition of the set of source-target vertex pairs of paths in a graph based on the structural notion of path-bisimulation. Path-bisimulation is an equivalence relation on paths such that each partition block induced by the relation consists of paths in the graph indistinguishable with respect to CPQs. This language-aware partitioning of the graph can significantly reduce the cost of query evaluation. We present methods to support the full index life cycle: index construction, maintenance, and query processing with our index. We also develop interest-aware CPQx to reduce index size and index construction overhead while accelerating query evaluation for queries of interest. We demonstrate through extensive experiments on 14 real graphs that our methods accelerate query processing by up to multiple orders of magnitude over the state-of-the-art methods, with smaller index sizes. Our complete C++ codebase is available as open source for further research.
引用
收藏
页码:661 / 673
页数:13
相关论文
共 50 条
  • [31] An approximation-aware algebra for XML full-text queries
    Buratti, Giacomo
    Montesi, Danilo
    ICSOFT 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL ISDM/WSEHST/DC, 2007, : 62 - +
  • [32] An Indexing Method of Continuous Spatiotemporal Queries for Stream Data Processing Rules of Detected Target Objects
    Rahman, Muhammad Habibur
    Hong, Bonghee
    Setiawan, Hari
    Lee, Sanghyun
    Lim, Dongjun
    Kim, Woochan
    SENSORS, 2021, 21 (23)
  • [33] Answering XML Queries Using Path-Based Indexes: A Survey
    Kam-Fai Wong
    Jeffrey Xu Yu
    Nan Tang
    World Wide Web, 2006, 9 : 277 - 299
  • [34] Answering XML queries using path-based indexes: A survey
    Wong, Kam-Fai
    Yu, Jeffrey Xu
    Tang, Nan
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2006, 9 (03): : 277 - 299
  • [35] Asymptotic Determinacy of Path Queries Using Union-of-Paths Views
    Francis, Nadime
    THEORY OF COMPUTING SYSTEMS, 2017, 61 (01) : 156 - 190
  • [36] Expressive Languages for Path Queries over Graph-Structured Data
    Barcelo, Pablo
    Libkin, Leonid
    Lin, Anthony W.
    Wood, Peter T.
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2012, 37 (04):
  • [37] Asymptotic Determinacy of Path Queries Using Union-of-Paths Views
    Nadime Francis
    Theory of Computing Systems, 2017, 61 : 156 - 190
  • [38] A new way to evaluate path-oriented queries in document databases
    Chen, Yangjun
    Shi, Yong
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 1, 2005, : 261 - 266
  • [39] Efficient Evaluation of Context-Free Path Queries for Graph Databases
    Medeiros, Ciro M.
    Musicante, Martin A.
    Costa, Umberto S.
    33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 1230 - 1237
  • [40] A Study of a Positive Fragment of Path Queries: Expressiveness, Normal Form and Minimization
    Wu, Yuqing
    Van Gucht, Dirk
    Gyssens, Marc
    Paredaens, Jan
    COMPUTER JOURNAL, 2011, 54 (07): : 1091 - 1118