An adaptive path index for XML data using the query workload

Cited by: 6
Authors
Min, JK
Chung, CW
Shim, K
Affiliations
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Div Comp Sci, Taejon 305701, South Korea
[2] Seoul Natl Univ, Sch Elect Engn & Comp Sci, Seoul 151742, South Korea
Keywords
XML; semistructured data; path index; query processing
DOI
10.1016/j.is.2004.04.003
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Due to its flexibility, XML is becoming the de facto standard for exchanging and querying documents over the Web. Many XML query languages, such as XQuery and XPath, use label paths to traverse irregularly structured XML data. Without a structural summary and efficient indexes, query processing can be quite inefficient because it requires an exhaustive traversal of the XML data. To overcome this inefficiency, several path indexes have been proposed in the research community. Traditional indexes generally record all label paths from the root element of the XML data and are constructed from the data alone. Such path indexes may degrade performance because of their large size and the exhaustive navigation needed for partial-matching path queries, which start with the descendant-or-self axis ("//"). To improve query performance, we propose an adaptive path index for XML data (termed APEX). APEX does not keep all paths starting from the root; instead, it exploits the paths that are frequently used in the query workload. APEX can also be updated incrementally as the query workload changes. Experimental results with synthetic and real-life data sets confirm that APEX typically improves query processing cost by a factor of 2 to 69 compared with traditional indexes, with the performance gap increasing with the irregularity of the XML data. © 2004 Elsevier B.V. All rights reserved.
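The idea of indexing only workload-frequent paths can be illustrated with a small Python sketch. It is not the APEX structure from the paper, only a simplified illustration under stated assumptions: queries are pure label paths of the form `//a/b`, a path counts as frequent when it appears at least `min_support` times in the workload, and the index is a plain dictionary from a label-path suffix to the matching elements. The names `frequent_paths`, `build_index`, `query`, and `min_support` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import Counter, defaultdict

DOC = """
<lib>
  <book><title>A</title><author><name>X</name></author></book>
  <book><title>B</title></book>
  <journal><title>C</title><editor><name>Y</name></editor></journal>
</lib>
"""

def parse_path(q):
    """Turn a partial-matching query such as '//book/title' into a label tuple."""
    return tuple(q.lstrip("/").split("/"))

def frequent_paths(workload, min_support):
    """Label paths that occur at least min_support times in the workload (assumed criterion)."""
    counts = Counter(parse_path(q) for q in workload)
    return {p for p, c in counts.items() if c >= min_support}

def build_index(root, paths):
    """Map each given path to the elements whose root-to-node label path ends with it."""
    index = defaultdict(list)
    max_len = max((len(p) for p in paths), default=0)

    def visit(node, labels):
        labels = labels + (node.tag,)
        # Check every suffix of the current label path, up to the longest indexed path.
        for k in range(1, min(max_len, len(labels)) + 1):
            suffix = labels[-k:]
            if suffix in paths:
                index[suffix].append(node)
        for child in node:
            visit(child, labels)

    visit(root, ())
    return index

def query(root, index, q):
    """Answer from the index when the path is indexed; otherwise traverse the whole document."""
    key = parse_path(q)
    if key in index:
        return index[key]
    return build_index(root, {key}).get(key, [])  # exhaustive fallback

root = ET.fromstring(DOC.strip())
workload = ["//book/title", "//book/title", "//name", "//editor/name"]
index = build_index(root, frequent_paths(workload, min_support=2))
titles = [e.text for e in query(root, index, "//book/title")]
names = [e.text for e in query(root, index, "//name")]
```

Here only `//book/title` is frequent enough to be indexed, so that query is answered by a dictionary lookup, while `//name` falls back to a full traversal. Rebuilding the index from a newer workload stands in for the incremental update described in the abstract; the sketch does not attempt to update incrementally.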
Pages: 467-487
Page count: 21
Related papers
50 in total
  • [41] Top-Down Keyword Query Processing on XML Data
    Zhou, Junfeng
    Zhao, Xingmin
    Wang, Wei
    Chen, Ziyang
    Yu, Jeffrey Xu
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 2225 - 2230
  • [42] Research on Heterogeneous Data Query and Sharing Mode Based on XML
    Sang Yaqun
    Mu Qi
    FIFTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2013), 2013, 8878
  • [43] A New Sequence-Based Approach for XML Data Query
    Li, Wen
    Yang, Jin
    Sun, Gaofeng
    Yue, Sen
    PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT INFORMATION PROCESSING, 2013, 256 : 661 - 670
  • [44] Design of Heterogeneous Data Source Query System Based on XML
    Zhu Rongrong
    Mu Qi
    Li Zhanli
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON TECHNOLOGY MANAGEMENT AND INNOVATION (TMI 2010), 2010, : 95 - 97
  • [45] Data storage practices and query processing in XML databases: A survey
    Haw, Su-Cheng
    Lee, Chien-Sing
    KNOWLEDGE-BASED SYSTEMS, 2011, 24 (08) : 1317 - 1340
  • [46] Query processing optimization in broadcasting XML data in mobile communications
    Shekarriz, Mohsen
    Babamir, Seyed Morteza
    Mirabi, Meghdad
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (06) : 5354 - 5380
  • [47] Query Optimization for Complex Path Queries on Data
    Wang, Hongzhi
    Li, Jianzhong
    Liu, Xianmin
    Luo, Jizhou
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2009, 5463 : 389 - 404
  • [48] Query processing optimization in broadcasting XML data in mobile communications
    Mohsen Shekarriz
    Seyed Morteza Babamir
    Meghdad Mirabi
    The Journal of Supercomputing, 2021, 77 : 5354 - 5380
  • [49] Efficient processing of XML path queries based on BI index
    Hu, Xiangyu
    Mo, Yunyin
    Zhang, Haiwei
    Yuan, Xiaojie
    ADVANCED RESEARCH ON MECHANICAL ENGINEERING, INDUSTRY AND MANUFACTURING ENGINEERING, PTS 1 AND 2, 2011, 63-64 : 119 - 123
  • [50] Efficient Relaxed XML Path Query Matching based on Extended Dewey Labeling Scheme
    Chen, Zhe
    SECOND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN, VOL 1, PROCEEDINGS, 2009, : 531 - 534