An adaptive path index for XML data using the query workload

Cited by: 6
Authors
Min, JK
Chung, CW
Shim, K
Affiliations
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Div Comp Sci, Taejon 305701, South Korea
[2] Seoul Natl Univ, Sch Elect Engn & Comp Sci, Seoul 151742, South Korea
Keywords
XML; semistructured data; path index; query processing
DOI
10.1016/j.is.2004.04.003
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Due to its flexibility, XML is becoming the de facto standard for exchanging and querying documents over the Web. Many XML query languages, such as XQuery and XPath, use label paths to traverse irregularly structured XML data. Without a structural summary and efficient indexes, query processing can be quite inefficient because it requires an exhaustive traversal of the XML data. To overcome this inefficiency, several path indexes have been proposed in the research community. Traditional indexes generally record all label paths from the root element of the XML data and are constructed from the data alone. Such path indexes may degrade performance because of their large size and the exhaustive navigation needed for partial-matching path queries, which start with the descendant-or-self axis ("//"). To improve query performance, we propose an adaptive path index for XML data (termed APEX). APEX does not keep all paths starting from the root; instead, it exploits the paths that are frequently used in the query workload. APEX also has the useful property that it can be updated incrementally as the query workload changes. Experimental results with synthetic and real-life data sets confirm that APEX typically reduces query processing cost by a factor of 2 to 69 compared with traditional indexes, with the performance gap increasing with the irregularity of the XML data. (c) 2004 Elsevier B.V. All rights reserved.
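The idea of selecting frequently used paths from a query workload can be illustrated with a minimal sketch. It is not the APEX construction algorithm from the paper: the function name `frequent_label_paths`, the `min_support` parameter, the sample workload, and the choice to count every contiguous label subpath are all assumptions made for illustration.

```python
from collections import Counter


def frequent_label_paths(workload, min_support):
    """Return contiguous label subpaths whose support meets min_support.

    workload: list of queries, each a list of element labels (hypothetical format).
    min_support: fraction of queries (0..1) that must contain a subpath.
    """
    counts = Counter()
    for query in workload:
        labels = tuple(query)
        # Collect each distinct contiguous subpath once per query.
        seen = set()
        for i in range(len(labels)):
            for j in range(i + 1, len(labels) + 1):
                seen.add(labels[i:j])
        counts.update(seen)
    threshold = min_support * len(workload)
    return {path: n for path, n in counts.items() if n >= threshold}


# Hypothetical workload of partial-matching queries such as //actor/name.
workload = [
    ["actor", "name"],
    ["movie", "title"],
    ["actor", "name"],
    ["director", "movie", "title"],
]
frequent = frequent_label_paths(workload, min_support=0.5)
```

With this sample workload, subpaths such as `("actor", "name")` and `("movie", "title")` meet the threshold, while `("director", "movie")` does not. An index built along these lines would keep entries only for the retained paths and could recompute them when the workload changes.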
Pages: 467-487
Page count: 21