An adaptive path index for XML data using the query workload

Times cited: 6
Authors
Min, JK
Chung, CW
Shim, K
Affiliations
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Div Comp Sci, Taejon 305701, South Korea
[2] Seoul Natl Univ, Sch Elect Engn & Comp Sci, Seoul 151742, South Korea
Keywords
XML; semistructured data; path index; query processing
DOI
10.1016/j.is.2004.04.003
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Due to its flexibility, XML is becoming the de facto standard for exchanging and querying documents over the Web. Many XML query languages, such as XQuery and XPath, use label paths to traverse irregularly structured XML data. Without a structural summary and efficient indexes, query processing can be quite inefficient because it requires an exhaustive traversal of the XML data. To overcome this inefficiency, several path indexes have been proposed in the research community. Traditional indexes generally record all label paths from the root element of the XML data and are constructed from the data alone. Such path indexes may degrade performance because of their large size and the exhaustive navigation they require for partial-matching path queries, which start with the descendant-or-self axis ("//"). To improve query performance, we propose an adaptive path index for XML data (termed APEX). APEX does not keep all paths starting from the root; instead, it exploits the paths that are used frequently in the query workload. APEX can also be updated incrementally as the query workload changes. Experimental results with synthetic and real-life data sets confirm that APEX typically reduces query processing cost by a factor of 2 to 69 compared with traditional indexes, with the performance gap increasing with the irregularity of the XML data. © 2004 Elsevier B.V. All rights reserved.
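The idea of indexing only the label paths that occur frequently in the query workload can be illustrated with a small Python sketch. This is not the APEX algorithm or its data structures as published; it is a simplified illustration under stated assumptions: queries have the form "//a/b" with only child steps after the leading "//", a path counts as frequent when it appears as a suffix of at least `min_support` workload queries, and the index is rebuilt from scratch rather than updated incrementally. The names `label_paths`, `frequent_suffixes`, `build_index`, `lookup`, and `min_support` are hypothetical.

```python
from collections import Counter
import xml.etree.ElementTree as ET


def label_paths(root):
    """Yield (label_path, element) for every element, starting at the root."""
    stack = [((root.tag,), root)]
    while stack:
        path, elem = stack.pop()
        yield path, elem
        for child in elem:
            stack.append((path + (child.tag,), child))


def frequent_suffixes(workload, min_support):
    """Count every suffix of every query path; keep those seen often enough.

    Assumption: each query looks like "//actor/name" (child steps only).
    """
    counts = Counter()
    for query in workload:
        labels = tuple(query.strip("/").split("/"))
        for i in range(len(labels)):
            counts[labels[i:]] += 1
    return {s for s, c in counts.items() if c >= min_support}


def build_index(root, suffixes):
    """Map each frequent suffix to the elements whose label path ends with it."""
    index = {s: [] for s in suffixes}
    for path, elem in label_paths(root):
        for s in suffixes:
            if path[-len(s):] == s:
                index[s].append(elem)
    return index


def lookup(index, root, query):
    """Answer a "//..." query from the index, or fall back to a full traversal."""
    labels = tuple(query.strip("/").split("/"))
    if labels in index:
        return index[labels]  # frequent path: no traversal needed
    return [e for p, e in label_paths(root) if p[-len(labels):] == labels]
```

In this sketch, a query whose label path was frequent in the workload is answered by a single dictionary lookup, while an infrequent one still costs a full traversal. That is the trade-off the abstract describes: a smaller index tuned to the workload instead of one that records every root path.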
Pages: 467-487
Number of pages: 21