A Twig Join algorithm for a Query with ID References

被引:0
作者
Li, Dong [1 ]
Zhao, Lin [2 ]
Li, Jing [3 ]
机构
[1] SCUT, Sch Software Engn, Guangzhou, Guangdong, Peoples R China
[2] SCUT, Sch Comp Sci & Technol, Guangzhou, Guangdong, Peoples R China
[3] SCUT, Sch Light Ind & Food Sci, Guangzhou, Guangdong, Peoples R China
来源
2014 ASIA-PACIFIC SERVICES COMPUTING CONFERENCE (APSCC) | 2014年
基金
中国国家自然科学基金;
关键词
XML; graph; Twig join; ID/IDREF;
D O I
10.1109/APSCC.2014.26
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
ID/IDREF feature makes XML document model become graph structure rather than tree structure, while traditional Twig join algorithms can just process simple queries without ID references. Those queries with ID references often involve attribute node or predicates with expressions which do not exist in traditional Twig pattern, so it is necessary to design the Twig join algorithm for the implement of queries involving ID references. There are several typical Twig join algorithms like Twig(2)Stack, TwigList, TwigMix. Twig(2)Stack use over-complicated data structures with large memory overhead. TwigList uses simple lists but lack efficient filtering of useless elements. TwigMix simply introduces the getNext() function into TwigList to avoid manipulation of useless elements for the ancestor-descendent (AD) relationship in the stack and lists, but it will filter some useful elements when process the queries involving attribute node or predicates within expressions. To this end, we propose a new algorithm, called TwigExpand, which can process the queries involving attribute node or predicates within expressions by avoiding the manipulation of useless elements for both the parent-child (PC) relationship and AD relationship. In addition, TwigGraph is proposed by expanding TwigExpand, which can process the queries involving ID references, and it's much faster than binary structural join proved by experimental study.
引用
收藏
页码:81 / 87
页数:7
相关论文
共 12 条
  • [1] Structural joins: A primitive for efficient XML query pattern matching
    Al-Khalifa, S
    Jagadish, HV
    Koudas, N
    Patel, JM
    Srivastava, D
    Wu, YQ
    [J]. 18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 141 - 152
  • [2] [Anonymous], 2008, EXTENSIBLE MARKUP LA
  • [3] Bruno N., 2002, P 2002 ACM SIGMOD IN, P310, DOI DOI 10.1145/564691.564727
  • [4] Chen S., 2006, P VERY LARGE DATA BA, P283
  • [5] Dietz P.F., 1982, P 14 ANN ACM S THEOR, P122, DOI DOI 10.1145/800070.802184
  • [6] Design and Implementation of XSQS System
    Hong, Lukai
    Li, Dong
    Gu, Ning
    [J]. 2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 3, PROCEEDINGS, 2009, : 385 - +
  • [7] Li J, 2008, LECT NOTES COMPUT SC, V5181, P523
  • [8] Morgenthal J., 2000, XML J, V1
  • [9] QIN L, 2007, P 12 TH INT C DAT, V4443, P850
  • [10] Schmidt A., 2002, Proceedings of the Twenty-eighth International Conference on Very Large Data Bases, P974