Caching Techniques for XML Message Filtering

Cited by: 0
Authors
Cao, Yang [1]
Majumdar, Shikharesh [2]
Lung, Chung-Horng [2]
Affiliations
[1] Carleton Univ, Sch Comp Sci, Ottawa, ON K1S 5B6, Canada
[2] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON, Canada
Source
2009 IEEE 28TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCC 2009) | 2009
Keywords
publish/subscribe; XML; caching; performance evaluation
DOI
10.1109/PCCC.2009.5403839
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
An XML publish/subscribe system filters streams of XML messages against a large number of subscriptions expressed in XPath. Performance is a major issue for such systems: as the number of XML documents and XPath-based subscriptions grows, filtering efficiently becomes a challenging problem, and optimization techniques are urgently needed. Many approaches to designing efficient XML filtering engines exist. Most existing research focuses on efficient filtering algorithms that achieve high system performance or support more complex XPath syntax, and each proposed scheme has its advantages and limitations. Little research, however, has considered caching in the context of XML filtering. In this paper, we propose two caching schemes to be used in conjunction with an XML filtering engine. First, we present a complete message caching algorithm, a strict caching policy that reduces the computation cost incurred by filtering the same message multiple times by reusing the results of previously processed messages. Second, we investigate a structure-based caching method, an approximate caching policy for messages that share the same structure. Performance evaluations on both synthetic and real data show that the complete message caching and structure-based caching schemes achieve significantly better filtering performance (up to 80% for both caching schemes on the message streams used in the experiments).
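The two caching policies named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record does not describe their data structures, so the cache keys, the `CachedFilter` class, the `toy_filter` engine, and the `SUBSCRIPTIONS` table are all hypothetical names chosen for illustration. The sketch assumes that a complete message cache can be keyed by a digest of the exact message text, and that a structure-based cache can be keyed by the element-tag skeleton with text and attributes ignored.

```python
import hashlib
import xml.etree.ElementTree as ET
from typing import Callable, Dict, FrozenSet

# Hypothetical subscriptions: name -> ElementTree path (a small XPath subset).
SUBSCRIPTIONS = {
    "s1": "./item/price",
    "s2": ".//author",
}


def toy_filter(xml_text: str) -> FrozenSet[str]:
    """Stand-in for a real filtering engine: names of matching subscriptions."""
    root = ET.fromstring(xml_text)
    return frozenset(
        name for name, path in SUBSCRIPTIONS.items() if root.find(path) is not None
    )


def message_key(xml_text: str) -> str:
    """Assumed key for complete message caching: digest of the exact text."""
    return hashlib.sha256(xml_text.encode("utf-8")).hexdigest()


def structure_key(xml_text: str) -> str:
    """Assumed key for structure-based caching: tag skeleton only."""
    def skeleton(elem: ET.Element) -> str:
        return "(" + elem.tag + "".join(skeleton(child) for child in elem) + ")"
    return skeleton(ET.fromstring(xml_text))


class CachedFilter:
    """Wraps a filtering engine and reuses results for messages with equal keys."""

    def __init__(self, engine: Callable[[str], FrozenSet[str]],
                 key_fn: Callable[[str], str]) -> None:
        self.engine = engine
        self.key_fn = key_fn
        self.cache: Dict[str, FrozenSet[str]] = {}
        self.hits = 0
        self.misses = 0

    def filter(self, xml_text: str) -> FrozenSet[str]:
        key = self.key_fn(xml_text)
        if key in self.cache:
            self.hits += 1          # reuse the previously computed result
            return self.cache[key]
        self.misses += 1            # fall back to the underlying engine
        result = self.engine(xml_text)
        self.cache[key] = result
        return result
```

With `message_key`, only byte-identical messages share a cache entry, so results are exact. With `structure_key`, two messages that differ only in text content share an entry; that is exact for purely structural subscriptions like the ones above, but it can return a stale result if a subscription tests element values, which is one way to read the abstract's description of the policy as approximate.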
Pages: 315 / +
Page count: 2