Key Elements Tracing Method for Parallel XML Parsing in Multi-core System

被引:0
作者
Li, Xiaosong [1 ]
Wang, Hao [1 ]
Liu, Taoying [1 ]
Li, Wei [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
来源
2009 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT 2009) | 2009年
关键词
XML parsing; parallel; multi-core; key element tracing;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Though XML is applied intensively in a lot of applications, XML parsing is not practical in many fields because of its poor performance. Parallel XML parsing on multi-core is a promising choice. Previous methods all adopt data parallel approach on XML parsing. As the semi-structured nature of XML, they were obliged to divide the data into well-formed XML chunks and then parse these chunks parallel. The division process is named as preparsing. As the preparsing is serial, it becomes the bottleneck of parallel XML parsing. Related work Simultaneous Finite Transducer (SFTXP) :parallelized the preparsing stage. It maintained multiple preparser results for each equal sized chunk according to enumerated all possible parsing states. In spite of finite states for each XML, the overhead by SFTXP is tremendous, including CPU time and memory for multiple results generating and storing, respectively. In this work, we address parallel XML parsing by Key Element Parse Tracing (KEPT) method which parallelizes the preparsing and parsing at element level. It remolds the preparsing as a key element extracting process and schedules the processing of key elements in 1:he framework of KEPT. Then parsing process is parallelized as a whole. To demonstrate the effectiveness, we implement it on libxml2 and obtain good scalability on both an 8-core Linux machine and an 8-core 24 SMT Sun machine running Solaris.
引用
收藏
页码:439 / 444
页数:6
相关论文
共 21 条
[1]  
[Anonymous], DBLP XML RECORDS
[2]  
APPARAO P, 2004, ICPP 04 MONTR QUEB C
[3]  
BERGER ED, 2000, ASPLOS 9, P117
[4]   Investigating the limits of SOAP performance for scientific computing [J].
Chiu, K ;
Govindaraju, M ;
Bramley, R .
11TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, PROCEEDINGS, 2002, :246-254
[5]   WORKING SET MODEL FOR PROGRAM BEHAVIOR [J].
DENNING, PJ .
COMMUNICATIONS OF THE ACM, 1968, 11 (05) :323-&
[6]  
Ding C., 2003, Proceedings of ACM SIGPLAN Conference on Programming Language Design and Implementation, P245, DOI DOI 10.1145/781131.781159
[7]  
DING JJ, 2007, ICPP 07 XI AN CHIN S
[8]  
HEAD MR, 2007, WORKSH SERV OR COMP, P17
[9]  
Kostoulas MG., 2006, P 15 INT C WORLD WID, P93, DOI DOI 10.1145/1135777.1135796
[10]  
KULKARNI M, 2008, ASPLOS 08 SEATTL WA