Distributed SLCA-Based XML Keyword Search by Map-Reduce

被引:0
作者
Zhang, Chenjing [1 ,2 ]
Ma, Qiang [2 ]
Wang, Xiaoling [3 ]
Zhou, Aoying [2 ,3 ]
机构
[1] Shanghai Ocean Univ, Coll Informat Technol, Shanghai, Peoples R China
[2] Fudan Univ, Sch Comp Sci & Technol, Shanghai, Peoples R China
[3] East China Normal Univ, Inst Software Engn, Shanghai Key Lab Trustworthy Comp, Shanghai, Peoples R China
来源
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS | 2010年 / 6193卷
关键词
SLCA; keyword search; XML; distributed system;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large scales of XML information comes continually from new Web applications, and SLCA (Smallest Lowest Common Ancestor)-based XML keyword search is one of the most important information retrieval approaches. Previous approaches focus on building index for XML documents. However in information dissemination scenario, it is impossible to build index in advance for continuous XML document streams. This paper addresses SLCA-based keyword search for continuous XML documents by Map-Reduce mechanism. We use parallel algorithms to process plenty of XML documents in Hadoop environment. A distributed SLCA computation method is designed, where each net node computes SLCA independently and just a little information needs be transmitted. A real Hadoop environment is built and we demonstrate the efficiency of our algorithms analytically and experimentally.
引用
收藏
页码:386 / +
页数:2
相关论文
共 12 条
[1]  
[Anonymous], 2003, Proceedings of the 2003 ACM SIGMOD international conference on Management of data
[2]  
[Anonymous], P ACM SIGMOD INT C M
[3]  
[Anonymous], 2004, Proceedings of the Thirtieth international conference on Very Large Databases-Volume
[4]  
Bremer J.M., 2003, International Workshop on the Web and Databases (WebDB), P73
[5]   Navigation- vs. index-based XML multi-query processing [J].
Bruno, N ;
Gravano, L ;
Koudas, N ;
Srivastava, D .
19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, :139-150
[6]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[7]  
Gong XQ, 2005, PROC INT CONF DATA, P890
[8]  
Machdi I., 2008, P 10 INT C INF INT W, P137
[9]  
Sun C., 2007, WWW, P1043
[10]  
Wang WY, 2009, LECT NOTES COMPUT SC, V5463, P496, DOI 10.1007/978-3-642-00887-0_44