XPEDIA: XML Processing for Data Integration

被引:4
|
作者
Bhide, Manish [1 ]
Agarwal, Manoj K. [1 ]
Bar-Or, Amir [2 ]
Padmanabhan, Sriram [2 ]
Mittapalli, Srinivas K. [3 ]
Venkatachaliah, Girish [3 ]
机构
[1] IBM India Res Lab, New Delhi, India
[2] IBM Software Grp, Armonk, NY USA
[3] IBM Software Grp, Bangalore, Karnataka, India
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2009年 / 2卷 / 02期
关键词
D O I
10.14778/1687553.1687559
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data Integration engines increasingly need to provide sophisticated processing options for XML data. In the past, it was adequate for these engines to support basic shredding and XML generation capabilities. However, with the steady growth of XML in applications and databases, integration platforms need to provide more direct operations on XML as well as improve the scalability and efficiency of these operations. In this paper, we describe a robust and comprehensive framework for performing Extract-Transform-Load (ETL) of XML. This includes (i) full computational model and engine capabilities to perform these operations in an ETL flow, (ii) an approach to pushing down XML operations into a database engine capable of supporting XML processing, and (iii) methods to apply partitioning techniques to provide scalable, parallel processing for large XML documents. We describe experimental results showing the effectiveness of these techniques.
引用
收藏
页码:1330 / 1341
页数:12
相关论文
共 50 条
  • [31] OLAP query processing for XML data in RDBMS
    Kit, Chantola
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    2007 IEEE INTERNATIONAL WORKSHOP ON DATABASES FOR NEXT GENERATION RESEARCHERS, 2007, : 7 - +
  • [32] Dynamic Labelling Scheme for XML Data Processing
    Duong, Maggic
    Zhang, Yanchun
    On the Move to Meaningful Internet Systems: OTM 2008, Pt II, Proceedings, 2008, 5332 : 1183 - 1199
  • [33] Web/XML data management and query processing
    Zhou, AY
    Zheng, SH
    Qian, WN
    WORLD WIDE WEB TECHNOLOGIES IN CHINA: RESEARCH, DEVELOPMENT, AND APPLICATIONS, 2002, : 95 - 115
  • [34] Natural XML for data binding, processing, and persistence
    Thiruvathukal, GK
    Läufer, K
    COMPUTING IN SCIENCE & ENGINEERING, 2004, 6 (02) : 86 - 92
  • [35] A query processing architecture for an XML data warehouse
    Wiwatwattana, Nuwee
    Jagadish, H. V.
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1513 - +
  • [36] Efficient grouping and ordering processing on XML data
    Chang, Ya-Hui
    Huang, Chih-Chung
    Chien, Po-Hsien
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2012, 35 (06) : 697 - 709
  • [37] A Review on Utilising XML as the Mediated Layer for Data Integration
    Awadallah, Bahaaeldin M. H.
    Haw, Su-Cheng
    Soon, Lay-Ki
    ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1191 - 1195
  • [38] Peer-to-peer data integration with active XML
    Milo, T
    ADVANCES IN COMPUTER SCIENCE - ASIAN 2005, PROCEEDINGS: DATA MANAGEMENT ON THE WEB, 2005, 3818 : 11 - 18
  • [39] Research on the data integration of archaeological digital museum with XML
    Liu, Shi-Jun
    Meng, Xiang-Xu
    Xiang, Hui
    Xitong Fangzhen Xuebao / Journal of System Simulation, 2002, 14 (12):
  • [40] Meta modeling approach for XML based data integration
    Song, OY
    Yi, H
    ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, PROCEEDINGS, 2006, 3842 : 112 - 116