XPEDIA: XML Processing for Data Integration

被引:4
|
作者
Bhide, Manish [1 ]
Agarwal, Manoj K. [1 ]
Bar-Or, Amir [2 ]
Padmanabhan, Sriram [2 ]
Mittapalli, Srinivas K. [3 ]
Venkatachaliah, Girish [3 ]
机构
[1] IBM India Res Lab, New Delhi, India
[2] IBM Software Grp, Armonk, NY USA
[3] IBM Software Grp, Bangalore, Karnataka, India
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2009年 / 2卷 / 02期
关键词
D O I
10.14778/1687553.1687559
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data Integration engines increasingly need to provide sophisticated processing options for XML data. In the past, it was adequate for these engines to support basic shredding and XML generation capabilities. However, with the steady growth of XML in applications and databases, integration platforms need to provide more direct operations on XML as well as improve the scalability and efficiency of these operations. In this paper, we describe a robust and comprehensive framework for performing Extract-Transform-Load (ETL) of XML. This includes (i) full computational model and engine capabilities to perform these operations in an ETL flow, (ii) an approach to pushing down XML operations into a database engine capable of supporting XML processing, and (iii) methods to apply partitioning techniques to provide scalable, parallel processing for large XML documents. We describe experimental results showing the effectiveness of these techniques.
引用
收藏
页码:1330 / 1341
页数:12
相关论文
共 50 条
  • [1] XML processing and data integration with XQuery
    Robie, Jonathan
    IEEE INTERNET COMPUTING, 2007, 11 (04) : 62 - 67
  • [2] XML and data integration
    Bertino, E
    Ferrari, E
    IEEE INTERNET COMPUTING, 2001, 5 (06) : 75 - 76
  • [3] Integration of XML data
    Saccol, DD
    Heuser, CA
    EFFICIENCY AND EFFECTIVENESS OF XML TOOLS AND TECHNIQUES AND DATA INTEGRATION OVER THE WEB, 2003, 2590 : 68 - 80
  • [4] XJ: Integration of XML processing into Java
    Harren, Matthew
    Burke, Michael
    Raghavachari, Mukund
    Sarkar, Vivek
    Shmueli, Oded
    Bordawekar, Rajesh
    Thirteenth Int. World Wide Web Conf. Proc. WWW, (1072-1073):
  • [5] XML data integration with identification
    Poggi, A
    Abiteboul, S
    DATABASE PROGRAMMING LANGUAGES, 2005, 3774 : 106 - 121
  • [6] AN IMPLEMENTATION OF XML DATA INTEGRATION
    Pan, Weidong
    Liu, Jixue
    Tian, Jiashen
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL DISI: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2008, : 111 - 116
  • [7] XML, bioinformatics and data integration
    Achard, F
    Vaysseix, G
    Barillot, E
    BIOINFORMATICS, 2001, 17 (02) : 115 - 125
  • [8] XML scheme directory:: A data structure for XML data processing
    Kotsakis, E
    Böhm, K
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL I, 2000, : 62 - 69
  • [9] Integration of chemical data using XML
    Bachrach, SM
    SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2002, 13 (3-4) : 381 - 390
  • [10] XML data integration in OGSA Grids
    Comito, C
    Talia, D
    DATA MANAGEMENT IN GRIDS, 2005, 3836 : 4 - 15