Concurrent Processing of Increments in Online Integration of Semi-structured Data

被引:0
|
作者
Handoko [1 ]
Getta, Janusz R. [1 ]
机构
[1] Univ Wollongong, Sch Comp Sci & Software Engn, Wollongong, NSW 2522, Australia
关键词
Data integration; dynamic scheduling; distributed database; semi-structured data; XML DATA INTEGRATION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
An online integration system enables incremental computation shortly after an increment data arrived at the central site. Processing increments serially ensures all data containers are in their updated states for computation of the next increment data. In general, a data container may show up as several arguments in a data integration expression. Serial processing of increments at this data container failed to show its best performance due to expensive IO costs for materialization updates. This paper proposes an online integration system with dynamic scheduling to enable concurrent processing of increments of data. The online integration system allows a series of transformation of a data integration expression into a single increment expression upon the increments of multiple data containers, and generates a data integration plan. The dynamic scheduling system employs a monitoring system and a priority scheduling which is able to dynamically change the data integration plans according to the increment data behavior.
引用
收藏
页码:289 / 294
页数:6
相关论文
共 50 条
  • [41] Compressed materialised views of semi-structured data
    Gourlay, Richard
    Tripney, Brian
    Wilson, John
    WORKSHOPS OF THE TWENTY FOURTH BRITISH NATIONAL CONFERENCE ON DATABASES, WORKSHOP PROCEEDINGS, 2007, : 75 - +
  • [42] Supporting structured, semi-structured and unstructured data in digital libraries
    Sánchez, JA
    Proal, C
    Maldonado-Naude, F
    PROCEEDINGS OF THE FIFTH MEXICAN INTERNATIONAL CONFERENCE IN COMPUTER SCIENCE (ENC 2004), 2004, : 368 - 375
  • [43] Querying semi-structured data with graph grammars
    Furfaro, F
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, PROCEEDINGS, 2002, : 288 - 293
  • [44] List data extraction in semi-structured document
    Xu, H
    Li, JZ
    Xu, P
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 584 - 585
  • [45] A semi-monad for semi-structured data - (ICDT version)
    Fernandez, M
    Simeon, J
    Wadler, P
    DATABASE THEORY - ICDT 2001, PROCEEDINGS, 2001, 1973 : 263 - 300
  • [46] Resolving Data Interoperability in Ubiquitous Health Profile using semi-structured storage and processing
    Satti, Fahad Ahmed
    Khan, Wajahat Ali
    Lee, Ganghun
    Khattak, Asad Masood
    Lee, Sungyoung
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 762 - 770
  • [47] OLERA: OnLine extraction rule analysis for semi-structured documents
    Chang, CH
    Kuo, SC
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, VOLS 1AND 2, 2004, : 736 - 742
  • [48] Tool for extracting semi-structured data to a big data load
    Furtado, Joao Carlos
    Bulsing, Gabriel Merten
    Kroth, Eduardo
    Benitez Nara, Elpidio Oscar
    Kipper, Liane Malhmann
    REVISTA BRASILEIRA DE COMPUTACAO APLICADA, 2015, 7 (03): : 43 - 52
  • [49] A strategy for data storage and the search for semi-structured data in the Web
    do Nascimento, C. A. S. A.
    Ebecken, N. F. F.
    Rosa, J. L. dos A.
    DATA MINING X: DATA MINING, PROTECTION, DETECTION AND OTHER SECURITY TECHNOLOGIES, 2009, 42 : 51 - +
  • [50] Multilevel Data Storage Model of Fuzzy Semi-Structured Data
    Yants, V. I.
    Chernov, A. V.
    Butakova, M. A.
    Klimanskaya, E. V.
    2015 XVIII International Conference on Soft Computing and Measurements (SCM), 2015, : 112 - 114