Concurrent Processing of Increments in Online Integration of Semi-structured Data

被引:0
|
作者
Handoko [1 ]
Getta, Janusz R. [1 ]
机构
[1] Univ Wollongong, Sch Comp Sci & Software Engn, Wollongong, NSW 2522, Australia
关键词
Data integration; dynamic scheduling; distributed database; semi-structured data; XML DATA INTEGRATION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
An online integration system enables incremental computation shortly after an increment data arrived at the central site. Processing increments serially ensures all data containers are in their updated states for computation of the next increment data. In general, a data container may show up as several arguments in a data integration expression. Serial processing of increments at this data container failed to show its best performance due to expensive IO costs for materialization updates. This paper proposes an online integration system with dynamic scheduling to enable concurrent processing of increments of data. The online integration system allows a series of transformation of a data integration expression into a single increment expression upon the increments of multiple data containers, and generates a data integration plan. The dynamic scheduling system employs a monitoring system and a priority scheduling which is able to dynamically change the data integration plans according to the increment data behavior.
引用
收藏
页码:289 / 294
页数:6
相关论文
共 50 条
  • [31] Managing Semantic Evolutions in Semi-Structured Data
    Siqueira Nepomuceno, Pedro Ivo
    Braghetto, Kelly Rosa
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2023, PT I, 2023, 14146 : 179 - 185
  • [32] An Overview on Supervised Semi-structured Data Classification
    Zhang, Lijun
    Li, Ning
    Li, Zhanhuai
    2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2021,
  • [33] A partition index for XML and semi-structured data
    Kim, J
    Kim, HJ
    DATA & KNOWLEDGE ENGINEERING, 2004, 51 (03) : 349 - 368
  • [34] Schema Matching for Semi-structured and Linked Data
    Kettouch, Mohamed
    Luca, Cristina
    Hobbs, Mike
    2017 11TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2017, : 270 - 271
  • [35] On Repairing Structural Problems In Semi-structured Data
    Korn, Flip
    Saha, Barna
    Srivastava, Divesh
    Ying, Shanshan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (09): : 601 - 612
  • [36] Dimensions of ignorance in a semi-structured data model
    Magnani, M
    Montesi, D
    15TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2004, : 933 - 937
  • [37] INFOTABS: Inference on Tables as Semi-structured Data
    Gupta, Vivek
    Mehta, Maitrey
    Nokhiz, Pegah
    Srikumar, Vivek
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2309 - 2324
  • [38] Parameterized Mapping Distances for Semi-Structured Data
    Shin, Kilho
    Niiyama, Taro
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2018, 2019, 11352 : 443 - 466
  • [39] Building web warehouse for semi-structured data
    Mohania, M
    DATA & KNOWLEDGE ENGINEERING, 2001, 39 (02) : 101 - 103
  • [40] Storing semi-structured data on disk drives
    Bhadkamkar, Medha
    Farfan, Fernando
    Hristidis, Vagelis
    Rangaswami, Raju
    ACM Transactions on Storage, 2009, 5 (02) : 1 - 35