Integrating domain heterogeneous data sources using decomposition aggregation queries

被引:6
|
作者
Xu, Jian [1 ]
Pottinger, Rachel [1 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Semantic integration; Aggregation; Query optimization; LOCAL-SEARCH; ALGORITHM; DATABASES;
D O I
10.1016/j.is.2013.06.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The decomposition aggregation query (DAQ) we introduce in this paper extends semantic integration queries by allowing query translation to create aggregate queries based on the DAQ's novel three role structure. We describe the application of DAQs in integrating domain heterogeneous data sources, the new semantics of DAQ answers and the query translation algorithm called "aggregation rewriting". A central problem of optimizing DAQ processing requires determining the data sources towards which the DAQ is translated. Our source selection algorithm has cover-finding and partitioning steps which are optimized to 1. lower the processing overhead while speeding up query answering and 2. eliminate duplicates with minimal overhead. We establish connections between source selection optimizations and classic NP-hard optimizations and resolve the optimization problems with efficient solvers. We empirically study both the DAQ query translation and the source selection algorithms using real-world and synthetic data sets; the results show satisfying scalability both in size of aggregations and data sources for the query translation algorithms and the source selection algorithms save a good amount of computational resources. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:80 / 107
页数:28
相关论文
共 50 条
  • [41] Data Aggregation Using Homomorphic Encryption in Wireless Sensor Networks
    Ramotsoela, T. D.
    Hancke, G. P.
    2015 INFORMATION SECURITY FOR SOUTH AFRICA - PROCEEDINGS OF THE ISSA 2015 CONFERENCE, 2015,
  • [42] Signal Integrity Analysis of Integrated Circuits by Using Embedded Domain Decomposition Method
    Lu, Jiaqing
    Lee, Jin-Fa
    IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES, 2018, 66 (12) : 5369 - 5382
  • [43] DERIVING A NEW DOMAIN DECOMPOSITION METHOD FOR THE STOKES EQUATIONS USING THE SMITH FACTORIZATION
    Dolean, Victorita
    Nataf, Frederic
    Rapin, Gerd
    MATHEMATICS OF COMPUTATION, 2009, 78 (266) : 789 - 814
  • [44] Archaeological distribution map system using aggregate information from heterogeneous information sources
    Hayashi A.
    Hochin T.
    Nomiya H.
    International Journal of Networked and Distributed Computing, 2016, 4 (2) : 85 - 95
  • [45] Efficient and Accurate Spatial Queries Using Lossy Compressed 3D Geometry Data
    Teng, Dejun
    Li, Zhaochuan
    Peng, Zhaohui
    Ma, Shuai
    Wang, Fusheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (05) : 2472 - 2487
  • [46] Handling Data Skew for Aggregation in Spark SQL Using Task Stealing
    Zeyu He
    Qiuli Huang
    Zhifang Li
    Chuliang Weng
    International Journal of Parallel Programming, 2020, 48 : 941 - 956
  • [47] Handling Data Skew for Aggregation in Spark SQL Using Task Stealing
    He, Zeyu
    Huang, Qiuli
    Li, Zhifang
    Weng, Chuliang
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2020, 48 (06) : 941 - 956
  • [48] Literature Survey on Reliable event detection in WSN using aggregation of data
    Abraham, Jesseline Annie
    Jose, A. Felix Arokya
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 841 - 843
  • [49] Design, Aggregation and Analysis of Power Consumption Data using the Jump Process
    Yacouba, Yazid Hambally
    Diabagate, Amadou
    Babri, Michel
    Coulibaly, Adama
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (05) : 554 - 563
  • [50] Opportunistic data gathering in IoT networks using an energy-efficient data aggregation mechanism
    Afonso, Edvar
    Campista, Miguel Elias M.
    ANNALS OF TELECOMMUNICATIONS, 2024,