Integrating domain heterogeneous data sources using decomposition aggregation queries

被引:6
|
作者
Xu, Jian [1 ]
Pottinger, Rachel [1 ]
机构
[1] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Semantic integration; Aggregation; Query optimization; LOCAL-SEARCH; ALGORITHM; DATABASES;
D O I
10.1016/j.is.2013.06.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The decomposition aggregation query (DAQ) we introduce in this paper extends semantic integration queries by allowing query translation to create aggregate queries based on the DAQ's novel three role structure. We describe the application of DAQs in integrating domain heterogeneous data sources, the new semantics of DAQ answers and the query translation algorithm called "aggregation rewriting". A central problem of optimizing DAQ processing requires determining the data sources towards which the DAQ is translated. Our source selection algorithm has cover-finding and partitioning steps which are optimized to 1. lower the processing overhead while speeding up query answering and 2. eliminate duplicates with minimal overhead. We establish connections between source selection optimizations and classic NP-hard optimizations and resolve the optimization problems with efficient solvers. We empirically study both the DAQ query translation and the source selection algorithms using real-world and synthetic data sets; the results show satisfying scalability both in size of aggregations and data sources for the query translation algorithms and the source selection algorithms save a good amount of computational resources. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:80 / 107
页数:28
相关论文
共 50 条
  • [31] A Project Monitoring Cockpit Based On Integrating Data Sources in Open Source Software Development
    Biffl, Stefan
    Sunindyo, Wikan Danar
    Moser, Thomas
    22ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING & KNOWLEDGE ENGINEERING (SEKE 2010), 2010, : 620 - 627
  • [32] Time-Domain Aggregation of Interharmonics from Parallel Operation of Multiple Sustainable Sources and Electric Vehicles
    Ravindran, Vineetha
    Letha, Shimi Sudha
    Roennberg, Sarah
    Bollen, Math H. J.
    SUSTAINABILITY, 2025, 17 (03)
  • [33] Casimir Force for Complex Objects Using Domain Decomposition Techniques
    Atkins, Phillip R.
    Chew, Weng Cho
    Li, Mao Kun
    Sun, Lin E.
    Ma, Zu Hui
    Jiang, Li Jun
    PROGRESS IN ELECTROMAGNETICS RESEARCH-PIER, 2014, 149 : 275 - 280
  • [34] Pain research using Veterans Health Administration electronic and administrative data sources
    Abel, Erica A.
    Brandt, Cynthia A.
    Czlapinski, Rebecca
    Goulet, Joseph L.
    JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 2016, 53 (01) : 1 - 11
  • [35] Supporting retrieval of diverse biomedical data using evidence-aware queries
    Cadag, Eithon
    Tarczy-Hornoch, Peter
    JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (06) : 873 - 882
  • [36] Using Fuzzy Logic for Data Aggregation in Vehicular Networks
    Tal, Irina
    Muntean, Gabriel-Miro
    2012 IEEE/ACM 16TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT), 2012, : 151 - 154
  • [37] Semantic Integration of Heterogeneous Data Sources for Monitoring Frequent-Release Software Projects
    Biffl, Stefan
    Sunindyo, Wikan Danar
    Moser, Thomas
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2010), 2010, : 360 - 367
  • [38] Frequency domain identification using adaptive Fourier decomposition method with polynomials
    Mi, Wen
    Zheng, Wei Xing
    IET CONTROL THEORY AND APPLICATIONS, 2020, 14 (12) : 1539 - 1547
  • [39] Computing Aggregate Queries in Raster Image Databases Using Pre-Aggregated Data
    Gutierrez, Angelica Garcia
    Baumann, Peter
    WCECS 2008: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, 2008, : 201 - 206
  • [40] Hierarchical Data Aggregation Using Compressive Sensing (HDACS) in WSNs
    Xu, Xi
    Ansari, Rashid
    Khokhar, Ashfaq
    Vasilakos, Athanasios V.
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2015, 11 (03)