AN EFFICIENT APPROACH FOR VIEW SELECTION FOR DATA WAREHOUSE USING TREE MINING AND EVOLUTIONARY COMPUTATION

被引:0
|
作者
Thakare, Atul [1 ]
Deshpande, Parag [1 ]
机构
[1] Visvesvaraya Natl Inst Technol, Comp Sci & Engn Dept, South Ambazari Rd, Nagpur 440010, Maharashtra, India
来源
COMPUTER SCIENCE-AGH | 2018年 / 19卷 / 04期
关键词
database management systems; data warehousing and data mining; query optimization; graph mining; algorithms for parallel computing; evolutionary computations; genetic algorithms;
D O I
10.7494/csci.2018.19.4.3006
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The selection of a proper set of views to materialize plays an important role in database performance. There are many methods of view selection that use different techniques and frameworks to select an efficient set of views for materialization. In this paper, we present a new efficient scalable method for view selection under the given storage constraints using a tree mining approach and evolutionary optimization. The tree mining algorithm is designed to determine the exact frequency of (sub)queries in the historical SQL dataset. The Query Cost model achieves the objective of maximizing the performance benefits from the final view set that is derived from the frequent view set given by the tree mining algorithm. The performance benefit of a query is defined as a function of query frequency, query creation cost, and query maintenance cost. The experimental results show that the proposed method is successful in recommending a solution that is fairly close to an optimal solution.
引用
收藏
页码:431 / 455
页数:25
相关论文
共 50 条
  • [31] Efficient approaches for materialized views selection in a data warehouse
    Hung, Ming-Chuan
    Huang, Man-Lin
    Yang, Don-Lin
    Hsueh, Nien-Lin
    INFORMATION SCIENCES, 2007, 177 (06) : 1333 - 1348
  • [32] Materialized view selection based on query cost in data warehouse
    Zhou, LJ
    Liu, C
    Liu, D
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY VI, 2004, 5433 : 246 - 252
  • [33] Complex view selection for data warehouse self-maintainability
    Theodoratos, D
    COOPERATIVE INFORMATION SYSTEMS, PROCEEDINGS, 2000, 1901 : 78 - 89
  • [34] Decision tree underfitting in mining of gene expression data. An evolutionary multi-test tree approach
    Czajkowski, Marcin
    Kretowski, Marek
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 137 : 392 - 404
  • [35] An evolutionary computation-based approach for feature selection
    Fateme Moslehi
    Abdorrahman Haeri
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 3757 - 3769
  • [36] An evolutionary computation-based approach for feature selection
    Moslehi, Fateme
    Haeri, Abdorrahman
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (09) : 3757 - 3769
  • [37] Power optimization of wind turbines with data mining and evolutionary computation
    Kusiak, Andrew
    Zheng, Haiyang
    Song, Zhe
    RENEWABLE ENERGY, 2010, 35 (03) : 695 - 702
  • [38] An efficient data mining method for learning Bayesian networks using an evolutionary algorithm-based hybrid approach
    Wong, ML
    Leung, KS
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2004, 8 (04) : 378 - 404
  • [39] Accommodating dimension hierarchies in a data warehouse view/index selection scheme
    Ezeife, CI
    SYSTEMS DEVELOPMENT METHODS FOR THE NEXT CENTURY, 1997, : 195 - 211
  • [40] A tree comparison approach to detect changes in data warehouse structures
    Eder, J
    Koncilia, C
    Wiggisser, K
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2005, 3589 : 1 - 10