MR-MVPP: A map-reduce-based approach for creating MVPP in data warehouses for big data applications

被引:9
作者
Azgomi, Hossein [1 ]
Sohrabi, Mohammad Karim [1 ]
机构
[1] Islamic Azad Univ, Semnan Branch, Dept Comp Engn, Semnan, Iran
关键词
Data warehouse; Materialized view selection; Multiple view processing plan; Big data; Map-reduce; MATERIALIZED VIEW SELECTION; DATA-STORAGE; OPTIMIZATION; ALGORITHM; QUERIES;
D O I
10.1016/j.ins.2021.04.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Materialized view selection (MVS) is the problem of selecting an appropriate set of views to be materialized to speed up analytical query processing of data warehouses. Online analytical processing (OLAP) of queries is an essential application of the MVS problem, in which, the response times of the queries are reduced by storing the selected views. Views are intermediate results of query processing and are selected in the MVS problem to be stored and will then be exploited in answering process of several queries. Views are usually organized as a view representation structure in the MVS problem. Multiple Views Processing Plan (MVPP) is a standard structure used for view representation in the MVS problem. Due to the tremendous amount of data, constructing the MVPP is a challenge in the big data applications. The MR-MVPP (Map-Reduce-based construction of the MVPP) is the proposed method of this paper to address this problem. The MR-MVPP performs a set similarity join (similarity-based join) on the base relations and views using the map-reduce model and the hashing technique. The MVPP construction time in the proposed method is reduced by avoiding redundant calculations in the process of creating the MVPP. The performance of the proposed method is empirically evaluated. According to the results of the experiments, the execution time of the MR-MVPP method is better than the other methods. The average time improvement is about 26.5 units. This improvement is better than the other similar researches in this area and is significant due to the high volume of data in real applications. Moreover, the proposed method works well in terms of the effectiveness of the created MVPP and has about a 50% coverage rate for view selection methods. Deterministic methods are more accurate than hashing methods and can be utilized for set similarity join as future work to probably improve the effectiveness of the constructed MVPP. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:200 / 224
页数:25
相关论文
共 68 条
[1]  
Abramov V. E., 2013, 14th Scientific Conference on the "theory and practice of the struggle against parasitic diseases", Moscow, Russia, 21-23 May 2013, P14
[2]  
[Anonymous], 2005, Building the Data Warehouse
[3]   Data mining-based materialized view and index selection in data warehouses [J].
Aouiche, Kamel ;
Darmont, Jerome .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2009, 33 (01) :65-93
[4]  
Arasu A., 2006, P 32 INT C VERY LA, P918
[5]   A novel coral reefs optimization algorithm for materialized view selection in data warehouse environments [J].
Azgomi, Hossein ;
Sohrabi, Mohammad Karim .
APPLIED INTELLIGENCE, 2019, 49 (11) :3965-3989
[6]   A game theory based framework for materialized view selection in data warehouses [J].
Azgomi, Hossein ;
Sohrabi, Mohammad Karim .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 71 :125-137
[7]   Urban data management system: Towards Big Data analytics for Internet of Things based smart urban environment using customized Hadoop [J].
Babar, Muhammad ;
Arif, Fahim ;
Jan, Mian Ahmad ;
Tan, Zhiyuan ;
Khan, Fazlullah .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 :398-409
[8]   A scalable approach to model big and interacted queries for materialized view through data mining [J].
Betouati, Fatiha ;
Rahal, Sid Ahmed .
MULTIAGENT AND GRID SYSTEMS, 2019, 15 (02) :137-154
[9]  
Chaves LeonardoWeiss F., 2009, Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 09, P1088
[10]   Big Data: A Survey [J].
Chen, Min ;
Mao, Shiwen ;
Liu, Yunhao .
MOBILE NETWORKS & APPLICATIONS, 2014, 19 (02) :171-209