Multi-Objective Big Data View Materialization Using NSGA-II

Cited by: 2
Authors
Kumar, Akshay [1]
Kumar, T. V. Vijay [2]
Affiliations
[1] Jawaharlal Nehru Univ, Sch Comp & Syst Sci, New Delhi, India
[2] Jawaharlal Nehru Univ, New Delhi, India
Keywords
Big Data; Multi-Objective Optimization; NSGA-II; View Materialization; Algorithm; Internet
DOI
10.4018/IRMJ.2021040101
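Chinese Library Classification (CLC) number
G25 [Library Science, Librarianship]; G35 [Information Science, Information Work]
Discipline classification code
1205; 120501
Abstract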
Big data views, in the context of a distributed file system (DFS), are defined over voluminous structured, semi-structured and unstructured data, with the purpose of reducing the response time of queries over Big data. As the size of semi-structured and unstructured data in Big data is very large compared to structured data, a framework based on query attributes on Big data can be used to identify Big data views. Materializing Big data views can improve query response time and facilitate efficient distribution of data over a DFS-based application. Since not all Big data views can be materialized, a subset of them should be selected for materialization. The purpose of view selection for materialization is to improve query response time subject to resource constraints. The Big data view materialization problem was defined as a bi-objective problem with two objectives, minimization of the query evaluation cost and minimization of the update processing cost, subject to a constraint on the total size of the materialized views. This problem is addressed in this paper using the multi-objective genetic algorithm NSGA-II. The experimental results show that the proposed NSGA-II based Big data view selection algorithm is able to select reasonably good quality views for materialization.
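The bi-objective formulation in the abstract can be illustrated with a small sketch. This is not the paper's algorithm or cost model: the `View` fields, the additive cost functions, `BASE_QUERY_COST`, and the sample numbers are all hypothetical. It enumerates every subset of views that fits a size limit and keeps the non-dominated ones. This first Pareto front is what NSGA-II approximates with non-dominated sorting and crowding distance when the search space is too large to enumerate.

```python
from itertools import combinations
from typing import NamedTuple


class View(NamedTuple):
    """A candidate view; all fields are hypothetical illustration values."""
    name: str
    size: int          # storage needed if the view is materialized
    query_saving: int  # assumed reduction in query evaluation cost
    update_cost: int   # assumed cost of keeping the view up to date


BASE_QUERY_COST = 100  # assumed cost of answering all queries with no views

# Hypothetical sample data, not taken from the paper.
EXAMPLE_VIEWS = [
    View("v1", size=4, query_saving=30, update_cost=5),
    View("v2", size=3, query_saving=20, update_cost=10),
    View("v3", size=5, query_saving=40, update_cost=8),
]


def objectives(subset):
    """Return (query evaluation cost, update processing cost); both are minimized.

    Savings are assumed to be additive, which is a simplification.
    """
    query_cost = BASE_QUERY_COST - sum(v.query_saving for v in subset)
    update_cost = sum(v.update_cost for v in subset)
    return (query_cost, update_cost)


def dominates(a, b):
    """True if a is no worse than b in every objective and better in at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(views, size_limit):
    """Enumerate feasible subsets and keep those no other feasible subset dominates."""
    feasible = []
    for r in range(len(views) + 1):
        for subset in combinations(views, r):
            # Constraint: total size of the materialized views.
            if sum(v.size for v in subset) <= size_limit:
                feasible.append((subset, objectives(subset)))
    return [
        (subset, f)
        for subset, f in feasible
        if not any(dominates(g, f) for _, g in feasible)
    ]
```

Calling `pareto_front(EXAMPLE_VIEWS, size_limit=8)` returns the trade-off set between the two costs. Exhaustive enumeration grows exponentially with the number of views, which is the motivation for an evolutionary method such as NSGA-II.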
Pages: 1-28
Number of pages: 28