Data quality-based view selection in big data integration system

被引:0
作者
Anter S. [1 ]
机构
[1] Faculty of Science and Technology of Mohammedia, Hassan II University of Casablanca
关键词
big data; data integration; data quality; materialised views; view selection;
D O I
10.1504/IJBIDM.2023.133139
中图分类号
学科分类号
摘要
An integration system is an intermediate tool between a user and a set of distributed sources. It provides transparent access to information through an interface using a unique query language. This provides an illusion to the end user as if it is accessing a homogeneous central repository. In a hybrid system, one part of the data is queried on demand whereas another part is extracted, filtered and stored in a local database. This approach is very much promising for data access in the big data context. However, obtaining satisfactory results depends on the correct choice of data to materialise. Further this task is even more difficult in the big data context. In this article, a novel approach has been proposed to overcome the above problem which uses data quality to select views that will be materialised. © 2023 Inderscience Enterprises Ltd.
引用
收藏
页码:264 / 276
页数:12
相关论文
共 41 条
[1]  
Anassin C.M.P., Boko A.K.A., Babri M., Optimisation de la qualité des données appliquée au domuine du marketing, Afrique Science, 16, 6, pp. 126-134, (2020)
[2]  
Anter S., Zellou A., Idri A., Personalization of a hybrid integration system: creation of views to materialize based on the distribution of user queries, 2012 IEEE International Conference on Complex Systems (ICCS), pp. 1-7, (2012)
[3]  
Anter S., Zellou A., Idri A., The hybrid integration system towards a new approach for creating candidate views for materialization, 2013 5th International Conference on Computer Science and Information Technology, pp. 201-209, (2013)
[4]  
Anter S., Zellou A., Idri A., MATHIS: a new approach for creating views to materialize in a hybrid integration system, Journal IRECOS, International Review on Computers and Software, 8, 3, pp. 816-825, (2013)
[5]  
Anter S., Zellou A., Idri A., Retrieving and materializing data in hybrid mediators, International Journal of Applied Engineering Research, 11, 3, pp. 2128-2134, (2016)
[6]  
Ariza-Jimenez L., Villa L.F., Pinel N., Quintero O.L., Extracted information quality, a comparative study in high and low dimensions, International Journal of Business Intelligence and Data Mining, 19, 2, pp. 214-241, (2021)
[7]  
Ashish N., Knoblock C.A., Shahabi C., Selectively materializing data in mediators by analyzing user queries, Proceedings Fourth IFCIS International Conference on Cooperative Information Systems. CoopIS 99 (Cat. No. PR00384), pp. 256-266, (1999)
[8]  
Ashish N., Knoblock C.A., Shahabi C., Selectively materializing data in mediators by analyzing source structure, query distribution and maintenance cost, Proceedings of the 2nd International Workshop on Web Information and Data Management, pp. 33-37, (1999)
[9]  
Ashish N., Optimizing Information Mediators by Selectively Materializing Data, (2000)
[10]  
Barnes T.J., Big data, little history, Dialogues in Human Geography, 3, 3, pp. 297-302, (2013)