On hierarchical clustering-based approach for RDDBS design

被引:0
作者
Hassan I. Abdalla
Ali A. Amer
Sri Devi Ravana
机构
[1] Zayed University,College of Technological Innovation
[2] Taiz University,Computer Science Department
[3] Universiti Malaya,Department of Information Systems, Faculty of Computer Science and Information Technology
来源
Journal of Big Data | / 10卷
关键词
Database; Relational DDBS; Fragmentation; Clustering; Replication; Allocation;
D O I
暂无
中图分类号
学科分类号
摘要
Distributed database system (DDBS) design is still an open challenge even after decades of research, especially in a dynamic network setting. Hence, to meet the demands of high-speed data gathering and for the management and preservation of huge systems, it is important to construct a distributed database for real-time data storage. Incidentally, some fragmentation schemes, such as horizontal, vertical, and hybrid, are widely used for DDBS design. At the same time, data allocation could not be done without first physically fragmenting the data because the fragmentation process is the foundation of the DDBS design. Extensive research have been conducted to develop effective solutions for DDBS design problems. But the great majority of them barely consider the RDDBS's initial design. Therefore, this work aims at proposing a clustering-based horizontal fragmentation and allocation technique to handle both the early and late stages of the DDBS design. To ensure that each operation flows into the next without any increase in complexity, fragmentation and allocation are done simultaneously. With this approach, the main goals are to minimize communication expenses, response time, and irrelevant data access. Most importantly, it has been observed that the proposed approach may effectively expand RDDBS performance by simultaneously fragmenting and assigning various relations. Through simulations and experiments on synthetic and real databases, we demonstrate the viability of our strategy and how it considerably lowers communication costs for typical access patterns at both the early and late stages of design.
引用
收藏
相关论文
共 71 条
[1]  
Nashat D(2018)A comprehensive taxonomy of fragmentation and allocation techniques in distributed database design ACM Comput Surv (CSUR) 51 1-25
[2]  
Amer AA(2019)A survey on data storage and placement methodologies for Cloud-Big Data ecosystem J Big Data 6 1-37
[3]  
Mazumdar S(2017)Clustering large datasets using K-means modified inter and intra clustering (KM-I2C) in Hadoop J Big Data 4 27-14
[4]  
Seybold D(2021)Managing fragmented database in distributed database environment J Math Comput Sci 7 8-886
[5]  
Kritikos K(2020)ASGOP: an aggregated similarity-based greedy-oriented approach for relational DDBSs design Heliyon 6 1-205
[6]  
Verginadis Y(2017)An optimized approach for simultaneous horizontal data fragmentation and allocation in Distributed Database Systems (DDBSs) Heliyon 3 e00487-2473
[7]  
Sreedhar C(2022)DSGA: a distributed segment-based genetic algorithm for multi-objective outsourced database partitioning Inf Sci 612 864-298
[8]  
Kasiviswanath N(2021)A hybrid method based on SA and VNS algorithms for solving DAP in DDS Comput Sci J Moldova 86 184-1134
[9]  
Chenna Reddy P(2020)SBBO based replicated data allocation approach for distributed database design Int J Eng Res Technol 13 2461-20
[10]  
Fauzi AAC(2023)Modeling and performance analysis of single-server database over quasi-static rayleigh fading channel IEEE Trans Veh Technol 2023 1-18