Impacts of data consistency levels in cloud-based NoSQL for data-intensive applications

被引:0
|
作者
Ferreira, Saulo [1 ]
Mendonca, Julio [2 ]
Nogueira, Bruno [3 ]
Tiengo, Willy [3 ]
Andrade, Ermeson [1 ]
机构
[1] Univ Fed Rural Pernambuco, Recife, PE, Brazil
[2] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust SnT, Luxembourg, Luxembourg
[3] Univ Fed Alagoas, Maceio, Alagoas, Brazil
关键词
Cloud; Data consistency; Databases; NoSQL; Performance;
D O I
10.1186/s13677-024-00716-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When using database management systems (DBMSs), it is common to distribute instance replicas across multiple locations for disaster recovery and scaling purposes. To efficiently geo-replicate data, it is crucial to ensure the data and its replicas remain consistent with the same and the most up-to-date data. However, DBMSs' inner characteristics and external factors, such as the replication strategy and network latency, can affect system performance when dealing with data replication, especially when the replicas are deployed far apart from the others. Thus, it is essential to comprehend how achieving high data consistency levels in geo-replicated systems can impact systems performance. This work analyzes various data consistency settings for the widely used NoSQL DBMSs, namely MongoDB, Redis, and Cassandra. The analysis is based on real-world experiments in which DBMS nodes are deployed on cloud platforms in different locations, considering single and multiple region deployments. Based on the results of the experiments, we provide a comprehensive analysis regarding the system throughput and response time when executing reading and writing operations, pointing out scenarios where each DBMS could be better employed. Some of our findings include, for instance, that opting for strong data consistency significantly impacts Cassandra's reading operations in the single-region deployment, while MongoDB writing operations are most affected in a multi-region scenario. Additionally, all of these DBMSs exhibit statistically significant variations across all scenarios in the multi-region setup when the data consistency is switched from weak to stronger level.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] A Data Placement Strategy for Data-Intensive Cloud Storage
    Ding, Jie
    Han, Haiyun
    Zhou, Aihua
    PROGRESS IN POWER AND ELECTRICAL ENGINEERING, PTS 1 AND 2, 2012, 354-355 : 896 - 900
  • [22] Data replication techniques for data-intensive applications
    No, Jaechun
    Park, Chang Won
    Park, Sung Soon
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 4, PROCEEDINGS, 2006, 3994 : 1063 - 1070
  • [23] Analysis of Big Data for Data-Intensive Applications
    Dave, Meenu
    Gianey, Hemant Kumar
    2016 INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE), 2016,
  • [24] QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing Systems
    Lin, Jenn-Wei
    Chen, Chien-Hung
    Chang, J. Morris
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2013, 1 (01) : 101 - 115
  • [25] CLOUD BASED RESOURCE SCHEDULING METHODOLOGY FOR DATA-INTENSIVE SMART CITIES AND INDUSTRIAL APPLICATIONS
    Ma, Shiming
    Chen, Jichang
    Zhang, Yang
    Shrivastava, Anand
    Mohan, Hari
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2021, 22 (02): : 227 - 235
  • [26] Cloud-based Data-intensive Framework towards Fault Diagnosis in Large-scale Petrochemical Plants
    Huo, Zhiqiang
    Mukherjee, Mithun
    Shu, Lei
    Chen, Yuanfang
    Zhou, Zhangbing
    2016 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2016, : 1080 - 1085
  • [27] DICE: Quality-Driven Development of Data-Intensive Cloud Applications
    Casale, G.
    Ardagna, D.
    Artac, M.
    Barbier, F.
    Di Nitto, E.
    Henry, A.
    Iuhasz, G.
    Joubert, C.
    Merseguer, J.
    Munteanu, V. I.
    Perez, J. F.
    Petcu, D.
    Rossi, M.
    Sheridan, C.
    Spais, I.
    Vladusic, D.
    2015 IEEE/ACM 7TH INTERNATIONAL WORKSHOP ON MODELING IN SOFTWARE ENGINEERING, 2015, : 78 - 83
  • [28] Collaborative Optimization of Service Composition for Data-Intensive Applications in a Hybrid Cloud
    Ma, Hua
    Zhu, Haibin
    Li, Keqin
    Tang, Wensheng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (05) : 1022 - 1035
  • [29] Improvement of job completion time in data-intensive cloud computing applications
    Ibrahim Adel Ibrahim
    Mostafa Bassiouni
    Journal of Cloud Computing, 9
  • [30] Performance Evaluation of Data-Intensive Computing Applications on a Public IaaS Cloud
    Exposito, Roberto R.
    Taboada, Guillermo L.
    Ramos, Sabela
    Tourino, Juan
    Doallo, Ramon
    COMPUTER JOURNAL, 2016, 59 (03): : 287 - 307