Impacts of data consistency levels in cloud-based NoSQL for data-intensive applications

Cited by: 0
Authors
Ferreira, Saulo [1]
Mendonca, Julio [2]
Nogueira, Bruno [3]
Tiengo, Willy [3]
Andrade, Ermeson [1]
Affiliations
[1] Univ Fed Rural Pernambuco, Recife, PE, Brazil
[2] Univ Luxembourg, Interdisciplinary Ctr Secur Reliabil & Trust SnT, Luxembourg, Luxembourg
[3] Univ Fed Alagoas, Maceio, Alagoas, Brazil
Keywords
Cloud; Data consistency; Databases; NoSQL; Performance;
DOI
10.1186/s13677-024-00716-7
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
When using database management systems (DBMSs), it is common to distribute instance replicas across multiple locations for disaster recovery and scaling purposes. To geo-replicate data efficiently, it is crucial that the data and its replicas remain consistent and reflect the most up-to-date state. However, DBMSs' internal characteristics and external factors, such as the replication strategy and network latency, can affect system performance when dealing with data replication, especially when the replicas are deployed far apart from one another. Thus, it is essential to understand how achieving high data consistency levels in geo-replicated systems can impact system performance. This work analyzes various data consistency settings for three widely used NoSQL DBMSs: MongoDB, Redis, and Cassandra. The analysis is based on real-world experiments in which DBMS nodes are deployed on cloud platforms in different locations, considering single-region and multi-region deployments. Based on the results of the experiments, we provide a comprehensive analysis of system throughput and response time when executing read and write operations, pointing out scenarios where each DBMS could be better employed. Our findings include, for instance, that opting for strong data consistency significantly impacts Cassandra's read operations in the single-region deployment, while MongoDB's write operations are the most affected in a multi-region scenario. Additionally, all of these DBMSs exhibit statistically significant variations across all scenarios in the multi-region setup when the data consistency is switched from a weak to a stronger level.
Pages: 16