Data mining-based hierarchical transaction model for multi-level consistency management in large-scale replicated databases

Cited by: 0
Authors
Mukherjee, Aradhita [1]
Chaki, Rituparna [2]
Chaki, Nabendu [1]
Affiliations
[1] Univ Calcutta, Dept Comp Sci & Engn, JD-2,Sect 3, Kolkata 700106, India
[2] Univ Calcutta, AK Choudhury Sch Informat Technol, JD-2,Sect 3, Kolkata 700106, India
Keywords
Big data; CAP; Frequent itemset mining; Association rule mining; Serializability; Snapshot isolation; Non-monotonic snapshot isolation; MapReduce
DOI
10.1016/j.csi.2020.103485
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Scalability and availability in a large-scale distributed database are determined by the consistency strategies used by the transactions. Most big data applications demand consistency and availability at the same time. However, a suitable transaction model that handles the trade-off between availability and consistency is presently lacking. In this article, we propose a hierarchical transaction model that supports multiple consistency levels for data items in a large-scale replicated database. The data items are classified into different categories based on their consistency requirement, computed using a data mining algorithm. Thereafter, they are mapped to the appropriate consistency level in the hierarchy. This allows parallel execution of several transactions belonging to each level. The topmost level, called the Serializable (SR) level, follows strong consistency and applies to data items that are frequently both read and updated. The next level of consistency, Snapshot Isolation (SI), maps to data items that are mostly read and demand non-blocking reads. Data items that are mostly updated do not follow a strictly consistent snapshot and are mapped to the next lower level, called Non-monotonic Snapshot Isolation (NMSI). The lowest level in the hierarchy corresponds to data items for which the ordering of operations does not matter. This level is called the Asynchronous (ASYNC) level. We have tested the proposed transaction model with two different workloads on a test-bed designed following the TPC-C benchmark schema. The performance of the proposed model has been evaluated against other transaction models that support a single consistency policy. The proposed model has shown promising results in terms of transaction throughput, commit rate and average latency.
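The mapping from access patterns to consistency levels described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract names a data mining step but gives no details, so the sketch substitutes a simple per-item support count. The function name `classify_items`, the `min_support` threshold, the `order_insensitive` set, and the fallback to SI for rarely accessed items are all assumptions made for illustration.

```python
from collections import Counter
from enum import Enum


class Level(Enum):
    SR = "Serializable"
    SI = "Snapshot Isolation"
    NMSI = "Non-monotonic Snapshot Isolation"
    ASYNC = "Asynchronous"


def classify_items(log, order_insensitive=frozenset(), min_support=0.3):
    """Map each data item to a consistency level from a transaction log.

    log: list of transactions, each a list of (op, item) pairs, op in {"r", "w"}.
    order_insensitive: items whose operations commute (assumed known up front).
    min_support: hypothetical threshold on the fraction of transactions that
                 read (or update) an item for it to count as "mostly" so.
    """
    n = len(log)
    reads, writes = Counter(), Counter()
    for txn in log:
        # Count each item at most once per transaction (support counting).
        reads.update({item for op, item in txn if op == "r"})
        writes.update({item for op, item in txn if op == "w"})

    levels = {}
    for item in set(reads) | set(writes):
        read_heavy = reads[item] / n >= min_support
        write_heavy = writes[item] / n >= min_support
        if item in order_insensitive:
            levels[item] = Level.ASYNC   # ordering of operations does not matter
        elif read_heavy and write_heavy:
            levels[item] = Level.SR      # frequently both read and updated
        elif write_heavy:
            levels[item] = Level.NMSI    # mostly updated
        else:
            levels[item] = Level.SI      # mostly read (assumed default)
    return levels
```

Calling `classify_items(log, order_insensitive={"counter"})` on a log of read and write operations returns a dictionary from item name to `Level`, which a scheduler could then use to route each transaction to the matching level of the hierarchy.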
Pages: 13