A Strategy of Data Synchronization in Distributed System with Read Separating from Write

被引:2
作者
Gao J. [1 ]
Liu W. [1 ]
Li Z. [1 ]
机构
[1] School of Computer Science, Northwestern Polytechnical University, Xi'an
来源
| 1600年 / Northwestern Polytechnical University卷 / 38期
关键词
Data synchronization; Distributed database; Fine granularity; Oceanbase; Read separating from write;
D O I
10.1051/jnwpu/20203810209
中图分类号
学科分类号
摘要
Read separating from write is a strategy that NewSQL adopts to incorporate the advantages of traditional relation database and NoSQL database. Under this architecture, baseline data is split into multiple partitions stored at distributed physical nodes, while delta data is stored at single transaction node. For reducing the pressure of transaction node and improving the query performance, delta data needs to be synchronized into storage nodes. The current strategies trigger the procedure of data synchronization per partition, meaning that unchanged partitions will also participate in data synchronization, which consumes extra network cost, local IO and space resources. For improving the efficiency of data synchronization meanwhile mitigating space utilization, the fine-grained data synchronization strategy is proposed, whose main idea includes that fine-grained logical partitions upon original coarse-grained partitions is established, providing more correct synchronized unit; the delta data sensing strategy is introduced, which records the mapping between changed partitions and its delta data; instead of partition driven, the data synchronization through the delta-broadcasting mechanism is driven, constraining that only changed partitions can participate in data synchronization. The fine-grained data synchronization strategy on Oceanbase is implemented, which is a distributed database with read separating from write, and the results show that our strategy is better than other strategies in efficiency of data synchronizing and space utilization. © 2020 Journal of Northwestern Polytechnical University.
引用
收藏
页码:209 / 215
页数:6
相关论文
共 19 条
  • [1] Moniruzzaman A.B.M., NewSQL: Towards Next-Generation Scalable RDBMS for Online Transaction Processing(OLTP) for Big Data Management, Computer Science, 7, 6, pp. 121-130, (2014)
  • [2] Chen J., Jindel S., Walzer R., Et al., The MemSQL Query Optimizer: a Modern Optimizer for Real-Time Analytics in a Distributed Database, Proceedings of the VLDB Endowment, 9, 13, pp. 1401-1412, (2016)
  • [3] VoltDB
  • [4] Yang Z., The Architecture of Oceanbase Relational Database System, Journal of East China Normal University, 5, pp. 141-148, (2014)
  • [5] Lin X., System, Method and Database Proxy Server for Separating Operations of Read and Write
  • [6] LevelDB
  • [7] PostgreSQL
  • [8] MySQL
  • [9] DB2
  • [10] Chang F., Dean J., Ghemawat S., Et al., Bigtable: a Distributed Storage System for Structured Data, ACM Trans on Computer Systems, 26, 2, pp. 1-26, (2008)