High Net Information Density DNA Data Storage by the MOPE Encoding Algorithm

被引:11
作者
Zheng, Yanfen [1 ]
Cao, Ben [2 ]
Wu, Jieqiong [1 ]
Wang, Bin [1 ]
Zhang, Qiang [1 ]
机构
[1] Dalian Univ, Sch Software Engn, Key Lab Adv Design & Intelligent Comp, Minist Educ, Dalian 116622, Liaoning, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116024, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
DNA storage; MOPE encoding algorithm; non-payload coding set; payload encoding algorithm; APPROACHING CONSTRAINED CODES; DIGITAL INFORMATION; ERROR-CORRECTION; PID CONTROLLER; MUTATION; DESIGN; CAPACITY; GENERATION; OPTIMIZER; ROBUST;
D O I
10.1109/TCBB.2023.3263521
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
DNA has recently been recognized as an attractive storage medium due to its high reliability, capacity, and durability. However, encoding algorithms that simply map binary data to DNA sequences have the disadvantages of low net information density and high synthesis cost. Therefore, this paper proposes an efficient, feasible, and highly robust encoding algorithm called MOPE (Modified Barnacles Mating Optimizer and Payload Encoding). The Modified Barnacles Mating Optimizer (MBMO) algorithm is used to construct the non-payload coding set, and the Payload Encoding (PE) algorithm is used to encode the payload. The results show that the lower bound of the non-payload coding set constructed by the MBMO algorithm is 3%-18% higher than the optimal result of previous work, and theoretical analysis shows that the designed PE algorithm has a net information density of 1.90 bits/nt, which is close to the ideal information capacity of 2 bits per nucleotide. The proposed MOPE encoding algorithm with high net information density and satisfying constraints can not only effectively reduce the cost of DNA synthesis and sequencing but also reduce the occurrence of errors during DNA storage.
引用
收藏
页码:2992 / 3000
页数:9
相关论文
共 50 条
  • [1] Secondary structure prediction of interacting RNA molecules
    Andronescu, M
    Zhang, ZC
    Condon, A
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2005, 345 (05) : 987 - 1001
  • [2] Forward Error Correction for DNA Data Storage
    Blawat, Meinolf
    Gaedke, Klaus
    Huetter, Ingo
    Chen, Xiao-Ming
    Turczyk, Brian
    Inverso, Samuel
    Pruitt, Benjamin W.
    Church, George M.
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016), 2016, 80 : 1011 - 1022
  • [3] Bornhol J, 2016, ACM SIGPLAN NOTICES, V51, P637, DOI [10.1145/2872362.2872397, 10.1145/2954679.2872397]
  • [4] FMG: An observable DNA storage coding method based on frequency matrix game graphs
    Cao, Ben
    Shi, Peijun
    Zheng, Yanfen
    Zhang, Qiang
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [5] Adaptive coding for DNA storage with high storage density and low coverage
    Cao, Ben
    Zhang, Xiaokang
    Cui, Shuang
    Zhang, Qiang
    [J]. NPJ SYSTEMS BIOLOGY AND APPLICATIONS, 2022, 8 (01)
  • [6] Minimum Free Energy Coding for DNA Storage
    Cao, Ben
    Zhang, Xiaokang
    Wu, Jieqiong
    Wang, Bin
    Zhang, Qiang
    Wei, Xiaopeng
    [J]. IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2021, 20 (02) : 212 - 222
  • [7] Designing Uncorrelated Address Constrain for DNA Storage by DMVO Algorithm
    Cao, Ben
    Li, Xue
    Zhang, Xiaokang
    Wang, Bin
    Zhang, Qiang
    Wei, Xiaopeng
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (02) : 866 - 877
  • [8] Next-Generation Digital Information Storage in DNA
    Church, George M.
    Gao, Yuan
    Kosuri, Sriram
    [J]. SCIENCE, 2012, 337 (6102) : 1628 - 1628
  • [9] New Trends of Digital Data Storage in DNA
    De Silva, Pavani Yashodha
    Ganegoda, Gamage Upeksha
    [J]. BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [10] Image storage onto synthetic DNA
    Dimopoulou, Melpomeni
    Antonini, Marc
    Barbry, Pascal
    Appuswamy, Raja
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 97