Forward Error Correction for DNA Data Storage

被引:237
作者
Blawat, Meinolf [1 ]
Gaedke, Klaus [1 ]
Huetter, Ingo [1 ]
Chen, Xiao-Ming [1 ]
Turczyk, Brian [2 ]
Inverso, Samuel [2 ]
Pruitt, Benjamin W. [2 ]
Church, George M. [2 ]
机构
[1] Technicolor Res & Innovat, Hannover, Germany
[2] Harvard Med Sch, Wyss Inst, Boston, MA 02115 USA
来源
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE 2016 (ICCS 2016) | 2016年 / 80卷
关键词
Bio-technology; DNA; DNA synthesis and sequencing; digital data storage; data preservation; archiving;
D O I
10.1016/j.procs.2016.05.398
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We report on a strong capacity boost in storing digital data in synthetic DNA. In principle, synthetic DNA is an ideal media to archive digital data for very long times because the achievable data density and longevity outperforms today's digital data storage media by far. On the other hand, neither the synthesis, nor the amplification and the sequencing of DNA strands can be performed error-free today and in the foreseeable future. In order to make synthetic DNA available as digital data storage media, specifically tailored forward error correction schemes have to be applied. For the purpose of realizing a DNA data storage, we have developed an efficient and robust forward-error-correcting scheme adapted to the DNA channel. We based the design of the needed DNA channel model on data from a proof-of-concept conducted 2012 by a team from the Harvard Medical School [1]. Our forward error correction scheme is able to cope with all error types of today's DNA synthesis, amplification and sequencing processes, e.g. insertion, deletion, and swap errors. In a successful experiment, we were able to store and retrieve error-free 22 MByte of digital data in synthetic DNA recently. The found residual error probability is already in the same order as it is in hard disk drives and can be easily improved further. This proves the feasibility to use synthetic DNA as long-term digital data storage media.
引用
收藏
页码:1011 / 1022
页数:12
相关论文
共 13 条
[1]  
[Anonymous], 2015, DATA STORAGE ETERNIT
[2]  
[Anonymous], AURAL INFORM
[3]  
[Anonymous], 2012, NEXT GENERATION SEQU
[4]  
[Anonymous], AURAL INFORM
[5]  
Bertone Paul, 2013, NATURE
[6]  
Buermans H.P.J., 2014, NEXT GENERATION SEQU
[7]  
Church George., 2012, Regenesis
[8]  
Church M., 2012, NEXT GENERATION DIGI
[9]  
Freudenberger Jiirgen, 2007, CODING THEORY
[10]  
Gray J., 2005, MSRTR2005166