Abstract
The continuing growth of data volume poses a major challenge for information storage. Deoxyribonucleic acid (DNA) has the inherent advantages of long lifetime, good stability, low maintenance, and high capacity, and is widely recognized as a promising natural information storage medium. In view of this, a new DNA information storage scheme is proposed. The scheme uses Raptor codes to convert binary files into DNA base sequences and, taking the structural characteristics of DNA into account, introduces a quaternary RS (Reed-Solomon) error correction code to ensure the reliability of channel transmission. In addition, a screening scheme based on GC (guanine and cytosine) content and homopolymers is proposed to reduce the difficulty and error rate of DNA synthesis and sequencing. Finally, files in different formats, such as text, images, and audio, are encoded into base sequences through the storage framework, and DNA is synthesized in biological experiments to store the information. The experimental results show that the Raptor-code-based DNA storage framework achieves an average coding efficiency of 1.49 bit per base, and that files can be recovered without error from the synthesized DNA, indicating good information storage performance.
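The GC-content and homopolymer screening mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the thresholds used, so the 40-60% GC window and the maximum run length of 3 below are assumptions chosen only for illustration, and the function names are hypothetical rather than taken from the paper.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def longest_homopolymer(seq: str) -> int:
    """Return the length of the longest run of identical bases in a non-empty seq."""
    longest = run = 1
    for prev, cur in zip(seq, seq[1:]):
        run = run + 1 if cur == prev else 1
        longest = max(longest, run)
    return longest


def passes_screen(seq: str, gc_min: float = 0.4, gc_max: float = 0.6,
                  max_run: int = 3) -> bool:
    """Keep a candidate sequence only if it meets both constraints.

    gc_min, gc_max and max_run are assumed values, not the paper's.
    """
    if not seq:
        return False
    return (gc_min <= gc_content(seq) <= gc_max
            and longest_homopolymer(seq) <= max_run)
```

With a rateless code such as Raptor, a candidate that fails the screen can simply be discarded and another encoded packet generated in its place, which is one reason fountain-style codes pair naturally with this kind of filter.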
Authors
张淑芳
彭康
Zhang Shufang; Peng Kang (School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China)
Source
《激光与光电子学进展》
CSCD
Peking University Core Journals
2020, Issue 15, pp. 161-167 (7 pages)
Laser & Optoelectronics Progress
Keywords
biotechnology
information storage
deoxyribonucleic acid
Raptor code
Reed-Solomon error correction code