DNA片段拼接中基于定长特征子串的重复序列信息屏蔽方法被引量：4

Definite-sized Characteristic Substrings Based Method for the Masking-off of Repeats in DNA Fragment Assembly

下载PDF

导出

摘要包含重复序列(repeats)的DNA序列的重构是大规模DNA片段拼接所面临的实际困难之一。在考虑片段数据所隐含的位置信息的基础上,提出了一种基于定长特征子串的屏蔽片段数据中重复序列信息的方法,即在进行序列相互比对前利用独特子串标识大多数片段,从而减少可能的错误重叠,讨论了方法中几个参数的确定问题并用计算结果说明了方法的有效性。 One of the practical difficulties that remains in large-scale DNA fragment assembly is the correct reconstruction of DNA sequences including repeats. An approach based on the definite-sized characteristic substring for the masking-off of repeats is proposed after considering the relative position information contained in fragment data. Before pair-wise alignment the approach chose unique substrings to mark fragments for the sake of decrease in possible incorrect overlaps. We also concretely describes the determination of some parameters and finally presents the computational result to prove the effectiveness of the method.

作者张博锋王正华

机构地区国防科技大学并行与分布处理国家重点实验室

出处《国防科技大学学报》 EI CAS CSCD 北大核心 2002年第6期67-70,共4页 Journal of National University of Defense Technology

基金国家自然科学基金资助重点项目(69933030)

关键词重复序列信息屏蔽生物信息学片段拼接重复片段定长特征子串 DNA序列 bioinformatics fragment assembly repeats masking-off definite-sized characteristic substring

分类号 Q811.4 [生物学—生物工程]

引文网络
相关文献

参考文献7

1International Human Genome Sequencing Consortium. Initial Sequencing and Analysis of the Human Genome [J]. Nature, 2001, 409: 860-864.
2Jain M, Myers E W. Algorithms for Computing and Integrating Physical Maps Using Unique Probes [J]. Journal of Computational Biology, 1997, 4(4): 449-466.
3Setuball J C,Werneck R F. A Program for Building Contig Scaffolds in Double-barrelled Shotgun Genome Sequencing [R]. Institute of Computing Technical Report IC-01-05, Unicamp, 2001.
4Lander E S,Waterman M S. Genomic Mapping by Fingerprinting Random Clones a Mathematical Analysis [J]. Genomics, 1998, 2: 231-239.
5Kececioglu J D,Meyers E W. Combinatorial Algorithms for DNA Sequence Assembly [J]. Algorithmica, 1995, 13: 7-15.
6Allex C F. Computational Methods for Fast and Accurate DNA Fragment Assembly[D]. A Dissertation for the Degree of Doctor of Philosophy (Computer Science) at the University of Wisconsin-Madison, 1999: 83-142.
7Pevzner P A,Tang Haixu,Waterman M S. An Eulerian Path Approach to DNA Fragment Assembly [J]. Proceedings of National Academy of Sciences, 2001, 98: 9487-9753.

同被引文献57

1涂俐兰,王能超,陈莹,梅启鹏.生物序列拼接及其算法[J].生命科学研究,2003,7(S1):79-82. 被引量：3
2涂俐兰,王能超.DNA序列拼接中重复序列屏蔽的一种新方法[J].华中科技大学学报（自然科学版）,2004,32(8):107-109. 被引量：1
3方小永,骆志刚.DNA序列拼接的分布式并行处理[J].计算机工程与科学,2005,27(2):71-73. 被引量：3
4赵东升,杭兴宜,李稚锋,张成岗.军事医学科学院生物医学超级计算中心的计算资源与应用[J].军事医学科学院院刊,2005,29(4):363-367. 被引量：6
5杭兴宜,赵东升,李稚锋,翁景然,张成岗.超级刀片计算机上任务并行生物计算程序的效率优化[J].军事医学科学院院刊,2005,29(6):550-553. 被引量：1
6王磊,张祖平,陈建二.DNA片段拼接中重复序列算法研究[J].计算机科学,2006,33(7):164-166. 被引量：2
7International Human Genome Sequencing Consortium. Initial Sequencing and Analysis of the Human Genome[J]. Nature, 2001, 409(6822): 860-921.
8Kececioglu J D, Meyers E W. Combinatorial Algorithms for DNA Sequencing Assembly[J]. Algorithmica, 1995, 13(1/2): 7-15.
9Wang Jun, Wong Gane Ka-Shul. RePS: A Sequence Assembler That Masks Exact Repeats Identified from the Shotgun Data[J]. Genome Research, 2002, 12(5): 824-831.
10GenBank Database[Z]. [2008-06-10]. http://www.ncbi.nlm.nih.gov/ Genbank/index.html.

引证文献4

1王磊,张祖平,陈建二.DNA片段拼接中重复序列算法研究[J].计算机科学,2006,33(7):164-166. 被引量：2
2毛逸清,赵东升,李稚锋,杭兴宜,骆志刚,张成岗.大规模EST序列聚类的并行算法研究进展[J].军事医学科学院院刊,2006,30(6):591-595. 被引量：1
3蔡葵,杨进才.DNA片段拼接中的预归并重复序列屏蔽方法[J].计算机工程,2009,35(4):88-90. 被引量：1
4蔡葵,杨进才.基于变长子串的DNA重复序列预归并屏蔽方法[J].武汉理工大学学报（信息与管理工程版）,2012,34(1):16-19.

二级引证文献3

1蔡葵,杨进才.DNA片段拼接中的预归并重复序列屏蔽方法[J].计算机工程,2009,35(4):88-90. 被引量：1
2岳洋,王栋,郝海生,杜卫华,赵学明,朱化彬,路永强.消减cDNA文库差异表达基因检测分析方法的研究进展[J].中国畜牧兽医,2010,37(10):100-104. 被引量：1
3蔡葵,杨进才.基于变长子串的DNA重复序列预归并屏蔽方法[J].武汉理工大学学报（信息与管理工程版）,2012,34(1):16-19.

1王磊,张祖平,陈建二.DNA片段拼接中重复序列算法研究[J].计算机科学,2006,33(7):164-166. 被引量：2
2Nature:研究发现大脑多任务处理机制[J].现代生物医学进展,2015,15(34).
3邓晗嵩.黔西南州生态农业的现状及对策[J].黔西南民族师范高等专科学校学报,2005(3):81-83. 被引量：4
4钱毓,周桐.“裸奔”野骆驼昂首17年[J].法治人生,2011(9):32-33.
5彭梅,尹娜,谢天宏,葛长勇,张光明,李鸿钧,孙茂盛.重组抗菌肽基因克隆及其Pichia pastoris表达工程菌的构建[J].云南大学学报（自然科学版）,2007,29(S3):454-457.
6陈国梁,张金文,徐露,严海霞,张向前.马铃薯块茎启动子驱动的淀粉合成酶基因RNAi载体构建及遗传转化[J].食品与生物技术学报,2015,34(11):1141-1145.
7Alina Stoita,Ian D Penman,David B Williams.Review of screening for pancreatic cancer in high risk individuals[J].World Journal of Gastroenterology,2011,17(19):2365-2371. 被引量：4
8白云.消除腹部脂肪的四个步骤[J].老人世界,2013(9):50-50.
9周莉莉,陈瑞琴,陈秀兰,曾胤新,陈波.应用hiTAIL-PCR技术克隆Marinomonas sp.BSi20584菌株β-半乳糖苷酶基因[J].极地研究,2013,25(3):249-256. 被引量：3
10冯德江,徐军望,路子显,陈蕾,徐鸿林,李旭刚,朱祯.利用PCR对PVX外壳蛋白基因进行同义密码子修饰[J].生物技术通报,2003,19(4):29-32.

国防科技大学学报

2002年第6期

浏览历史

内容加载中请稍等...

DNA片段拼接中基于定长特征子串的重复序列信息屏蔽方法被引量：4

参考文献7

同被引文献57

引证文献4

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

DNA片段拼接中基于定长特征子串的重复序列信息屏蔽方法 被引量：4

参考文献7

同被引文献57

引证文献4

二级引证文献3

相关作者

相关机构

相关主题

浏览历史

DNA片段拼接中基于定长特征子串的重复序列信息屏蔽方法被引量：4