期刊文献+

个体单体型问题参数化算法研究 被引量:4

Research on Parameterized Algorithms of the Individual Haplotyping Problem
下载PDF
导出
摘要 个体单体型问题指如何利用个体DNA测序片断数据,根据不同的优化准则确定该个体单体型的计算问题.因为技术上的限制,DNA测序实验中能直接测定的片断长度是有限的,一个片断所覆盖的最大SNP位点数k1通常小于10;出于时间和金钱的考虑,覆盖一个SNP位点的最大片断数k2也不是很大,通常约为10左右;与要测定的单体型SNP位点总数n及所测序的DNA片断总数m相比,k1和k2均很小.在此基础上,文中对个体单体型问题最少SNP位点删除MSR和最少片段删除MFR模型进行了参数化,提出了时间复杂度分别为O(nk1k2+mlogm+mk1)和O(mk22+mk1k2+mlogm+nk2)求解无空隙MSR和MFR的精确算法.和Bafna等提出的时间复杂度为O(mn2)和O(m2n+m3)的精确算法相比,文中的算法效率提高了很多,具有较高的实用价值. The Individual Haplotyping Problem is the computing problem of inducing an individual's haplotypes based on several optimal criteria from one's DNA sequence fragment data. Limited by current sequencing techniques, the maximum length of a fragment sequenced directly by DNA sequencers is not large, and the maximum number k1 of SNP sites covered by a fragment is usually smaller than 10. In order to save money and time, the maximum number k2 of fragments covering a SNP site is also small and about 10. Compared with the number n of SNPs of interest and the number m of fragments, k1 and k2 are both very small. Based on the above fact, this paper parameterizes MSR(Minimum SNP Removal) and MFR(Minimum Fragment Removal) models of the individual haplotyping problem, and introduces the corresponding exact algorithms to solve the gapless MSR and MFR problems in the time complexity O(nk1 k2 +m log m+mk1) and O(mk2^2+mk1k2+mlogm+nk2) respectively. Compared with Bafna et al. 's algorithms of time complexity O(mn^2) and O(m^2n+m^3) respectively, the algorithms are more efficient and applicable.
出处 《计算机学报》 EI CSCD 北大核心 2009年第8期1637-1650,共14页 Chinese Journal of Computers
基金 国家自然科学基金(60773111) 国家"九七三"重点基础研究发展规划前期研究专项基金(2008CB317107) 长江学者和创新团队发展计划(IRT0661) 湖南省自然科学基金(09JJ3116) 中国博士后科学基金 中南大学博士后科学基金资助~~
关键词 单核苷酸多态性 单体型 参数化算法 最少SNP位点删除 最少片断删除 SNPs (Single-Nucleotide Polymorphisms) haplotype parameterized algorithm Minimum SNP Removal (MSR) Minimum Fragment Removal (MFR)
  • 相关文献

参考文献10

  • 1http://www.ornl.gov/sci/techresources/Human-Genome/publicat/primer/prim1.html#3 .
  • 2http://www.ebiotrade.com/custom/amersham/Mega-BACE.htm .
  • 3http://www.hapmap.org/downloads/phasing/2006-07-phaseII/ .
  • 4Lancia G,Bafna V,Istrail S,Lippert R,Schwartz R.SNPs problems,complexity and algorithms[].Proceedings of theth Annual European Symposium on Algorithms(ESA).2001
  • 5Panconesi A,Sozio M.Fast hare:Afast heuristic for single individual SNP haplotype reconstruction[].Proceedings of theth International Workshop on Algorithms in Bioinformatics(WABI).2004
  • 6Wang Rui-Sheng,Wu Ling-Yun,Li Zhen-Ping,Zhang Xiang-Sun.Haplotype reconstruction from SNP fragments by minimumerror correction[].Bioinformatics.2005
  • 7Hüffner F.Algorithmengineering for optimal graph biparti-zation[].Proceedings of theth International Workshop on Experimental and Efficient Algorithms(WEA).2005
  • 8Ansorge W,Voss H,Wirkner U et al.Automated sanger DNAsequencing with onelabel inless thanfour lanes on gel[].Journal of Biochemistry.1989
  • 9Myers G.Adataset generator for whole genome shotgun se-quencing[].Proceedings of thethInternational Conference on Intelligent Systems for Molecular Biology(ISMB).1999
  • 10Kruglyak L,Nickerson DA.Variation is the spice of life[].Nature Genetics.2001

同被引文献34

  • 1陈垚亮,洪骥,崔万云,肖仰华.BWA Plus:一个基于频繁序列的下一代基因比对工具[J].计算机研究与发展,2011,48(S3):391-394. 被引量:2
  • 2罗四维,赵连伟.基于谱图理论的流形学习算法[J].计算机研究与发展,2006,43(7):1173-1179. 被引量:76
  • 3葛宏伟,梁艳春.基于隐马尔可夫模型和免疫粒子群优化的多序列比对算法[J].计算机研究与发展,2006,43(8):1330-1336. 被引量:9
  • 4Levy S,Sutton G,Ng P C,et al.The diploid genome sequence of an individual human[J].PLoS Biology,2007,5(10).
  • 5Tachmazidou I,Verzilli C J,Iorio M D.Genetic association mapping via evolution-based clustering of haplotypes[J].PLoS Genet,2007,3(7).
  • 6Zhang X S,Wang R S,Wu L Y,et al.Models and algorithms for haplotyping problem[J].Current Bioinformatics,2006,1(1):105-114.
  • 7Lancia G,Bafna V,Istrail S,et al.SNPs problems,com plexity and algorithms[C]//Heide F M.LNCS 2161:Proc of the 9th Ann European Symp on Algorithms.Heidelberg:Springer,2001:182-193.
  • 8Cilibrasi R,Iersel L,Kelk S,et al.The complexity of the single individual SNP haplotyping pProblem[J].Algorithmica,2007,49(1):13-36.
  • 9Wang R S,Wu L Y,Li Z P,et al.Haplotype reconstruction from SNP fragments by minimum error correction[J].Bioinformatics,2005,21 (10):2456-2462.
  • 10Panconesi A.Sozio M.Fast hare:a fast heuristic for single individual SNP haplotype reconstruction[C]//Jonassen I,Kim J.LNCS 3240:Proc of the 4th Int'l Workshop on Algorithms in Bioin-formatics.Heidelberg:Springer,2004:266-277.

引证文献4

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部