串联重复序列识别方法研究被引量：1

The Research of Methods for Identifying Tandem Repeat

下载PDF

导出

摘要非编码区重复序列分析在基因组研究中起着重要作用，其基础就是在非编码DNA序列中识别和定位所有的重复结构。重复序列识别问题在计算机科学中主要体现为字符串匹配问题。在分析了后缀树和后缀数组字符串匹配算法的基础上，详细阐述了基于后缀数组的精确串联重复序列识别方法。实验表明，该方法适合用于非编码DNA序列分析。 The repeat sequences analysis of non-coding area plays an important role in the research of genomes, its foundation is to identify and locate the periodic patterns. It addresses the method of identifying the accurate tandem repeat in detail after analyzing suffix tree and suffix array algorithms of string matching. The experiment indicates that the method adapts to non-coding DNA sequence analysis.

作者陈昌平刘自伟周文鹃彭春艳 CHEN Chang-ping, LIU Zi-wei, ZHOU Wen-juan, PENG Chun-yan （Southwest University of Science and Technology, College of Computer Science and Technology, Mianyang 621000, China）

机构地区西南科技大学计算机科学与技术学院

出处《电脑知识与技术》 2008年第11期930-931,937,共3页 Computer Knowledge and Technology

关键词串联重复序列后缀树应缀数组最大串联重复序列 tandem repeat suffix tree suffix array longest tandem repeat

分类号 TP274 [自动化与计算机技术—检测技术与自动化装置]

引文网络
相关文献

参考文献2

1王晓敏,王正志.基于后缀列的基因序列最大串联重复查找技术(英文)[J].生物信息学,2006,4(2):72-75. 被引量：4
2E. Ukkonen. On-line construction of suffix trees[J] 1995,Algorithmica(3):249～260

二级参考文献12

1[1]E.S.Lander,L.M.Linton,B.Birren,C.Nusbaum,M.C.Zody,J.Baldwin,K.Devon,and K.Dewar,et.al.Initial Sequencing and Analysis of the Human Genome[J].Nature,2001,409:860-921.
2[2]R.R.Sinden,Trinucleotide repeats biological implications of the DNA structures associated with disease-causing triplet repeats[J].Human Genetics,2000,vol.64,346-353.
3[3]Achaz,G.,Rocha,E.P.C.,Netter,P.and Coissac,E..Origin and fate of repeats in bacteria[J].Nucleic Acids Res.,2002,30:2987-2994.
4[4]Kolpakov,R.and Kucherov,G.Finding maximal repetitions in a word in linear time[R].In Proceedings of the 1999 Symposium on Foundations of Computer Science,New York (USA) 1999,596-604.
5[5]Lothaire,M.Algebraic Combinatorics on Words[M].Cambridge University Press.2002.
6[6]U.Manber and E.W.Myers.Suffix Arrays:A New Method for On -Line String Searches[J].SIAM Journal on Computing,1993,22(5):935-948.
7[7]Peter M.McIlroy and M.Douglas McIlroy,ssort.c,Source Code,1997,http://cm.bell-labs.com/cm/cs/who/doug/source.html.
8[8]T.Kasai,G.Lee,H.Arimura,S.Arikawa,and K.Park.Linear-Time Longest-Common-Prffix Computation in Suffix Arrays and its Applications[R].In Proceedings of the 12th Annual Symposium on Combinatorial Pattern Matching,pp.181-192.Lecture Notes in Computer Science 2089,Springer-Verlag,2001.
9[9]Benson,G.Tandem repeats finder:a program to analyze DNA sequences[J].Nucleic Acid Research,vol.1998,27:573-580.
10[10]Valerio Parisi,Valeria De Fonzo and Filippo Aluffi-Pentini,STRING:finding tandem repeats in DNA sequences[J].Bioinformatics 2003,19(14):1733-1738.

共引文献3

1周文鹃,刘自伟,陈昌平.基于DC3算法的非编码区序列最大串联重复识别[J].兵工自动化,2009,28(3):42-44. 被引量：1
2陆向艳,钟诚.机群系统上长序列最大串联重复识别并行算法[J].微电子学与计算机,2010,27(8):186-189. 被引量：2
3唐四薪,谭晓兰,向卓.生物信息学案例在《数据结构》教学中的应用[J].安康学院学报,2013,25(3):96-98.

同被引文献8

1庄永龙,周敏,李衍达,沈岩.人类遗传突变数据库及其应用[J].遗传,2004,26(4):514-518. 被引量：4
2尹贤贵,王小佳,张赟,潘光辉,杨琦凤.DNA分子标记及其在番茄遗传育种中的应用[J].西南农业大学学报（自然科学版）,2004,26(6):663-668. 被引量：14
3Hao Fan,Jia-You Chu.A Brief Review of Short Tandem Repeat Mutation[J].Genomics, Proteomics & Bioinformatics,2007,5(1):7-14. 被引量：9
4M T. Short Tandem Repeat-Based Identification of Individuals and Parents [J]. Croat Med J, 2001, 42(3) : 233--238.
5Gl R. Examining Coding Structure and Redundancy in DNA [J]. IEEE Engineering In Medicine And Biology Magazine, 2006, 25(1): 62 -68.
6ZHOU H, DU L, YAN H. Detection of Tandem Repeats in DNA Sequences Based on Parametric Spectral Estimation [J]. IEEE Transactions on Information Technology in Biomedicine, 2009, 13(5): 747--755.
7DIMITRIS A. Genomic Signal Processing [J].IEEE Signal Processing Magazine, 2001, 18(4) : 8--20.
8余敏,马明星,卢培,吴松锜,任永富,谭德勇.一种简便的STR重复次数测定方法[J].云南大学学报（自然科学版）,2008,30(5):519-525. 被引量：1

引证文献1

1王嘉,田逢春,王世元,刘晓.一种定位DNA序列短串联重复的办法[J].西南大学学报（自然科学版）,2011,33(9):126-130.

1王晓敏,王正志.基于后缀列的基因序列最大串联重复查找技术(英文)[J].生物信息学,2006,4(2):72-75. 被引量：4
2常国锋.浅析编写计算机程序的三种结构[J].电子制作,2015,23(2Z).
3郑启华.PASCAL语言讲座(三)[J].电脑爱好者,1998(11):29-31.
4骆嘉伟,颜军,何海峰.基于YKW图形表达的人类基因短编码序列识别[J].计算机应用,2011,31(8):2087-2091.
5杨纪青,陈洪萍.Tobacco Yellow dwarf Virus完整基因组上串联重复序列分布[J].数字技术与应用,2010,28(9):91-92.
6毛家顺,张汝波,杨大伟.基于TLD改进的自动人体检测与实时跟踪算法[J].微型机与应用,2015,34(22):47-49. 被引量：1
7生物信息专用计算机[J].光学精密机械,2003(3):35-36.
8王旺.数字指纹技术研究进展[J].中国新技术新产品,2015(12):12-12.
9苹果Xserve进军生物工程[J].数字技术与应用,2005(4):7-7.
10黄浩锋,肖南峰.基于组稀疏表示的医学图像超分辨率重建[J].计算机科学,2015,42(S1):151-153 189. 被引量：6

电脑知识与技术

2008年第11期

浏览历史

内容加载中请稍等...

串联重复序列识别方法研究被引量：1

参考文献2

二级参考文献12

共引文献3

同被引文献8

引证文献1

相关作者

相关机构

相关主题

浏览历史

串联重复序列识别方法研究 被引量：1

参考文献2

二级参考文献12

共引文献3

同被引文献8

引证文献1

相关作者

相关机构

相关主题

浏览历史

串联重复序列识别方法研究被引量：1