Analysis of similarity of DNA sequences based on sample entropy
Abstract: To address the shortcomings of traditional methods for analyzing DNA sequence similarity, this paper proposes a similarity analysis method based on sample entropy. The gene sequences of five Buthus martensii Karsch (East Asian scorpion) neurotoxins are used as the test set. Each DNA sequence is first converted into a time series through its graphical representation. The sample entropy of each time series is then computed, and the mutual value between the sample entropies of different sequences serves as the measure of their similarity. Finally, the results are compared with those of the DTW (Dynamic Time Warping) distance method. The experiments indicate that the sample entropy method analyzes sequence similarity effectively and, compared with the DTW method, shows higher sensitivity and better discrimination, so it can be applied further to the analysis of biological sequences.
Source: 《智能计算机与应用》 (Intelligent Computer and Applications), 2016, No. 1, pp. 101-103 (3 pages)
Keywords: sample entropy; DNA sequence; sequence similarity; DTW distance
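
The pipeline described in the abstract (DNA sequence to time series, then sample entropy, then a pairwise comparison) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the purine/pyrimidine walk in `dna_to_series` stands in for the paper's graphical representation, which the abstract does not specify; the defaults `m=2` and `r=0.2` are common choices rather than values taken from the paper; and `entropy_gap` interprets the "mutual value" as an absolute difference of sample entropies, which is also an assumption. All function names are hypothetical.

```python
import math


def dna_to_series(seq):
    """Map a DNA string to a numeric series.

    Assumption: a cumulative purine/pyrimidine walk (A, G -> +1; C, T -> -1).
    The paper's actual graphical representation is not given in the abstract.
    """
    step = {"A": 1, "G": 1, "C": -1, "T": -1}
    series, level = [], 0
    for base in seq.upper():
        level += step[base]
        series.append(level)
    return series


def sample_entropy(series, m=2, r=0.2):
    """Sample entropy SampEn(m, r) = -ln(A / B).

    B counts pairs of length-m templates whose Chebyshev distance is within
    the tolerance; A counts the same for length m + 1. Self-matches are
    excluded. The tolerance is r times the standard deviation of the series.
    """
    n = len(series)
    if n <= m + 1:
        raise ValueError("series is too short for the chosen m")
    mean = sum(series) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in series) / n)
    tol = r * std

    def count_matches(length):
        # Use the same n - m starting points for both template lengths.
        count = 0
        for i in range(n - m):
            for j in range(i + 1, n - m):
                if all(abs(series[i + k] - series[j + k]) <= tol
                       for k in range(length)):
                    count += 1
        return count

    b = count_matches(m)
    a = count_matches(m + 1)
    if a == 0 or b == 0:
        return math.inf  # undefined when no template pairs match
    return -math.log(a / b)


def entropy_gap(seq_a, seq_b, m=2, r=0.2):
    """Assumed dissimilarity: absolute difference of the two sample entropies."""
    ea = sample_entropy(dna_to_series(seq_a), m, r)
    eb = sample_entropy(dna_to_series(seq_b), m, r)
    return abs(ea - eb)
```

Under this reading, a smaller `entropy_gap` would indicate more similar sequences. The quadratic template comparison is adequate for gene-length inputs but would need a faster neighbor search for much longer sequences.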