摘要
针对传统方法在分析DNA序列相似性方面的不足,提出了一种基于样本熵的DNA序列相似性分析方法。以5种东亚钳蝎神经毒素的基因序列作为分析对象,首先通过DNA序列的图形表示把DNA序列转换为时间序列,然后运用样本熵算法计算出时间序列的样本熵值,将样本熵的互值大小作为分析序列之间相似性的依据,最后将样本熵方法与DTW(Dynamic Time Warping,动态时间弯曲)方法的实验结果进行比较。实验结果表明,样本熵分析方法能有效分析序列之间的相似性,与DTW分析方法相比较,显示出更强的相似性和区别度,可将其进一步应用于生物序列的分析。
This paper studies the application of sample entropy for similarity analysis of DNA sequences. The gene sequences of five kinds of Buthus martensi Karsch neurotoxins are analyzed. The graphical representation of DNA sequences are converted into digital sequences,and their sample entropy are calculated based on sample entropy method. The mutual value between different sample entropy is used to analysis sequence similarity. Analysis result is compared with the method of DTW distance. The analysis result of the proposed method provides good analysis efficiency and higher sensitivity and distinction than the results of DTW distance method. The method of sample entropy can be used for further biological sequences analysis.
出处
《智能计算机与应用》
2016年第1期101-103,共3页
Intelligent Computer and Applications