摘要
针对现有串联重复序列识别方法存在的计算量大、灵敏度低等问题,提出一种基于频谱分析的串联重复序列识别方法。该方法采用碱基的电子离子相互作用势作为基因序列数字化表示的方法,通过对数字序列作离散傅里叶变换得到序列中串联重复序列出现的频率,并对基因序列做加窗傅里叶变换,找出串联重复序列存在的位置。实验表明,该方法的计算量较已有方法减少了75%,并能较好地解决已有方法识别灵敏度低的缺点。
Aiming at the drawbacks of the existing tandem repeats finding methods,such as large number of calculations and feeble sensitivity,this paper presents a tandem repeats identification method which is based on spectral analysis.The technique employs the Electron-Ion Interaction Potential(EIIP) of each nucleotide as the numerical representation for DNA sequence,and obtains the occurrence frequency of the tandem repeats which is buried in the sequence after computing the Discrete Fourier Transform(DFT) of the sequence.The windowed Fourier transform is used,and the tandem repeats location is identified efficiently.Experiment demonstrates that the calculation amount is reduced by 75% compared with the existing methods,and greatly resolves the feeble sensitivity of the existing techniques.
出处
《计算机工程》
CAS
CSCD
北大核心
2011年第9期181-183,共3页
Computer Engineering
基金
河北省教育厅自然科学研究计划基金资助项目(2009339)
关键词
串联重复序列
离散傅里叶变换
电子离子相互作用势
频谱分析
信噪比
tandem repeats
Discrete Fourier Transform(DFT)
Electron-Ion Interaction Potential(EIIP)
spectral analysis
Signal to Noise Ratio(SNR)