A Modified Statistically Optimal Null Filter Method for Recognizing Protein-coding Regions (cited by: 1)

Abstract: Computer-aided prediction of protein-coding genes in uncharacterized genomic DNA sequences is one of the most important problems in biological signal processing. A modified filter method based on statistically optimal null filter (SONF) theory is proposed for recognizing protein-coding regions. The square deviation gain (SDG) between the input and output of the model is used to identify the coding regions. An SDG amplification model with Class I and Class II enhancement is designed to suppress the non-coding regions. An evaluation algorithm is also used to compare the modified model with most currently available gene prediction methods in terms of sensitivity, specificity and precision. Performance in identifying protein-coding regions was evaluated at the nucleotide level on benchmark datasets, yielding a sensitivity of 91.4%, a specificity of 96% and a precision of 93.7%. These results suggest that the proposed model is potentially useful in the gene-finding field, where it can help recognize protein-coding regions with higher precision and speed than present algorithms.
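The nucleotide-level evaluation mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the formulas behind its three measures, so everything below is an assumption: the function name `nucleotide_level_metrics` is hypothetical, sensitivity is taken as TP/(TP+FN), and specificity follows a convention common in gene-finding evaluations, TP/(TP+FP). The sketch also returns the mean of the two. The reported 93.7% happens to equal the mean of 91.4% and 96%, which is consistent with that reading of "precision" but does not confirm it.

```python
from typing import Dict, Sequence


def nucleotide_level_metrics(truth: Sequence[int], pred: Sequence[int]) -> Dict[str, float]:
    """Compare per-nucleotide labels (1 = coding, 0 = non-coding).

    Assumed definitions (not taken from the paper):
      sensitivity = TP / (TP + FN)
      specificity = TP / (TP + FP)   # gene-finding convention
      mean        = (sensitivity + specificity) / 2
    """
    if len(truth) != len(pred):
        raise ValueError("truth and pred must have the same length")

    # Count each of the four outcomes over all nucleotide positions.
    tp = sum(1 for t, p in zip(truth, pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(truth, pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(truth, pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(truth, pred) if t == 0 and p == 0)

    # Guard against empty denominators (no coding bases annotated or predicted).
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tp / (tp + fp) if (tp + fp) else 0.0

    return {
        "tp": tp, "fp": fp, "fn": fn, "tn": tn,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mean": (sensitivity + specificity) / 2,
    }


if __name__ == "__main__":
    # Toy annotation and prediction for a 10-nucleotide sequence.
    annotated = [0, 0, 1, 1, 1, 1, 0, 0, 1, 1]
    predicted = [0, 1, 1, 1, 1, 0, 0, 0, 1, 1]
    print(nucleotide_level_metrics(annotated, predicted))
```

In practice the predicted labels would come from thresholding a per-position score such as the SDG. That thresholding step is not described in the abstract and is left out here.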
Source: Genomics, Proteomics & Bioinformatics (基因组蛋白质组与生物信息学报, English edition; indexed in CAS, CSCD), 2012, Issue 3, pp. 166-173 (8 pages)
Funding: Supported by the Fundamental Research Funds for the Central Universities (Grant No. CDJXS10160001) and the Central University Postgraduates' Science and Innovation Funds of China (Grant No. CDJXS12160005).
Keywords: Gene prediction; Biological signal processing; Protein-coding region; Square deviation gain
