[Objective] To discuss the effects of major mapping methods for DNA sequence on the accuracy of protein coding regions prediction,and to find out the effective mapping methods.[Method] By taking Approximate Correlatio...[Objective] To discuss the effects of major mapping methods for DNA sequence on the accuracy of protein coding regions prediction,and to find out the effective mapping methods.[Method] By taking Approximate Correlation(AC) as the full measure of the prediction accuracy at nucleotide level,the windowed narrow pass-band filter(WNPBF) based prediction algorithm was applied to study the effects of different mapping methods on prediction accuracy.[Result] In DNA data sets ALLSEQ and HMR195,the Voss and Z-Curve methods are proved to be more effective mapping methods than paired numeric(PN),Electron-ion Interaction Potential(EIIP) and complex number methods.[Conclusion] This study lays the foundation to verify the effectiveness of new mapping methods by using the predicted AC value,and it is meaningful to reveal DNA structure by using bioinformatics methods.展开更多
A class of multistage filters, namely, real narrowband bandpass filter (RNBPF) has been previously used for identification of protein coding regions. This filter passes the frequency component at 2π/3 along with it...A class of multistage filters, namely, real narrowband bandpass filter (RNBPF) has been previously used for identification of protein coding regions. This filter passes the frequency component at 2π/3 along with its conjugate. This conjugate frequency compo- nent may degrade the identification accuracy. To improve the identification accuracy, two types of multistage filters are proposed in this paper. A complex narrowband bandpass filter (CNBPF) is proposed for suppressing the conjugate frequency component which, in turn, reduces the background noise present in the deoxyribonucleic acid (DNA) spec- trum and improves identification accuracy. By cascading RNBPF with moving average filter (RNBPFMA), another type of multistage filter is proposed. As moving average filter smooth out the rapid variations in the DNA spectrum, RNBPFMA improves the identification accuracy. The computational complexity of RNBPFMA is less than that of CNBPF. The RNBPF and proposed multistage filters are compared with previously reported short-time discrete Fourier transform (ST-DFT) method in terms of compu- tational complexity. It is found that multistage filters reduce the computational load to a greater extent compared to ST-DFT method. The identification accuracy of the proposed CNBPF and RNBPFMA methods is compared with existing anti-notch filter and RNBPF methods. The results show that proposed methods outperform existing methods in terms of identification accuracy for benchmark data sets.展开更多
基金Supported by Ningxia Natural Science Foundation (NZ1024)the Scientific Research the Project of Ningxia Universities (201027)~~
文摘[Objective] To discuss the effects of major mapping methods for DNA sequence on the accuracy of protein coding regions prediction,and to find out the effective mapping methods.[Method] By taking Approximate Correlation(AC) as the full measure of the prediction accuracy at nucleotide level,the windowed narrow pass-band filter(WNPBF) based prediction algorithm was applied to study the effects of different mapping methods on prediction accuracy.[Result] In DNA data sets ALLSEQ and HMR195,the Voss and Z-Curve methods are proved to be more effective mapping methods than paired numeric(PN),Electron-ion Interaction Potential(EIIP) and complex number methods.[Conclusion] This study lays the foundation to verify the effectiveness of new mapping methods by using the predicted AC value,and it is meaningful to reveal DNA structure by using bioinformatics methods.
文摘A class of multistage filters, namely, real narrowband bandpass filter (RNBPF) has been previously used for identification of protein coding regions. This filter passes the frequency component at 2π/3 along with its conjugate. This conjugate frequency compo- nent may degrade the identification accuracy. To improve the identification accuracy, two types of multistage filters are proposed in this paper. A complex narrowband bandpass filter (CNBPF) is proposed for suppressing the conjugate frequency component which, in turn, reduces the background noise present in the deoxyribonucleic acid (DNA) spec- trum and improves identification accuracy. By cascading RNBPF with moving average filter (RNBPFMA), another type of multistage filter is proposed. As moving average filter smooth out the rapid variations in the DNA spectrum, RNBPFMA improves the identification accuracy. The computational complexity of RNBPFMA is less than that of CNBPF. The RNBPF and proposed multistage filters are compared with previously reported short-time discrete Fourier transform (ST-DFT) method in terms of compu- tational complexity. It is found that multistage filters reduce the computational load to a greater extent compared to ST-DFT method. The identification accuracy of the proposed CNBPF and RNBPFMA methods is compared with existing anti-notch filter and RNBPF methods. The results show that proposed methods outperform existing methods in terms of identification accuracy for benchmark data sets.