摘要
RNA二级结构预测是生物信息学的重要研究内容。提出了一个新的启发式算法进行带假节的RNA结构预测。首先通过对RNA序列的若干特征和RNA二级结构进行相关性分析,从中选择跟RNA结构有较大相关性的特征;然后依据遗传算法、综合自由能、被选择茎区的条数以及被选择茎区的平均长度等特征来构造打分函数预测RNA的结构。对该方法进行了测试,结果表明采用的从特征分析中得出的打分函数,以及通过启发式算法来叠加茎的方法是有效的。在对tRNA以及5SrRNA等序列的预测上,相比单纯的自由能最小方法有更高的准确性。并且该方法进一步推广到预测含假结的RNA的二级结构时,也有较好的结果。
RNA secondary structure prediction is an important research field of bioinformatics. A new heuristic method to predict RNA secondary structure including pseudoknot is propose. Based on the analysis of specific fea- tures of RNA structures, the features is selected that have significant relationship with RNA structure characteris- tics. Then the genetic algorithm and incorporate those features are used: the free energy, the number and the length of selected stems to predict the RNA structure. The experiment results show that the scoring function concluded from the feature analysis as well as the heuristic stem adding algorithm is effective. It outperforms the traditional minimal free energy algorithm on tRNA and 5SrRNA data. This method also has good prediction results on the pseudo-knot structure.
出处
《科学技术与工程》
北大核心
2013年第31期9207-9213,共7页
Science Technology and Engineering
基金
国家自然科学基金(61003164)资助