期刊文献+

KELMPSP:基于核极限学习机的假尿苷修饰位点识别 被引量:2

KELMPSP: Pseudouridine Sites Identification Based on Kernel Extreme Learning Machine
下载PDF
导出
摘要 假尿苷(ψ)是RNA序列中的一种化学修饰,其在基因转录过程中,由酶的催化作用而形成。它是目前所发现为数最多的一种RNA修饰,并且在正常行使生物学功能方面扮演着重要角色。因此,假尿苷修饰位点的识别是一个非常重要的研究领域。随着RNA序列数据的急速增长,基于机器学习识别假尿苷位点的方法相继提出,但其识别精度有待提高。因此,本文提出了一个新的融合核苷酸化学性质、核苷酸浓度和位置特异性的单核苷酸、双核苷酸、三核苷酸偏好特征的序列编码方式,并基于此编码方式和核极限学习机(Kernel Extreme Learning Machine,KELM)算法,构建了一个新的假尿苷位点预测器,该预测器被称为"KELMPSP"。通过Jackknife测试和独立数据集测试表明,KELMPSP明显优于现有的假尿苷位点预测器。KELMPSP可以通过网站:http://39.105.77.161:8890/KELMPSP进行使用。 Pseudouridine( ψ) is a chemical modification of the RNA sequence,which is formed by enzymatic catalysis during gene transcription. It is one of the most commonly found RNA modifications and plays an important role in various biological functions. Therefore,the identification of pseudouridine sites is a very important research field. With the rapid growth of RNA sequencing data,machine learning-based methods for identifying pseudouridine sites has been put forward,but their recognition accuracies need to be improved. This paper proposes a new sequence encoding method that combines the nucleotide chemical properties, nucleotide concentration and position-specific mononucleotide,dinucleotide and trinucleotide propensity characteristics. In addition,a new predictor for identifyingpseudouridine sites based on this encoding method and the Kernel Extreme Learning Machine( KELM)algorithm is built,which is named "KELMPSP". The experiment performances of Jackknife tests and independent dataset tests show that KELMPSP remarkably outperforms the existing predictors. KELMPSP is available at: http://39. 105. 77. 161∶ 8890/KELMPSP.
作者 李永贞 樊永显 杨辉华 LI Yong-Zhen;FAN Yong-Xian;YANG Hui-Hua(Laboratory of Artificial Intelligence,School of Electronic Engineering and Automation,Guilin University of Electronic Technology,Guilin 541004,Guangxi,China;Information Security,Guilin University of Electronic Technology,Laboratory of Artificial Intelligence,School of Computer and Guilin 541004,Guangxi,China;Laboratory of Spectrum and Big Data,School of Automation,Beijing University of Posts and Telecommunications,Beijing 100876,China)
出处 《中国生物化学与分子生物学报》 CAS CSCD 北大核心 2018年第7期785-793,共9页 Chinese Journal of Biochemistry and Molecular Biology
基金 国家自然科学基金项目(No.61462018 No.61762026) 广西自然科学基金(No.2017GXNSFAA198278) 广西可信软件重点实验室(No.kx201403) 广西高校计算机图像与图形智能处理重点实验室(No.GIIP201502)资助~~
关键词 假尿苷 RNA 识别 核极限学习机 pseudouridine RNA identification Kernel extreme learning machine
  • 相关文献

同被引文献9

引证文献2

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部