摘要
启动子是调控基因转录表达的核心元件。启动子上的突变会影响启动子的功能活性,从而导致基因表达异常。本研究通过挖掘启动子序列中蕴含的序列特征,提出了一个基于序列模式挖掘的启动子序列打分模型,通过该模型,实现对启动子序列信号强度的定量度量。试验结果表明,该模型可有效区分真、假启动子序列,计算验证试验表明该模型具有良好的鲁棒性,并可用于识别致病启动子序列突变。
Promoter is a type of core regulatory element on gene expression. Mutations in promoter could affect its functional activity, which could cause dysregulation on gene expression, and result in abnormal protein product. In this study, based on the sequential features identified from promoter sequences, we developed a score system for promoter. Through this score system, the signal strength of a promoter could be measured quantitatively. The experi- ment results showed that this model can effectively distinguish between true and false promoter sequences. Compu- tational validation experiment showed that this model had a great robustness. Besides, the model could be used to efficiently identify pathogenic mutation in promoter.
作者
陈虹
赵海峰
王畅畅
施梦军
马猛
Chen Hong;Zhao Haifeng;Wang Changchang;Shi Mengjun;Ma Meng(School of Computer Science and Technology, Anhui University, Hefei, 230601;Icahn School of Medicine at Mount Sinai, New York, 10029)
出处
《基因组学与应用生物学》
CAS
CSCD
北大核心
2017年第11期4579-4584,共6页
Genomics and Applied Biology
基金
国家自然基金(No.61300057
No.81000321)
安徽省教育厅重点项目(KJ2016A040
KJ2013A007)共同资助
关键词
启动子
致病突变
序列模式挖掘
Promoter, Pathogenic mutation, Sequential pattern mining