Abstract
C2H2 zinc finger proteins are the largest class of transcriptional regulators in mammals. The C2H2 zinc finger motifs they contain mostly differ from one another, which suggests that these proteins bind different DNA sequences, regulate different genes, and carry out diverse regulatory functions. However, the DNA sequences bound by most C2H2 zinc finger proteins remain unknown, and this hinders the study of their functions. Some preliminary work has been done on predicting the target sequences of C2H2 zinc finger proteins. This review describes the canonical mode by which C2H2 zinc finger motifs bind DNA and systematically summarizes the algorithms, training datasets, gold-standard datasets, and tools used in methods for predicting the DNA-binding preferences of C2H2 zinc finger proteins. It aims to improve understanding of the principles and tools of target sequence prediction, and to lay a foundation for accurate prediction of C2H2 zinc finger protein target sequences and for deeper functional studies.
Source
《生物化学与生物物理进展》
SCIE
CAS
CSCD
PKU Core Journal (北大核心)
2017, Issue 7, pp. 573-579 (7 pages)
Progress in Biochemistry and Biophysics
Funding
National Natural Science Foundation of China (31671376)
Beijing Nova Program (Z161100004916148)
International Science and Technology Cooperation Program of China (2014DFB30020, 2014DFB30010)
National Major Scientific Research Program of China (2015CB910700, 2014CBA02001)
Open Project of the State Key Laboratory of Proteomics (SKLP-O201404, SKLP-O201507)
Keywords
transcription factor
C2H2 zinc finger proteins
prediction of DNA-binding preferences (target sequence prediction)