摘要
蛋白质折叠规律研究是生命科学重大前沿课题,折叠分类是蛋白质折叠研究的基础。目前的蛋白质折叠类型分类基本上靠专家完成,不同的库分类并不相同,迫切需要一个建立在统一原理基础上的蛋白质折叠类型数据库。本文以ASTRAL-1.65数据库中序列同源性在25%以下、分辨率小于2.5的蛋白为基础,通过对蛋白质空间结构的观察及折叠类型特征的分析,提出以蛋白质折叠核心为中心、以蛋白质结构拓扑不变性为原则、以蛋白质折叠核心的规则结构片段组成、连接和空间排布为依据的蛋白质折叠类型分类方法,建立了低相似度蛋白质折叠分类数据库——LIFCA,包含259种蛋白质折叠类型。数据库的建立,将为进一步的蛋白质折叠建模及数据挖掘、蛋白质折叠识别、蛋白质折叠结构进化研究奠定基础。
The research on protein folding is in the frontier of life science,fold classification is the foundation of protein folding study Nowadays protein fold classification rely on experts,different database have different classification standards,so it is very important for us to build a protein folding database under the same criteria.this paper based on database ASTRAL-1.65,according to sequence homology below 25%,resolution below 2.5 and the three dimensional structure of protein and fold type characteristics analysis we put forward a protein fold type classification method which based on protein fold core,under the protein structure topology invariant,which according to protein fold core regular structure composition,connection and spatial arrangement.We established low identity protein fold classification database——LIFCA,which contains 259 protein fold types.The established of this database lay the foundation for our future works on protein fold modeling,date mining,protein fold Identification and protein fold structure evolution.
出处
《生物信息学》
2010年第3期245-247,253,共4页
Chinese Journal of Bioinformatics
基金
国家自然科学基金(30570427)
北京市自然科学基金(4092008)