蛋白质折叠类型分类方法及分类数据库被引量：5

Protein fold type classify methods and classification database2

下载PDF

导出

摘要蛋白质折叠规律研究是生命科学重大前沿课题,折叠分类是蛋白质折叠研究的基础。目前的蛋白质折叠类型分类基本上靠专家完成,不同的库分类并不相同,迫切需要一个建立在统一原理基础上的蛋白质折叠类型数据库。本文以ASTRAL-1.65数据库中序列同源性在25%以下、分辨率小于2.5的蛋白为基础,通过对蛋白质空间结构的观察及折叠类型特征的分析,提出以蛋白质折叠核心为中心、以蛋白质结构拓扑不变性为原则、以蛋白质折叠核心的规则结构片段组成、连接和空间排布为依据的蛋白质折叠类型分类方法,建立了低相似度蛋白质折叠分类数据库——LIFCA,包含259种蛋白质折叠类型。数据库的建立,将为进一步的蛋白质折叠建模及数据挖掘、蛋白质折叠识别、蛋白质折叠结构进化研究奠定基础。 The research on protein folding is in the frontier of life science,fold classification is the foundation of protein folding study Nowadays protein fold classification rely on experts,different database have different classification standards,so it is very important for us to build a protein folding database under the same criteria.this paper based on database ASTRAL-1.65,according to sequence homology below 25%,resolution below 2.5 and the three dimensional structure of protein and fold type characteristics analysis we put forward a protein fold type classification method which based on protein fold core,under the protein structure topology invariant,which according to protein fold core regular structure composition,connection and spatial arrangement.We established low identity protein fold classification database——LIFCA,which contains 259 protein fold types.The established of this database lay the foundation for our future works on protein fold modeling,date mining,protein fold Identification and protein fold structure evolution.

作者李晓琴仁文科刘岳徐海松乔辉

机构地区北京工业大学生命科学与生物工程学院

出处《生物信息学》 2010年第3期245-247,253,共4页 Chinese Journal of Bioinformatics

基金国家自然科学基金(30570427) 北京市自然科学基金(4092008)

关键词蛋白质折叠折叠类型分类数据库折叠核心低相似度 Protein fold Fold type classification Database Fold core Low identity

分类号 Q51 [生物学—生物化学]

引文网络
相关文献

参考文献14

1Chothia C.One thousand families for the molecular biologist[J].Nature,1992,357(6379):543-544.
2David Baker.A surprising simplicity to protein folding[J].Nature,2000.
3Daggett V,Fersht A.The present view of the mechanism of protein folding[J].Nat Rev Mol Cell Biol,2003,4(6):497-502.
4Daggett V,Fersht A.Is there a unifying mechanism for protein folding?[J].Trends Biochem Sci,2003,28(1):18-25.
5Gianni S,Guydosh N R,Khan F,et al.Unifying features in protein-folding mechanisms[J].Proc Natl Acad Sci USA,2003,100(23):13286-13291.
6Onuchic J N,Wolynes P G.Theory of protein folding[J].Curr Opin Struct Biol,2004,14(1):70-75.
7Chandonia JM,Hon G,Walker NS,Lo Conte L,Koehl P,Levitt M,Brenner SE.The ASTRAL compendium in 2004.Nucleic Acids Research,2004,32(Sp.Iss.SI):D189-D192.
8Murzin A G,Brenner S E,Hubbard T,et al.SCOP:a structural classification of proteins database for the investigation of sequences and structures[J].J.Mol.Biol.,1995,(247):536-540.
9Lo Conte L,Brenner S E,Hubbard T,et al.SCOP database in 2002:refinements accommodate structural genomics[J].Nucl.Acid Res,2002,30(1):264-267.
10C A.Orengo,A D.Michie,S.Jones,et al.CATH:A Hierarchic Classification of Protein Domain Structures[J].J.M.Structure,1997,5(8):1093-1108.

二级参考文献47

1施建宇,潘泉,张绍武,梁彦.基于支持向量机融合网络的蛋白质折叠子识别研究[J].生物化学与生物物理进展,2006,33(2):155-162. 被引量：19
2李菁,王炜.氨基酸残基归类及用简化后的字符识别蛋白质结构保守区域[J].中国科学（C辑）,2006,36(6):552-562. 被引量：1
3[2]Chothia C.One thousand families for the molecular biologist.Nature,1992,357(6379):543～544
4[3]Baker D.A surprising simplicity to protein folding.Nature,2000,405(6782):39～42
5[4]Ding CHQ,Dubchak L Multi-class protein fold recognition using support vector machines and neural networks.Bioinformatics,2001,17(4):349～358
6[7]Bowie JU,Luthy R,Eisenberg D.A method to identify protein sequences that fold into a known three-dimensional structure.Science,1991,253:164～170
7[8]Elofsson A,Fischer D,Rice DW,Le Grand S.A study of combined structure-sequence profiles.Folding ＆ Design,1998,1:451～461
8[9]Jones DT,Taylor WR,hornton JM.A new approach to protein fold recognition.Nature,1992,358:86～89
9[10]Bryant SH,Lawrence CE.An empirical energy function for threading protein sequence through folding motif.Proteins,1993,16:92～112
10[11]Mirny LA,Shakhnovich EI.Protein structure prediction by threading:why it works and why it does not.Mol Biol,1998,283:507～526

共引文献9

1李晓琴,刘岳,仁文科,乔辉.70种蛋白质折叠类型的单模型识别[J].生物物理学报,2009,25(S1):18-19.
2施建宇,张艳宁.使用图像特征构建快速有效的蛋白质折叠识别方法[J].生物物理学报,2009,25(2):106-116. 被引量：5
3刘岳,李晓琴,徐海松,乔辉.蛋白质折叠类型的分类建模与识别[J].物理化学学报,2009,25(12):2558-2564. 被引量：8
4刘岳,徐海松,乔辉,李晓琴.双绕蛋白质的分类与识别[J].生物信息学,2010,8(1):1-6. 被引量：1
5李晓琴,罗辽复.β类蛋白的构建模式及拓扑结构预测[J].内蒙古大学学报（自然科学版）,1999,30(2):169-173. 被引量：1
6李晓琴,罗辽复.α类蛋白的构建模式及拓扑结构预测[J].内蒙古大学学报（自然科学版）,1999,30(3):325-328.
7闫金丽,陈治伟,徐海松,李晓琴.基于功能域组分的蛋白质折叠类型识别[J].生物化学与生物物理进展,2011,38(2):166-172. 被引量：3
8李晓琴,仁文科,刘岳.利用隐马尔科夫模型识别蛋白质折叠类型[J].北京工业大学学报,2011,37(7):1103-1109.
9马帅,王勤,李晓琴.α/β类蛋白质折叠类型的分类方法研究[J].生物信息学,2014,12(2):123-132. 被引量：5

同被引文献43

1张玮,李晓琴,徐海松,任文科.蛋白质折叠类型识别方法研究[J].生物物理学报,2008,24(1):65-71. 被引量：5
2施建宇,潘泉,张绍武,梁彦.基于支持向量机融合网络的蛋白质折叠子识别研究[J].生物化学与生物物理进展,2006,33(2):155-162. 被引量：19
3TORDA A E. Perspectives in protein-fold recognition[ J]. Current Opinion in Structural Biology, 1997, 7 (2) : 200-205.
4FINKELSTEIN A V. Protein structure: what it is possible to predict now[ J]. Current Opinion in Structural Biology, 1997, 7: 60-71.
5JONES D. Progress in protein structure prediction[ Jl. Current Opinion in Structural Biology, 1997, 7: 377-387.
6CHOTHIA C. One thousand families for the molecular biologist[ J ]. Nature, 1992, 357 (6379) : 543-544.
7LI H, HELLING R, TANG C, et al. Emergence of preferred structures in a simple model of protein folding[ Jl. Science, 1996, 273 : 666-669.
8DAVID B. A surprising simplicity to protein folding[ J 1 ~ Nature, 2000, 405 (6782) : 39-42.
9ALTSCHUL S F, MADDEN T L, SCHAFFER A A, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs[ J]. Nucleic Acids Research, 1997, 25 (17) : 3389-3402.
10EISENBERG D. Into the black of night[J]. Nat Struct Biol, 1997, 4: 95-97.

引证文献5

1李晓琴,仁文科,刘岳.利用隐马尔科夫模型识别蛋白质折叠类型[J].北京工业大学学报,2011,37(7):1103-1109.
2孔令强,李晓琴.基于特征片段信息的PH domain-like barrel 蛋白质折叠类型分类方法[J].生物信息学,2012,10(2):125-129. 被引量：3
3马帅,王勤,李晓琴.α/β类蛋白质折叠类型的分类方法研究[J].生物信息学,2014,12(2):123-132. 被引量：5
4宗立平,李晓琴.α类蛋白质折叠类型自动化分类研究[J].生命科学研究,2016,20(5):381-388.
5张业晓,李晓琴.SCOP数据库蛋白质折叠类型的自动分类分析[J].生物信息学,2017,15(2):78-83. 被引量：1

二级引证文献7

1张春城,李晓琴.基于设计模板的BRD-like折叠类型综合分类方法[J].生物信息学,2016,14(2):100-107.
2宗立平,李晓琴.α类蛋白质折叠类型自动化分类研究[J].生命科学研究,2016,20(5):381-388.
3张业晓,李晓琴.SCOP数据库蛋白质折叠类型的自动分类分析[J].生物信息学,2017,15(2):78-83. 被引量：1
4刘力力,林子欣,胡锦赫,安基永,王佳,林善枝.山杏CAT家族基因的生物信息学预测及表达分析[J].分子植物育种,2018,16(22):7255-7263. 被引量：4
5吴亚楠,李贺,苏倩,王振铭,刘景,时一平.沙棘PAL家族基因的生物信息学分析[J].黑龙江农业科学,2019(4):15-17. 被引量：1
6徐世琦,李贺,邓伊亦,朱元玲,邓金兰.沙棘CAT家族基因的生物信息学分析[J].天津农业科学,2019,25(5):1-4. 被引量：2
7徐楠,郭敏亮.计算机模拟在酶工程教学中的应用研究[J].广东化工,2020,47(1):159-160. 被引量：4

1宗立平,李晓琴.α类蛋白质折叠类型自动化分类研究[J].生命科学研究,2016,20(5):381-388.
2马帅,王勤,李晓琴.α/β类蛋白质折叠类型的分类方法研究[J].生物信息学,2014,12(2):123-132. 被引量：5
3张春城,李晓琴.基于设计模板的BRD-like折叠类型综合分类方法[J].生物信息学,2016,14(2):100-107.
4刘俊杰,谢文静,李前忠.具有不同重复结构单元数的AFPⅢ分子的热滞活性[J].生物物理学报,2009,0(S1):348-349.
5李晓琴,刘岳,仁文科,乔辉.70种蛋白质折叠类型的单模型识别[J].生物物理学报,2009,25(S1):18-19.
6乔辉,李晓琴,徐海松,刘岳.α/β、全α和全β蛋白中的Cation-π相互作用[J].生物物理学报,2009,25(S1):179-180.
7董标,董方霆,钱小红.毛细管电泳及其在蛋白质折叠研究中的应用[J].生物技术通讯,1999,10(4):314-318.
8蒋胜竞,石国玺,毛琳,潘建斌,安黎哲,刘永俊,冯虎元.不同PCR引物在根系丛枝菌根真菌群落研究中的应用比较(英文)[J].微生物学报,2015,55(7):916-925. 被引量：5
9吴征镒,孙航,周浙昆,李德铢,彭华.中国种子植物区系地理[J].生物多样性,2011,19(1):124-124. 被引量：70
10闫金丽,陈治伟,徐海松,李晓琴.基于功能域组分的蛋白质折叠类型识别[J].生物化学与生物物理进展,2011,38(2):166-172. 被引量：3

生物信息学

2010年第3期

浏览历史

内容加载中请稍等...

蛋白质折叠类型分类方法及分类数据库被引量：5

参考文献14

二级参考文献47

共引文献9

同被引文献43

引证文献5

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

蛋白质折叠类型分类方法及分类数据库 被引量：5

参考文献14

二级参考文献47

共引文献9

同被引文献43

引证文献5

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

蛋白质折叠类型分类方法及分类数据库被引量：5