期刊文献+

基于支持向量机的脂肪酶耐热序列与嗜热序列分类研究 被引量:1

Classification of thermostable and thermophilic lipases using support vector machines
下载PDF
导出
摘要 从GenBank数据库中获取了微生物来源的嗜热脂肪酶序列77条,耐热脂肪酶序列65条,分别统计分析序列中20种氨基酸出现的频次,二肽片段、三肽片段出现的差异以及非相邻二元组合的偏爱性。在此基础上,利用支持向量机(SVM)进行序列分类研究。研究结果表明:在统计学意义上,20种天然氨基酸残基中,亮氨酸、脯氨酸、蛋氨酸、苯丙氨酸、色氨酸和酪氨酸在嗜热蛋白序列中出现的频率高于其在耐热蛋白中出现的频率;二肽片段KC,EE,KE,RE,VE,YI,EK,VK,EV,YV,EY,KY,VY和YY的出现频率在嗜热蛋白中显著高于其在耐热蛋白中出现的频率。三肽片段的出现频率和非相邻二元组合的序列偏爱性也显示与蛋白耐热性显著相关。训练集的分类准确率达99.65%,真实数据集的分类准确率达到98.41%。 The amino acid compositions,the distributions of N(N=2,3) neighboring amino acids and the non-adjacent di-residue coupling patterns in the sequences of 65 thermostable and 77 thermophilic lipases getting from GenBank were systematically analyzed.Based on the information,a statistical method based on support vector machines(SVMs) for discriminating thermophilic and thermostable lipases was developed.The results show that hydrophobic residues Leu,Pro,Met,Phe,Trp,as well as the polar residue Tyr have higher occurrences in thermophilic lipases than thermostable ones.The occurrences of KC,EE,KE,RE,VE,YI,EK,VK,EV,YV,EY,KY,VY and YY in thermophilic proteins are significantly more frequent.The composition of dipeptide,tripeptide and non-adjacent di-residue patterns contain more information than amino acid composition,and this information indicates the possible thermostable mechanism of microbial lipases.The accuracy of this method for the training dataset is 99.65%,and its accuracy for testing datasets is 98.41%.
出处 《中南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2011年第9期2543-2550,共8页 Journal of Central South University:Science and Technology
基金 国家自然科学基金资助项目(31000350)
关键词 氨基酸组成 多肽片段 非相邻二元组合 蛋白质热稳定性 支持向量机 amino acid composition n-peptide composition di-residue coupling protein stability support vector machines
  • 相关文献

参考文献19

  • 1Pemas M, Lo'z C, Prada A, et al. Structural basis for the kinetics of Candida rugosa Lipl and Lip3 isoenzymes[J]. Colloids Surf B Biointerfaces, 2002, 26(1/2): 67-74.
  • 2Jaeger K E, Ransac S. Bacterial lipases[J]. FEMS Microbiology Reviews, 1994, 15(1): 29-63.
  • 3Sharma R, Soni S K, Vohra R M, et al. Purification and characterization of a thermostable alkaline lipase from a new thermophilic Bacillus sp. RSJ-I[J]. Proc Biochem, 2002, 37(10): 1075-1084.
  • 4Hiol A, Jonzo M D, Rugani N, et al. Purification and characterization of an extracellular lipase from a thermophilic Rhizopus oryzae strain isolated from palm fruit[J]. Enzyme and Microbial Technology, 2000, 26(5/6): 421-430.
  • 5Kim H K, Park S Y, Lee J K, et al. Gene cloning and characterization of thermostable lipase from Bacillus stearothermophilus L1 Biosei[J]. Biotechnol. Biochem, 1998, 62(1): 66-71.
  • 6丁彦蕊.嗜热菌耐热性与蛋白质序列结构和功能关系的生物信息学研究[D].无锡:江南大学生物工程学院,2005:155-161.
  • 7Shen J W, Zhang J, Luo X M., et al. Prediction protein-protein interactions based only on sequences information[J]. PNAS, 2007, 104(11): 4337-4341.
  • 8朱蔚,郑佐华,袁有忠,周宗祥,毛裕民.编码序列的(G+C)%与蛋白质的耐热性相关性分析[J].Acta Genetica Sinica,1999,26(4):418-427. 被引量:6
  • 9丁彦蕊,蔡宇杰,须文波.蛋白质空间结构属性与全基因组微生物耐热性的关系[J].中国生物化学与分子生物学报,2007,23(4):323-330. 被引量:4
  • 10张振慧,王正华,王勇献.基于氨基酸和二肽组成的蛋白质四级结构分类研究[J].生物信息学,2007,5(2):49-52. 被引量:1

二级参考文献59

  • 1施建宇,潘泉,张绍武,程咏梅.基于氨基酸组成分布的蛋白质同源寡聚体分类研究[J].生物物理学报,2006,22(1):49-56. 被引量:9
  • 2陈敏,刘万卉,王静馨.模式识别在锌酵母发酵工艺优化控制中的应用[J].生物工程学报,1996,12(3):367-370. 被引量:2
  • 3[1]Klotz IM,Darnall DW,Langerman NR.Quaternary structure of proteins.In:Neurath H,Hill RL,editors.The protein[M].3rd ediction,Vol.1.New York:Academic Presess,1975,1:226-411.
  • 4[2]Robert G.Prediction of quaternary structure from primary structure[J].Bioinformatics,2001,(17):551-556.
  • 5[3]Zhang SW,Pan Q,Zhang HC,et al.Classification of protein quaternary structure with support vector machine[J].Bioinformatics,2003,(19):2390-2396.
  • 6[7]Chou KC,Cai YD.Predicting protein quaternary structure by pseudo amino acid composition[J].Proteins,2003,(53):282-289.
  • 7[8]Zhang CT,Zhang R.Q9,a content-balancing accuracy index to evaluate algorithms of protein secondary structure prediction[J].The International Journal of Biochemistry and Cell Biology,2003,(35):1256-1262.
  • 8[9]Bairoch A,Apeweiler R.The Swiss-Prot protein soquence data bank and its supplement TrEMBL[J].Nucleic Acids Research,2000,(25):31-36.
  • 9Lee Yongeok,J Gen Microbiol,1993年,139卷,1227页
  • 10Yi Ting,J Bacteriol,1991年,173卷,21期,6849页

共引文献10

同被引文献47

  • 1王彩梅,蔡少丽,吴义真,林琳.扩展青霉脂肪酶ep8与R182K叠加突变体的构建[J].生物技术通报,2007,23(4):165-168. 被引量:3
  • 2蔡少丽,林俊涵,王彩梅,林琳.K55R与ep8叠加突变对扩展青霉脂肪酶热稳定性的改善[J].生物工程学报,2007,23(4):677-680. 被引量:8
  • 3陶苏丹,刘佳,陈喜文,陈德富.点饱和突变技术及其在蛋白质工程中的应用[J].中国生物工程杂志,2007,27(8):82-86. 被引量:7
  • 4Jaeger K E,Eggert T.Lipases for biotechnology[J].Curr Opin Biotechnol,2002,13:390-397.
  • 5Hasan F,Shah A A,Hameed A.Industrial applications of microbial lipases[J].Enzyme Microb Technol,2006,39:235-251.
  • 6Eijsink V G,Gaseidnes S,Borchert T V,et al.Directed evolution of enzyme stability[J].Biomol Eng,2005,22:21-30.
  • 7Nardini M,Dijkstra B W.Alpha/beta-hydrolase fold enzymes:the family keeps growing[J].Curr Opin Struct Biol,1999,9:732-737.
  • 8Feller G,Narinx E,Arpigny J L,et al.Enzymes from psychrophilic organisms[J].FEMS Microbiol Rev,1996,18:189-202.
  • 9TAN Z,LI J,WU M,et al.High-level heterologous expression of an alkaline lipase gene from Penicillium cyclopium PG37 in Pichia pastoris[J].World J Microbiol Biotechnol,2011,27:2767-2774.
  • 10Reetz M T,Carballeira J D,Vogel A.Iterative saturation mutagenesis on the basis of B factors as a strategy for increasing protein thermostability[J].Angew Chem Int Ed Engl,2006,45:7745-7751.

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部