期刊文献+

基于LZ复杂性距离的蛋白质三维结构比较

Protein 3D structure comparison based on LZ complexity distance
下载PDF
导出
摘要 基于LZ复杂性距离提出了一种非比对的蛋白质三维结构比较方法.该方法以蛋白质结构单元间的条件LZ复杂性为特征参数,根据条件LZ复杂性计算LZ复杂性距离来作为蛋白质三维结构(不)相似程度的定量刻画.该方法可在二次多项式的时间限度内计算完成.蛋白质的结构数据采用接触图的表示方式,以避免PDB格式数据中的非结构信息和不同坐标系对结构比较的影响.以真实的蛋白质三维结构数据所组成的5个数据集为实例,基于LZ复杂性距离对各数据集中的蛋白质单链进行了结构聚类.聚类的结果符合各蛋白质单链在传统的结构分类数据库中的分类,表明论文提出的方法能够有效地对蛋白质三维结构进行定量比较. Based on the IZ complexity distance metric, an alignment-free method for comparison of protein 3D structure, was proposed. The new method takes the conditional LZ complexity between protein structural units as the feature parameter. And the LZ complexity distance, which quantifies the (dis-)similarity of protein structures, was calculated according to the parameter. The method was solvable in quadratic polynomial time. Contact map was adopted to represent protein structure in this work so that the impact of non-structural information and different coordinate system brought from PDB format data could be neglected for comparison. Protein single chains were clustered based on the LZ complexity distance over five different data sets made of real protein molecules. Clustering results were shown to be in good agreement with the classification of these protein single chains in the classical structure classification database, which demonstrated that the proposed method can be effectively used for the quantitative comparison of protein three-dimensional structures.
出处 《高技术通讯》 CAS CSCD 北大核心 2007年第7期742-748,共7页 Chinese High Technology Letters
基金 国家自然科学基金(60371046)资助项目.
关键词 生物信息学 蛋白质三维结构 结构比较 LZ复杂性距离 接触图 bioinfonnatics, protein 3D structure, structure comparison, LZ complexity distance, contact map
  • 相关文献

参考文献24

  • 1Orengo C A,Todd A E,Thornton J.From protein structure to function.Curr Opin Struct Biol,1999,9:374-382.
  • 2Eugene V K,Yuri I W,Georgy P K.The structure of the protein universe and genome evolution.Nature,2002,420:218-223.
  • 3Patrice K.Protein structure similarities.Curr Opin Struct Biol,2001,11:348-353.
  • 4Redfern O,Alastair G,Maibaum M,et al.Survey of current protein family databases and their application in comparative,structural and functional genomics.Journal of Chromatography B,2005,815:97-107.
  • 5Goldsmith F S,Honig B.Structural genomics:computational methods for structure analysis.Protein Science,2003,12:1813-1821.
  • 6Godzik A.The structural alignment between two proteins:is there a unique answer.Protein Science,1996,5(7):1325-1338.
  • 7Bryant S H,Altschul S F.Statistics of sequence-structure threading.Curr Opin Struct Biol,1995,5:236-244.
  • 8Szustakowski J D,Weng Z P.Protein structure alignment using a genetic algorithm.Proteins,2000,38:428-440.
  • 9Blankenbecler R,Ohlsson M,Peterson C,et al.Matching protein structures with fuzzy alignments.Proc Natl Acad Sci,2003,100(21):11936-11940.
  • 10Chen L N,Zhou T,Tang Y.Protein structure alignment by deterministic annealing.Bioinformatics,2005,21(1):51-62.

二级参考文献15

  • 1吕宝忠 钟扬 高莉萍.分子进化与系统发育[M].北京:高等教育出版社,2002..
  • 2Mount D W.Bioinformatics:sequence and genome analysis.Cold Spring Harbor,NY:Cold Spring Harbor Laboratory Press,2001.337-342
  • 3Vinga S,Almeidal J.Alignment-free sequence comparison:a review.Bioinformatics,2003,19(4):513-523
  • 4Hao B L,Qi J,Wang B.Prokaryotic phylogeny based on complete genomes without sequence alignment.Modern Physics Letters B,2003,17(2):91-94
  • 5Hao B L,Qi J.Prokaryote phylogeny without sequence alignment:from avoidance signature to composition distance.Journal of Bioinformatics and Computational Biology,2004,2(1):1-19
  • 6Li M,Vitanyi P.An Introduction to Kolmogorov Complexity and Its Applications(2nd edition).Berlin Heidelberg:Springer Verlag,1997.3-8
  • 7Li M,Badger J H,Chen X,et al.An information based sequence distance and its application to whole mitochondrial genome phylogeny.Bioinformatics,2001,17(2):149-154
  • 8Chen X,Kwong S,Li M.A compression algorithm for DNA sequences and its applications in genome comparison.Genome Inform Ser Workshop Genome Information,1999,10:51-61
  • 9Hisahiko S,Takashi Y.DNA data compression in the post genome era.Genome Informatics,2001,12:512-514
  • 10Lempel A,Ziv J.On the complexity of finite sequences.IEEE Transactions on Information Theory,1976,IT-22(1):75-81

共引文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部