期刊文献+

一种面向蛋白质复合体检测的图聚类方法 被引量:14

A Graph Clustering Method for Detecting Protein Complexes
下载PDF
导出
摘要 蛋白质互作用(protein-protein interaction,PPI)网络是广泛存在的一类复杂生物网络,其网络拓扑特征与功能模块分析密切相关.图聚类是对复杂网络进行分析和处理的一种重要计算方法.传统的PPI网络中蛋白质复合体检测算法通常对网络图中的对象进行硬划分,而寻找网络中的重叠簇的软聚类算法已成为当前研究热点之一.现有的软聚类算法较少关注寻找网络中具有重要生物意义的小规模非稠密簇.对此,基于网络中结点邻域给出了边关联强度的度量方法,并在此基础上提出了一种基于流模拟的PPI网络中复合体检测的图聚类(flow-simulation graph clustering,F-GCL)算法,该算法可以在快速发现PPI网络中的重叠簇的同时找到小规模非稠密簇;同时,与MCODE(molecular complex detection),MCL(Markov clustering),RNSC(restricted neighborhood search clustering)和CPM(clique percolation method)算法在6个酿酒酵母PPI网络上进行比较,该算法在F-measure,Accuracy,Separation方面表现了较好的性能. Protein-protein interaction (PPI) networks are widely present in complex biological networks. The topological features of PPI networks play an important role in analyzing the functional modules in networks. Some graph clustering methods have been successfully used to complex networks to detect protein complexes in PPI networks. Traditional graph clustering algorithms in PPI analyzing methods primarily focus on hard clustering for a network, while, nowadays soft clustering algorithms to find overlapped clusters have become one of the hotspots of current research. Existing soft clustering algorithms pay less attention on small-scale non-dense clusters, while some small-scale non-dense clusters often have important biological meaning in PPI networks. A measuring method of the association strength of edges is developed based on node neighborhoods in networks, and then a soft clustering algorithm named flow-simulation graph clustering (F-GCL) on the basis of flow simulation is presented to detect complexes in a PPI network. Experiments show that the proposed soft clustering algorithm F-GCL can simultaneously find out overlapping clusters and small-scale non- dense clusters without improving the running time. Compared with MCODE (molecular complex detection), MCL(Markov clustering), RNSC(restricted neighborhood search clustering) and CPM (clique percolation method) algorithms on six Saccharomyces cerevisiae PPI networks, the algorithm F-GCL shows considerable or better performance on three evaluating indicators: F-measure, Accuracy and Separation.
出处 《计算机研究与发展》 EI CSCD 北大核心 2015年第8期1784-1793,共10页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61432011 61272004) 山西省自然科学基金项目(2011011016-1) 教育部高等学校博士学科点专项科研基金项目(200801081017)
关键词 流模拟 图聚类 软聚类 蛋白质互作用网络 蛋白质复合体 flow simulation graph clustering soft clustering protein-protein interaction network protein complex
  • 相关文献

参考文献29

  • 1Huang Chien-Hung, Chou Szu-Yu, Ng Ka-Lok. Robustness of protein complex networks [C] //Proc of the 2nd Int Conf on Computer Science and Service System (CSSS). Los Alamitos, CA: IEEE Computer Society, 2012: 841-844.
  • 2郭茂祖,代启国,徐立秋,刘晓燕.一种蛋白质复合体模块度函数及其识别算法[J].计算机研究与发展,2014,51(10):2178-2186. 被引量:8
  • 3鱼亮,高琳,孙鹏岗.蛋白质网络中复合体和功能模块预测算法研究[J].计算机学报,2011,34(7):1239-1251. 被引量:10
  • 4冀俊忠,刘志军,刘红欣,刘椿年.蛋白质相互作用网络功能模块检测的研究综述[J].自动化学报,2014,40(4):577-593. 被引量:23
  • 5Zhao Bihai, Wang Jianxin, Li Min, et al. Detecting protein compleses based on uncertain graph model [J]. IEEE/ACM Trans on Computational Biology and Bioinformatics, 2014, 11 (3): 486-497.
  • 6Schaeffer S E. Graph clustering [J]. Computer Science Review, 2007, 1(1): 27-64.
  • 7Dongen S. Graph clustering by flow simulation [D]. Utrecht, Netherlands: University of Utrecht, 2000.
  • 8Shih Y K, Parthasarathy S. Identifying functional modules in interaction networks through overlapping Markov clustering [J]. Bioinformaties, 2012, 28(18): 473-479.
  • 9Bader G D, Hogoe C W. An automated method for finding molecular complexes in large protein interaction networks [J]. BMCBioinformatics, 2003, 4(1): 2.
  • 10Palla G, Der6nyi I, Farkas I, et al. Uncovering the overlapping community structure of complex networks in nature and society[J]. Nature, 2005, 435(7043): 814-818.

二级参考文献76

  • 1解(亻刍),汪小帆.复杂网络中的社团结构分析算法研究综述[J].复杂系统与复杂性科学,2005,2(3):1-12. 被引量:86
  • 2Lin X, Gao L et al. MOHCS: Towards mining overlapping highly connected subgraphs. Computing Research Repository, 2008, abs/0806. 3215.
  • 3Adamcsek B, Palla Get al. CFinder: Locating cliques and overlapping modules in biological networks. Bioinformatics, 2006, 22(8): 1021-1023.
  • 4Guldener U, Munsterkotter Met al. CYGD: The compre- hensive yeast genome database. Nucleic Acids Research, 2005, 33(Suppl. 1): D364-368.
  • 5Ruepp A, Zollner Aet al. The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomes. Nucleic Acids Research, 2004, 32 (18): 5539-5545.
  • 6Blatt M, Wiseman Set al. Superparamagnetic clustering of data. Physical Review Letter, 1996, 76(18): 3251-3254.
  • 7Bader G D, Hogue C W. Analyzing yeast protein-protein interaction data obtained from different sources. Nature Biotechnology, 2002, 20(10): 991-997.
  • 8Watts D J, Strogatz S H. Collective dynamics of 2smallworld' networks. Nature, 1998, $9S(6):440-44Z.
  • 9Sun P, Gao L. Fast algorithms for detecting overlapping funetlonal modules in PPI networks//Proeeedlngs of the IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology CIBCB 2009. Nashville, USA, 2009:247-254.
  • 10Gao L, Sun Pet al. Clustering Algorithms for detecting functional modules in protein interaction networks. Journal of Bioinformatics and Computational Biology, 2009, 7 (1) : 217-242.

共引文献32

同被引文献46

引证文献14

二级引证文献75

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部