Gene expression data clustering algorithm using hierarchical affinity propagation (cited by: 5)
Abstract: To improve the accuracy of hierarchical affinity propagation clustering on large-scale gene expression data, the Pearson correlation coefficient is used to measure the similarity between gene expression data and to construct the similarity matrix, and global data information is added to the adaptive stage of hierarchical affinity propagation clustering, yielding an efficient hierarchical affinity propagation clustering algorithm. Experimental results show that, compared with similar algorithms, the proposed algorithm clusters large-scale gene expression data quickly and obtains clustering results with higher Silhouette (Sil) and Calinski-Harabasz (CH) index values.
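The similarity-matrix step named in the abstract can be illustrated with a minimal sketch in plain Python. The function names `pearson` and `similarity_matrix`, the toy expression profiles, and the choice to score constant profiles as 0 are assumptions made for illustration; none of them come from the paper, and the hierarchical affinity propagation procedure and its adaptive stage are not reproduced here.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length expression profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        # Assumption: a constant profile has undefined correlation; score it as 0.
        return 0.0
    return cov / (norm_x * norm_y)


def similarity_matrix(profiles):
    """Build the n x n matrix S with S[i][k] = pearson(profiles[i], profiles[k])."""
    n = len(profiles)
    return [[pearson(profiles[i], profiles[k]) for k in range(n)] for i in range(n)]


# Hypothetical toy data: three genes measured under four conditions.
profiles = [
    [1.0, 2.0, 3.0, 4.0],  # rising
    [2.0, 4.0, 6.0, 8.0],  # rising, scaled copy of the first gene
    [4.0, 3.0, 2.0, 1.0],  # falling
]
S = similarity_matrix(profiles)
for row in S:
    print([round(v, 3) for v in row])
```

A matrix of this shape is the usual input to affinity propagation (Frey and Dueck, 2007), which additionally needs a preference value on the diagonal. How the paper sets that value is not stated in the abstract.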
Authors: WU Yu (吴娱), ZHONG Cheng (钟诚), YIN Meng-xiao (尹梦晓) (School of Computer, Electronics and Information, Guangxi University, Nanning 530004, China)
Source: Computer Engineering and Design (《计算机工程与设计》), Peking University Core Journal, 2016, No. 11, pp. 2961-2966 (6 pages)
Funding: National Natural Science Foundation of China (61462005); Guangxi Natural Science Foundation (2014XNSFAA118396, 2014XNSFAA118361)
Keywords: gene expression data; clustering; hierarchical affinity propagation; adaptation; global data