期刊文献+

基于矩阵填充的肿瘤基因表达谱数据缺失点估计 被引量:3

Tumor Gene Expression Missing Value Estimation Based on Matrix Completion
下载PDF
导出
摘要 为解决肿瘤基因表达谱数据后续研究需要完整数据矩阵的问题,针对包含缺失点的数据集。提出基于矩阵填充(matrix completion)与模糊C均值(fuzzy c-means algorithm,FCM)相结合的缺失点估计方法(FCM_MC)。该方法充分利用肿瘤基因表达谱数据的冗余信息,通过模糊C均值聚类得到具有良好的低秩特性的基因语义片段,再利用矩阵填充方法分别对每个语义片段进行缺失点的重建。在不同数据集上进行实验,与传统缺失点估计算法比较。实验表明FCM_MC算法在缺失数据估计准确度和类结构保持度上效果得到有效提升,同时运行效率较高。 To solve the problem that the research of tumor gene expression data needs a complete data matrix,a missing value estimation method(FCM_MC) based on matrix completion(MC) and fuzzy c-means algorithm(FCM) is proposed for matrices contain missing values.The method makes full use of the redundancy information of tumor gene expression data,the low rank genetic semantics matrices are obtained by fuzzy c-mean clustering method.Then matrix completion theory was used to estimate the missing values of every semantics matrices.After the estimation of different data sets,our proposal with tradition missing value estimation algorithm were compared.Experimental results show the improvement of our method on missing value estimation accuracy and structure of class preserving accuracy with suitable efficiency.
出处 《科学技术与工程》 北大核心 2017年第7期63-68,89,共7页 Science Technology and Engineering
基金 国家自然科学基金(51365017 61305019) 江西省教育厅科技计划(GJJ150680)资助
关键词 矩阵填充 模糊C均值 低秩 基因语义 缺失值估计 matrix completion fuzzy C-means low rank genetic semantic missing value estimation
  • 相关文献

参考文献1

二级参考文献22

  • 1[1]Eisenberg D, Marcotte EM, Xenarios I, et al. Protein function in the post-genomic era [J]. Nature, 2000, 405(6788): 823-826.
  • 2[2]Debouck C, Goodfellow PN. DNA microarrays in drug discovery and development[J]. Nat Genet , 1999, 21(Suppl 1): 48-50.
  • 3[3]Eisen MB, Spellman PT, Brown PO, et al. Cluster analysis and display of genome-wide expression patterns [J]. Proc Natl Acad Sci U S A,1998,95(25):14863-14868.
  • 4[4]Spellman PT, Sherlock G, Zhang MQ, et al. Comprehensive identification of cell cycle regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization [J]. Mol Biol Cell, 1998, 9(12):3273-3297.
  • 5[5]Alizadeh AA, Eisen MB, Davis RE, et al. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J]. Nature, 2000, 403(6769): 503-511.
  • 6[6]Golub TR, Slonim DK, Tamayo P, et al. Molecular classification of cancer:class discovery and class prediction by gene expression monitoring[J]. Science, 1999, 286(5439): 531-537.
  • 7[7]Xiong M, Jin L, Li WJ, et al. Computational methods for gene expression-based tumor classification [J]. Biotechniques, 2000,29(6):1264-1270.
  • 8[8]Perou CM, Jeffrey SS, van de Rijn M, et al. Distinctive gene expression patterns in human mammary epithelial cells and breast cancers [J]. Proc Natl Acad Sci U S A,1999,96(16): 9212-9217.
  • 9[9]Perou, CM, Serlie, T, Eisen MB, et al. Molecular portraits of human breast tumours [J]. Nature, 2000, 406(6797):747-752.
  • 10[10]Alon U, Barkai N, Notterman DA, et al. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J]. Proc Natl Acad Sci U S A,1999,96(12): 6745-6750.

共引文献6

同被引文献29

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部