期刊文献+

基于马氏距离的缺失值填充算法 被引量:24

Missing value estimation for gene expression data based on Mahalanobis distance
下载PDF
导出
摘要 提出了一种基于马氏距离的填充算法来估计基因表达数据集中的缺失数据。该算法通过基因之间的马氏距离来选择最近邻居基因,并将已得到的估计值应用到后续的估计过程中,然后采用信息论中熵值的概念计算最近邻居的加权系数,得到缺失数据的填充值。实验结果证明了该算法具有有效性,其性能优于其他基于最近邻居法的缺失值处理算法。 A imputation method based on Mahalanobis distance was proposed to estimate missing values in the gene expression data. The nearest neighbors were chosen by the Mahalanobis distance between genes, and then the concept of entropy was utilized to obtain estimations of missing values. The imputed values were used for the later imputation. Experiments prove that the method is valid and its performance is higher than the other imputation methods based on k-nearest neighbors for gene expression data.
出处 《计算机应用》 CSCD 北大核心 2005年第12期2868-2871,共4页 journal of Computer Applications
基金 湖南省自然科学基金(03JJY3095)
关键词 微阵列 缺失值估计 马氏距离 信息熵 microarray missing value estimation Mahalanobis distance entropy
  • 相关文献

参考文献17

  • 1DUDOIT S,YANG YH,CALLOW MJ,et al.Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments[J].Statistica Sinica,2002,12(1):111-139.
  • 2ARBEITMAN MN,FURLONG EEM,IMAM F,et al.Gene expression during the life cycle of Drosophila melanogaster[J].Science,2002,297(5590):2270-2275.
  • 3GASCH AP,SPELLMAN PT,KAO CM,et al.Genomic expression programs in the response of yeast cells to environmental changes[J]. Molecular Biology of the Cell 2000,11:4241-4257.
  • 4BOHEN SP, TROYANSKAYA OG, ALTER O,et al.Variation in gene expression patterns in follicular lymphoma and the response to rituximab[J]. Proc Natl Acad Sci,USA,2003,100(4):1926-1930.
  • 5BROWN MP,GRUNDY WN,LIN D,et al.Knowledge-based analysis of microarray gene expression data by using support vector machines[J]. Proc. Natl Acad. Sci,USA,2000,97,262-267.
  • 6RAYCHAUDHURI S,STUART JM,ALTMAN R.Principal components analysis to summarize microarray experiments:application to sporulation time series[J]. Pac. Symp. 15Biocomput.,2000,455-466.
  • 7ALTER O,BROWN PO,BOTSTEIN D.Singular value decomposition for genome-wide expression data processing and modeling[J]. Proc. Natl Acad. Sci. USA,2000,97(18):10101-10106.
  • 8BUTTE AJ,YE J,NIEDERFELLNER G,et al.Determining significant fold differences in gene expression analysis[J]. Pac. Symp. Biocomput.,2001,6:6-17.
  • 9ALIZADEH AA,EISEN MB,DAVIS RE,et al.Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling[J]. Nature,2000,403,503-511.
  • 10TROYANSKAYA O,CANTOR M,SHERLOCK G,et al.Missing value estimation methods for DNA microarrays[J]. Bioinformatics,2001,17:520-525.

共引文献4

同被引文献206

引证文献24

二级引证文献125

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部