Mining Gene Expression Profiles: An Integrated Implementation of Kernel Principal Component Analysis and Singular Value Decomposition

Abstract: The detection of genes that show similar profiles under different experimental conditions is often an initial step in inferring the biological significance of such genes. Visualization tools are used to identify genes with similar profiles in microarray studies. Given the large number of genes recorded in microarray experiments, gene expression data are generally displayed on a low-dimensional plot based on linear methods. However, microarray data show nonlinearity due to high-order interaction terms between genes, so alternative approaches, such as kernel methods, may be more appropriate. We introduce a technique that combines kernel principal component analysis (KPCA) and the biplot to visualize gene expression profiles. Our approach relies on the singular value decomposition of the input matrix and incorporates an additional step that involves KPCA. The main properties of our method are the extraction of nonlinear features and the preservation of the input variables (genes) in the output display. We apply this algorithm to colon tumor, leukemia and lymphoma datasets. Our approach reveals the underlying structure of the gene expression profiles and provides a more intuitive understanding of the association between genes and samples.
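For illustration only: the abstract does not state which kernel is used, how its parameters are chosen, or how the KPCA step is coupled to the singular value decomposition. The sketch below therefore shows the two named ingredients separately, in Python with NumPy, rather than the authors' integrated method. The Gaussian (RBF) kernel, the `gamma` value, all function names, and the synthetic data matrix are assumptions introduced here.

```python
import numpy as np

def rbf_kernel(X, gamma):
    """Gaussian (RBF) kernel matrix between the rows of X (assumed kernel)."""
    sq_norms = np.sum(X ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * np.maximum(sq_dists, 0.0))

def kernel_pca_scores(X, gamma, n_components=2):
    """Scores of the rows of X (samples) on the leading kernel principal components."""
    n = X.shape[0]
    K = rbf_kernel(X, gamma)
    J = np.eye(n) - np.ones((n, n)) / n      # centering matrix
    Kc = J @ K @ J                           # kernel centered in feature space
    eigvals, eigvecs = np.linalg.eigh(Kc)    # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:n_components]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    return eigvecs * np.sqrt(np.maximum(eigvals, 0.0))

def linear_biplot(X, n_components=2):
    """Classical SVD biplot markers: rows for samples, columns for genes."""
    Xc = X - X.mean(axis=0)                  # column-center the expression matrix
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    sample_markers = U[:, :n_components] * s[:n_components]
    gene_markers = Vt[:n_components].T
    return sample_markers, gene_markers

# Synthetic stand-in for an expression matrix: 20 samples x 50 genes.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 50))

scores = kernel_pca_scores(X, gamma=1.0 / X.shape[1])  # nonlinear sample coordinates
samples, genes = linear_biplot(X)                      # linear sample and gene markers
```

In the linear biplot, genes appear directly as column markers, whereas plain KPCA yields sample coordinates only. Keeping the genes visible in a kernel-based display is the property the abstract attributes to the proposed method.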
Source: Genomics, Proteomics & Bioinformatics (indexed in SCIE, CAS, CSCD), 2010, Issue 3, pp. 200-210 (11 pages). Chinese journal title: 基因组蛋白质组与生物信息学报(英文版)
Funding: Funded in part by grant MEC-MTM2008-00642.
Keywords: kernel method, biplot, gene expression profile, dimension reduction