期刊文献+

核主成分logistic回归模型在非线性关联分析中的应用

Applications of the kernel principal component analysis-based logistic regression model on nonlinear association study
原文传递
导出
摘要 目的将核主成分分析(KPCA)与logistic回归模型相结合,提出一种核主成分logistic(KPCA-based logis-tic)回归模型,用于复杂疾病基因定位的非线性关联分析。方法针对病例对照研究设计的关联分析,对候选基因区域内的单核苷酸多肽性(SNPs)进行核主成分分析,以核主成分为自变量构建logistic回归模型,并对GAW16类风湿关节炎数据中PTPN22和RNF186两个基因区域进行分析,以验证KPCA-based logistic回归模型的有效性和实用性。结果对PTPN22和RNF186两个基因区域的分析结果显示,KPCA-based logistic回归模型既能够检测出单点检验所能发现的区域(PTPN22),也能检测出单点检验所不能发现的区域(RNF186)。结论 KPCA-based logistic回归模型是一种有效的非线性关联分析方法,能够发现更多的易感区域。 Objective To combine the kernel principal component analysis(KPCA) and the logistic regression model to propose a KPCA-based logistic regression model for nonlinear association analysis of complex disease gene mapping.Methods For association study of case-control research design,the kernel principal component analysis(KPCA) was performed on single nucleotide polymorphisms(SNPs) of a candidate region to construct the logistic regression model with kernel principal components as independent variables,and then the PTPN22 and RNF186 gene regions of rheumatoid arthritis(RA) data from GAW16 were analyzed to illustrate the effectiveness and practicability of the KPCA-based logistic regression model.Results Application to the PTPN22 and RNF186 gene regions indicated that the KPCA-based logistic regression model could detect regions which could be detected by a single-locus test(PTPN22),and identify significant regions which could not be identified by a single-locus test(RNF186).Conclusion As an effective nonlinear association study method,the KPCA-based logistic regression model can identify more susceptible regions.
出处 《山东大学学报(医学版)》 CAS 北大核心 2011年第5期140-142,146,共4页 Journal of Shandong University:Health Sciences
基金 国家自然科学基金资助课题(30871392)
关键词 核主成分分析 LOGISTIC回归 复杂疾病基因定位 关联分析 Kernel principal component analysis; Logistic regression; Complex disease gene mapping; Association study;
  • 相关文献

参考文献9

  • 1Beyene J, Tritchler D, Asimit J L, et al. Gene-or region- based analysis of genome-wide association studies [ J ]. Genet Epidemiol, 2009, 33( Suppl 1 ) :S105-110.
  • 2Buil A, Martinez-Perez A, Perera-Lluna A, et al. A new gene-based association test for genome-wide association studies[J]. BMC Proc, 2009, 3(Suppl 7) :S130.
  • 3Qiao B, Huang C H, Cong L, et al. Genome-wide gene- based analysis of rheumatoid arthritis-associated interaction with PTPN22 and HLA-DRB1 [J]. BMC Proc, 2009, 3 ( Suppl 7) : S132.
  • 4Neale B M, Sham P C. The future of association studies: gene-based analysis and replication [ J ]. Am J Hum Gen- et, 2004, 75(3) :353-362.
  • 5Scholkopf B, Smola A, Muller K R. Nonlinear compo- nent analysis as a kernel eigenvalue problem [ J ]. Neural Comput, 1998, 10(3) :1299-1319.
  • 6Silva S, Botelho C, De Bem R A, et al. C-NLPCA: ex- tracting non-linear principal components of image dataset[ J ]. Anasis Ⅻ Simposio Brasileiro de Sensoriamento Remoto, 2005 (4) : 3495-3502.
  • 7Heo G, Gader P, Frigui H. RKF-PCA: Robust kernel fuzzy PCA[ J ]. Neural Networks, 2009, 22(5-6) :642- 650.
  • 8Chapelle O, Vapnik V, Bousquet O, et al. Choosing multiple parameters for support vector machines [ J ]. Maching Learning, 2002, 46( 1 ) : 131-159.
  • 9Kramer O. Covariance matrix self-adaptation and kernel regression-perspectives of evolutionary optimization in kernel machines [ J ]. Fundamenta Informaticae, 2010, 98(1) :87-106.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部