Genes associated with similar diseases are often functionally related.This principle is largely supported by many biological data sources,such as disease phenotype similarities,protein complexes,protein-protein intera...Genes associated with similar diseases are often functionally related.This principle is largely supported by many biological data sources,such as disease phenotype similarities,protein complexes,protein-protein interactions,pathways and gene expression profiles.Integrating multiple types of biological data is an effective method to identify disease genes for many genetic diseases.To capture the gene-disease associations based on biological networks,a kernel-based Markov random field(MRF)method is proposed by combining graph kernels and the MRF method.In the proposed method,three kinds of kernels are employed to describe the overall relationships of vertices in five biological networks,respectively,and a novel weighted MRF method is developed to integrate those data.In addition,an improved Gibbs sampling procedure and a novel parameter estimation method are proposed to generate predictions from the kernel-based MRF method.Numerical experiments are carried out by integrating known gene-disease associations,protein complexes,protein-protein interactions,pathways and gene expression profiles.The proposed kernel-based MRF method is evaluated by the leave-one-out cross validation paradigm,achieving an AUC score of 0.771 when integrating all those biological data in our experiments,which indicates that our proposed method is very promising compared with many existing methods.展开更多
基金supported by the Natural Sciences and Engineering Research Council of CanadaNational Natural Science Foundation of China(61428209,61232001)
文摘Genes associated with similar diseases are often functionally related.This principle is largely supported by many biological data sources,such as disease phenotype similarities,protein complexes,protein-protein interactions,pathways and gene expression profiles.Integrating multiple types of biological data is an effective method to identify disease genes for many genetic diseases.To capture the gene-disease associations based on biological networks,a kernel-based Markov random field(MRF)method is proposed by combining graph kernels and the MRF method.In the proposed method,three kinds of kernels are employed to describe the overall relationships of vertices in five biological networks,respectively,and a novel weighted MRF method is developed to integrate those data.In addition,an improved Gibbs sampling procedure and a novel parameter estimation method are proposed to generate predictions from the kernel-based MRF method.Numerical experiments are carried out by integrating known gene-disease associations,protein complexes,protein-protein interactions,pathways and gene expression profiles.The proposed kernel-based MRF method is evaluated by the leave-one-out cross validation paradigm,achieving an AUC score of 0.771 when integrating all those biological data in our experiments,which indicates that our proposed method is very promising compared with many existing methods.