Circular RNA(circRNA)is a novel non-coding endogenous RNAs.Evidence has shown that circRNAs are related to many biological processes and play essential roles in different biological functions.Although increasing numbe...Circular RNA(circRNA)is a novel non-coding endogenous RNAs.Evidence has shown that circRNAs are related to many biological processes and play essential roles in different biological functions.Although increasing numbers of circRNAs are discovered using high-throughput sequencing technologies,these techniques are still time-consuming and costly.In this study,we propose a computational method to predict circRNA-disesae associations which is based on metapath2 vec++and matrix factorization with integrated multiple data(called PCD MVMF).To construct more reliable networks,various aspects are considered.Firstly,circRNA annotation,sequence,and functional similarity networks are established,and disease-related genes and semantics are adopted to construct disease functional and semantic similarity networks.Secondly,metapath2 vec++is applied on an integrated heterogeneous network to learn the embedded features and initial prediction score.Finally,we use matrix factorization,take similarity as a constraint,and optimize it to obtain the final prediction results.Leave-one-out cross-validation,five-fold cross-validation,and f-measure are adopted to evaluate the performance of PCD MVMF.These evaluation metrics verify that PCD MVMF has better prediction performance than other methods.To further illustrate the performance of PCD MVMF,case studies of common diseases are conducted.Therefore,PCD MVMF can be regarded as a reliable and useful circRNA-disease association prediction tool.展开更多
Circular RNAs(circRNAs) play important roles in various biological processes, as essential non-coding RNAs that have effects on transcriptional and posttranscriptional gene expression regulation. Recently, many studie...Circular RNAs(circRNAs) play important roles in various biological processes, as essential non-coding RNAs that have effects on transcriptional and posttranscriptional gene expression regulation. Recently, many studies have shown that circRNAs can be regarded as micro RNA(miRNA) sponges, which are known to be associated with certain diseases. Therefore efficient computation methods are needed to explore miRNAcircRNA interactions, but only very few computational methods for predicting the associations between miRNAs and circRNAs exist. In this study, we adopt an improved random walk computational method, named KRWRMC, to express complicated associations between miRNAs and circRNAs. Our major contributions can be summed up in two points. First, in the conventional Random Walk Restart Heterogeneous(RWRH) algorithm, the computational method simply converts the circRNA/miRNA similarity network into the transition probability matrix;in contrast,we take the influence of the neighbor of the node in the network into account, which can suggest or stress some potential associations. Second, our proposed KRWRMC is the first computational model to calculate large numbers of miRNA-circRNA associations, which can be regarded as biomarkers to diagnose certain diseases and can thus help us to better understand complicated diseases. The reliability of KRWRMC has been verified by Leave One Out Cross Validation(LOOCV) and 10-fold cross validation, the results of which indicate that this method achieves excellent performance in predicting potential miRNA-circRNA associations.展开更多
基金supported by the National Natural Science Foundation of China(Nos.61972451,61672334,and 61902230)the Fundamental Research Funds for the Central Universities,Shaanxi Normal University(Nos.GK201901010 and 2018TS079)
文摘Circular RNA(circRNA)is a novel non-coding endogenous RNAs.Evidence has shown that circRNAs are related to many biological processes and play essential roles in different biological functions.Although increasing numbers of circRNAs are discovered using high-throughput sequencing technologies,these techniques are still time-consuming and costly.In this study,we propose a computational method to predict circRNA-disesae associations which is based on metapath2 vec++and matrix factorization with integrated multiple data(called PCD MVMF).To construct more reliable networks,various aspects are considered.Firstly,circRNA annotation,sequence,and functional similarity networks are established,and disease-related genes and semantics are adopted to construct disease functional and semantic similarity networks.Secondly,metapath2 vec++is applied on an integrated heterogeneous network to learn the embedded features and initial prediction score.Finally,we use matrix factorization,take similarity as a constraint,and optimize it to obtain the final prediction results.Leave-one-out cross-validation,five-fold cross-validation,and f-measure are adopted to evaluate the performance of PCD MVMF.These evaluation metrics verify that PCD MVMF has better prediction performance than other methods.To further illustrate the performance of PCD MVMF,case studies of common diseases are conducted.Therefore,PCD MVMF can be regarded as a reliable and useful circRNA-disease association prediction tool.
基金supported by the National Natural Science Foundation of China (No. 61672334)the Fundamental Research Funds for the Central Universities (No. GK201901010)
文摘Circular RNAs(circRNAs) play important roles in various biological processes, as essential non-coding RNAs that have effects on transcriptional and posttranscriptional gene expression regulation. Recently, many studies have shown that circRNAs can be regarded as micro RNA(miRNA) sponges, which are known to be associated with certain diseases. Therefore efficient computation methods are needed to explore miRNAcircRNA interactions, but only very few computational methods for predicting the associations between miRNAs and circRNAs exist. In this study, we adopt an improved random walk computational method, named KRWRMC, to express complicated associations between miRNAs and circRNAs. Our major contributions can be summed up in two points. First, in the conventional Random Walk Restart Heterogeneous(RWRH) algorithm, the computational method simply converts the circRNA/miRNA similarity network into the transition probability matrix;in contrast,we take the influence of the neighbor of the node in the network into account, which can suggest or stress some potential associations. Second, our proposed KRWRMC is the first computational model to calculate large numbers of miRNA-circRNA associations, which can be regarded as biomarkers to diagnose certain diseases and can thus help us to better understand complicated diseases. The reliability of KRWRMC has been verified by Leave One Out Cross Validation(LOOCV) and 10-fold cross validation, the results of which indicate that this method achieves excellent performance in predicting potential miRNA-circRNA associations.