期刊文献+
共找到4篇文章
< 1 >
每页显示 20 50 100
DeepRisk:A deep learning approach for genome-wide assessment of common disease risk 被引量:1
1
作者 Jiajie Peng Zhijie Bao +8 位作者 Jingyi Lia Ruijiang Han Yuxian Wang Lu Han Jinghao Peng Tao Wang Jianye Hao Zhongyu Wei xuequn shang 《Fundamental Research》 CAS CSCD 2024年第4期752-760,共9页
The potential for being able to identify individuals at high disease risk solely based on genotype data has garnered significant interest.Although widely applied,traditional polygenic risk scoring methods fall short,a... The potential for being able to identify individuals at high disease risk solely based on genotype data has garnered significant interest.Although widely applied,traditional polygenic risk scoring methods fall short,as they are built on additive models that fail to capture the intricate associations among single nucleotide polymorphisms(SNPs).This presents a limitation,as genetic diseases often arise from complex interactions between multiple SNPs.To address this challenge,we developed DeepRisk,a biological knowledge-driven deep learning method for modeling these complex,nonlinear associations among SNPs,to provide a more effective method for scoring the risk of common diseases with genome-wide genotype data.Evaluations demonstrated that DeepRisk outperforms existing PRs-based methods in identifying individuals at high risk for four common diseases:Alzheimer's disease,inflammatory bowel disease,type 2diabetes,and breast cancer. 展开更多
关键词 Disease risk prediction Deep learning Polygenic risk score Common disease risk Disease prevention
原文传递
Constrained query of order-preserving submatrix in gene expression data 被引量:2
2
作者 Tao JIANG Zhanhuai LI +3 位作者 xuequn shang Bolin CHEN Weibang LI Zhilei YIN 《Frontiers of Computer Science》 SCIE EI CSCD 2016年第6期1052-1066,共15页
Order-preserving submatrix (OPSM) has become important in modelling biologically meaningful subspace cluster, capturing the general tendency of gene expressions across a subset of conditions. With the advance of mic... Order-preserving submatrix (OPSM) has become important in modelling biologically meaningful subspace cluster, capturing the general tendency of gene expressions across a subset of conditions. With the advance of microarray and analysis techniques, big volume of gene expression datasets and OPSM mining results are produced. OPSM query can efficiently retrieve relevant OPSMs from the huge amount of OPSM datasets. However, improving OPSM query relevancy remains a difficult task in real life exploratory data analysis processing. First, it is hard to capture subjective interestingness aspects, e.g., the analyst's expectation given her/his domain knowledge. Second, when these expectations can be declaratively specified, it is still challenging to use them during the computational process of OPSM queries. With the best of our knowledge, existing methods mainly fo- cus on batch OPSM mining, while few works involve OPSM query. To solve the above problems, the paper proposes two constrained OPSM query methods, which exploit userdefined constraints to search relevant results from two kinds of indices introduced. In this paper, extensive experiments are conducted on real datasets, and experiment results demonstrate that the multi-dimension index (cIndex) and enumerating sequence index (esIndex) based queries have better performance than brute force search. 展开更多
关键词 gene expression data OPSM constrained query brute-force search feature sequence cIndex
原文传递
Flexibility and rigidity index for chromosome packing,flexibility and dynamics analysis
3
作者 Jiajie Peng Jinjin Yang +2 位作者 D Vijay Anand xuequn shang Kelin Xia 《Frontiers of Computer Science》 SCIE EI CSCD 2022年第4期203-213,共11页
The packing of genomic DNA from double helix into highly-order hierarchical assemblies has a great impact on chromosome flexibility,dynamics and functions.The open and accessible regions of chromosomes are primary bin... The packing of genomic DNA from double helix into highly-order hierarchical assemblies has a great impact on chromosome flexibility,dynamics and functions.The open and accessible regions of chromosomes are primary binding positions for regulatory elements and are crucial to nuclear processes and biological functions.Motivated by the success of flexibility-rigidity index(FRI)in biomolecular flexibility analysis and drug design,we propose an FRI-based model for quantitatively characterizing chromosome flexibility.Based on Hi-C data,a flexibility index for each locus can be evaluated.Physically,flexibility is tightly related to packing density.Highly compacted regions are usually more rigid,while loosely packed regions are more flexible.Indeed,a strong correlation is found between our flexibility index and DNase and ATAC values,which are measurements for chromosome accessibility.In addition,the genome regions with higher chromosome flexibility have a higher chance to be bound by transcription factors.Recently,the Gaussian network model(GNM)is applied to analyze the chromosome accessibility and a mobility profile has been proposed to characterize chromosome flexibility.Compared with GNM,our FRI is slightly more accurate(1%to 2%increase)and significantly more efficient in both computational time and costs.For a 5Kb resolution Hi-C data,the flexibility evaluation process only takes FRI a few minutes on a single-core processor.In contrast,GNM requires 1.5 hours on 10 CPUs.Moreover,interchromosome interactions can be easily combined into the flexibility evaluation,thus further enhancing the accuracy of our FRI.In contrast,the consideration of interchromosome information into GNM will significantly increase the size of its Laplacian(or Kirchhoff)matrix,thus becoming computationally extremely challenging for the current GNM.The software and supplementary document are available at https://github.com/jiajiepeng/FRI_chrFle. 展开更多
关键词 flexibility-rigidity index 3D genome chromosome flexibility
原文传递
Copy number variation related disease genes
4
作者 Chaima Aouiche xuequn shang Bolin Chen 《Frontiers of Electrical and Electronic Engineering in China》 CSCD 2018年第2期99-112,共14页
Background: One of the most important and challenging issues in biomedicine and genomics is how to identify disease related genes. Datasets from high-throughput biotechnologies have been widely used to overcome this ... Background: One of the most important and challenging issues in biomedicine and genomics is how to identify disease related genes. Datasets from high-throughput biotechnologies have been widely used to overcome this issue from various perspectives, e.g., epigenomics, genomics, transcriptomics, proteomics, metabolomics. At the genomic level, copy number variations (CNVs) have been recognized as critical genetic variations, which contribute significantly to genomic diversity. They have been associated with both common and complex diseases, and thus have a large influence on a variety of Mendelian and somatic genetic disorders. Results: In this review, based on a variety of complex diseases, we give an overview about the critical role of using CNVs for identifying disease related genes, and discuss on details the different high-throughput and sequencing methods applied for CNV detection. Some limitations and challenges concerning CNV are also highlighted. Conclusions: Reliable detection of CNVs will not only allow discriminating driver mutations for various diseases, but also helps to develop personalized medicine when integrating it with other genomic features. 展开更多
关键词 CNV disease gene complex disease targeted approach genome-wide approach whole exome sequencing
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部