Despite considerable advances in extracting crucial insights from bio-omics data to unravel the intricate mechanisms underlying complex traits,the absence of a universal multi-modal computational tool with robust inte...Despite considerable advances in extracting crucial insights from bio-omics data to unravel the intricate mechanisms underlying complex traits,the absence of a universal multi-modal computational tool with robust interpretability for accurate phenotype prediction and identification of trait-associated genes remains a challenge.This study introduces the dual-extraction modeling(DEM)approach,a multi-modal deep-learning architecture designed to extract representative features from heterogeneous omics datasets,enabling the prediction of complex trait phenotypes.Through comprehensive benchmarking experiments,we demonstrate the efficacy of DEM in classification and regression prediction of complex traits.DEM consistently exhibits superior accuracy,robustness,generalizability,and flexibility.Notably,we establish its effectiveness in predicting pleiotropic genes that influence both flowering time and rosette leaf number,underscoring its commendable interpretability.In addition,we have developed user-friendly software to facilitate seamless utilization of DEM’s functions.In summary,this study presents a state-of-the-art approach with the ability to effectively predict qualitative and quantitative traits and identify functional genes,confirming its potential as a valuable tool for exploring the genetic basis of complex traits.展开更多
The Collaborative Cross(CC)mouse model is a next‐generation mouse genetic reference population(GRP)designated for a high‐resolution quantitative trait loci(QTL)mapping of complex traits during health and disease.The...The Collaborative Cross(CC)mouse model is a next‐generation mouse genetic reference population(GRP)designated for a high‐resolution quantitative trait loci(QTL)mapping of complex traits during health and disease.The CC lines were generated from reciprocal crosses of eight divergent mouse founder strains composed of five classical and three wild‐derived strains.Complex traits are defined to be controlled by variations within multiple genes and the gene/environment interactions.In this article,we introduce and present variety of protocols and results of studying the host response to infectious and chronic diseases,including type 2 diabetes and metabolic diseases,body composition,immune response,colorectal cancer,susceptibility to Aspergillus fumigatus,Klebsiella pneumoniae,Pseudomonas aeruginosa,sepsis,and mixed infections of Porphyromonas gingivalis and Fusobacterium nucleatum,which were conducted at our laboratory using the CC mouse population.These traits are observed at multiple levels of the body systems,including metabolism,body weight,immune profile,susceptibility or resistance to the development and progress of infectious or chronic diseases.Herein,we present full protocols and step‐by‐step methods,implemented in our laboratory for the phenotypic and genotypic characterization of the different CC lines,mapping the gene underlying the host response to these infections and chronic diseases.The CC mouse model is a unique and powerful GRP for dissecting the host genetic architectures underlying complex traits,including chronic and infectious diseases.展开更多
Phenotypic plasticity is the ability of a given genotype to produce multiple phenotypes in response to changing environmental conditions.Understanding the genetic basis of phenotypic plasticity and establishing a pred...Phenotypic plasticity is the ability of a given genotype to produce multiple phenotypes in response to changing environmental conditions.Understanding the genetic basis of phenotypic plasticity and establishing a predictive model is highly relevant to future agriculture under a changing climate.Here we report findings on the genetic basis of phenotypic plasticity for 23 complex traits using a diverse maize population planted at five sites with distinct environmental conditions.We found that latituderelated environmental factors were the main drivers of across-site variation in flowering time traits but not in plant architecture or yield traits.For the 23 traits,we detected 109 quantitative trait loci(QTLs),29 for mean values,66 for plasticity,and 14 for both parameters,and 80%of the QTLs interacted with latitude.The effects of several QTLs changed in magnitude or sign,driving variation in phenotypic plasticity.We experimentally validated one plastic gene,ZmTPS14.1,whose effect was likely mediated by the compensation effect of ZmSPL6 from a downstream pathway.By integrating genetic diversity,environmental variation,and their interaction into a joint model,we could provide site-specific predictions with increased accuracy by as much as 9.9%,2.2%,and 2.6%for days to tassel,plant height,and ear weight,respectively.This study revealed a complex genetic architecture involving multiple alleles,pleiotropy,and genotype-byenvironment interaction that underlies variation in the mean and plasticity of maize complex traits.It provides novel insights into the dynamic genetic architecture of agronomic traits in response to changing environments,paving a practical way toward precision agriculture.展开更多
A spate of high-powered genome-wide association studies (GWAS) have recently identified numerous single-nucleotide polymorphisms (SNPs) robustly linked with complex disease. Despite interrogating the majority of c...A spate of high-powered genome-wide association studies (GWAS) have recently identified numerous single-nucleotide polymorphisms (SNPs) robustly linked with complex disease. Despite interrogating the majority of common human variation, these SNPs only account for a small proportion of the phenotypic variance, which suggests genetic factors are acting in concert with non-genetic factors. Although environmental measures are logical covariants for genotype-phenotype investigations, another non-genetic intermediary exists: epigenetics. Epigenetics is the analysis of somatically-acquired and, in some cases, transgenerationally inherited epigenetic modifications that regulate gene expression, and offers to bridge the gap between genetics and environment to understand phenotype. The most widely studied epigenetic mark is DNA methylation. Aberrant methylation at gene promoters is strongly implicated in disease etiology, most notably cancer. This review will highlight the importance of DNA methylation as an epigenetic regulator, outline techniques to characterize the DNA methylome and present the idea of reverse phenotyping, where multiple layers of analysis are integrated at the individual level to create personalized digital phenotypes and, at a phenotype level, to identify novel molecular signatures of disease.展开更多
全基因组关联分析(genomewide association study,GWAS)是应用人类基因组中数以百万计的单核苷酸多态性(single nucleotide polymorphism,SNP)为标记进行病例-对照关联分析,以期发现影响复杂性疾病发生的遗传特征的一种新策略。近年来,...全基因组关联分析(genomewide association study,GWAS)是应用人类基因组中数以百万计的单核苷酸多态性(single nucleotide polymorphism,SNP)为标记进行病例-对照关联分析,以期发现影响复杂性疾病发生的遗传特征的一种新策略。近年来,随着人类基因组计划和基因组单倍体图谱计划的实施,人们已通过GWAS方法发现并鉴定了大量与人类性状或复杂性疾病关联的遗传变异,为进一步了解控制人类复杂性疾病发生的遗传特征提供了重要的线索。然而,由于造成复杂性疾病/性状的因素较多,而且GWAS研究系统较为复杂,因此目前GWAS本身亦存在诸多的问题。本文将从研究方式、研究对象、遗传标记,以及统计分析等方面,探讨GWAS的研究现状以及存在的潜在问题,并展望GWAS今后的发展方向。展开更多
利用高密度单核苷酸多态(Single nucleotide polymorphism,SNP)标记在全基因组范围内检测影响复杂疾病/性状的染色体区段或基因,已经成为目前遗传学领域新的突破点之一。在全基因组关联研究(Genome-wide association study,GWAS)取得大...利用高密度单核苷酸多态(Single nucleotide polymorphism,SNP)标记在全基因组范围内检测影响复杂疾病/性状的染色体区段或基因,已经成为目前遗传学领域新的突破点之一。在全基因组关联研究(Genome-wide association study,GWAS)取得大量成果之后,研究者们对在全基因范围内研究交互作用产生了极大的热情。近几年,对交互作用的研究,无论是在方法的研发、实际的应用以及统计学上的交互向生物学上的交互转化,还是在信息组学的整合,都呈现快速发展的趋势。已有很多策略和方法被尝试用于进行全基因组交互作用分析,这些研究推动了对复杂疾病/性状遗传机制的进一步认识。基于目前全基因组交互分析所采用的各类数据处理方法的理论与算法的异同,文章拟对目前使用较为广泛的回归类方法、机器学习方法、贝叶斯模型法、SNP筛选类方法和基于并行程序的方法等5类方法加以评述,着重介绍了这些方法的算法原理、计算效率以及差别之处,以期能够为相关领域的研究者提供参考。展开更多
在过去的5年中,全基因组关联研究(Genome-wide association study,GWAS)方法已被证明是研究复杂疾病和性状遗传易感变异的一种有效手段。目前,各国科学家在多种复杂疾病和性状中开展了大量的GWAS,对肿瘤、糖尿病、心脏病、神经精神疾病...在过去的5年中,全基因组关联研究(Genome-wide association study,GWAS)方法已被证明是研究复杂疾病和性状遗传易感变异的一种有效手段。目前,各国科学家在多种复杂疾病和性状中开展了大量的GWAS,对肿瘤、糖尿病、心脏病、神经精神疾病、自身免疫及免疫相关疾病等复杂疾病以及一些常见性状(如身高、体重、血脂、色素等)的遗传易感基因研究取得了重大成果。截止到2010年9月11日,运用GWAS开展了对近200种复杂疾病/性状的研究,发现了3000多个疾病相关的遗传变异。文章就GWAS的发展及其在复杂疾病/性状中的应用做一综述。展开更多
2005年至今,全基因组关联研究(Genome-wide association study,GWAS)发现了大量复杂疾病/性状相关变异。近来,科学家们关注的焦点又集中在了如何利用GWAS数据进行深入分析,期待发现更多复杂疾病/性状的易感基因。一些新的策略和方法已...2005年至今,全基因组关联研究(Genome-wide association study,GWAS)发现了大量复杂疾病/性状相关变异。近来,科学家们关注的焦点又集中在了如何利用GWAS数据进行深入分析,期待发现更多复杂疾病/性状的易感基因。一些新的策略和方法已经被尝试应用到复杂疾病/性状GWAS的后续研究中,例如深入分析GWAS数据;鉴定新的复杂疾病/性状易感基因/位点;国际合作和Meta分析;易感区域精细定位及测序;多种疾病共同易感基因研究;以及基因型填补,基于通路的关联分析,基因-基因、基因-环境交互作用和上位研究等。这些策略和方法的应用弥补了经典GWAS的一些不足之处,进一步推动了人类对复杂疾病/性状遗传机制的认识。文章对上述研究的策略、方法以及所面临的问题和挑战进行了综述,为读者描绘了GWAS后期工作的一个简要框架。展开更多
过去20多年复杂疾病易感基因鉴定的主要方法是连锁分析和关联研究。因为连锁分析确定的数量性状位点通常很宽,加之对区域内大部分基因的功能以及基因功能和疾病之间联系的认识十分有限,所以从数量性状位点到基因的识别是一个挑战。近年...过去20多年复杂疾病易感基因鉴定的主要方法是连锁分析和关联研究。因为连锁分析确定的数量性状位点通常很宽,加之对区域内大部分基因的功能以及基因功能和疾病之间联系的认识十分有限,所以从数量性状位点到基因的识别是一个挑战。近年来发展了一些利用公共数据库的信息预测疾病易感基因的计算生物学方法。文章简要介绍了DGP、GeneSeeker、Prioritizer、PROSPECTR and SUSPECTS及Endeavor5种计算生物学方法的基本原理,以2型糖尿病/肥胖和骨质疏松症易感基因的预测为例说明它们的应用方法,并讨论了这些方法的局限及应用前景。展开更多
复杂疾病目前正在全球范围流行,极大地影响人类的健康。研究发现,复杂疾病的性状受到多个位点的相互作用影响。目前的全基因组关联分析(Genome-wide association study,GWAS)仅仅解析单个SNP位点对疾病易感性的贡献,单纯依靠这一种策略...复杂疾病目前正在全球范围流行,极大地影响人类的健康。研究发现,复杂疾病的性状受到多个位点的相互作用影响。目前的全基因组关联分析(Genome-wide association study,GWAS)仅仅解析单个SNP位点对疾病易感性的贡献,单纯依靠这一种策略并不能在寻找复杂疾病的病因上得到根本性的突破。基因-基因相互作用可能是复杂疾病致病的主要因素之一。针对这一点,科学家已经提出了一些检验基因相互作用的算法,包括惩罚logistic回归模型、多因子降维(Multifactor dimensional reduction)、集合关联法(Set-association approach)、贝叶斯网络(Bayesian networks)、随机森林法等。文章首先对目前这些方法做了综述,并指出了其中的不足,包括计算复杂度太高、假设驱动、数据会过度拟合、对低维数据不敏感等,进而简述了一种由笔者所在实验室开发的基于GPU的研究基因相互作用的算法,该算法复杂度低,不需要任何假设,没有边际效应,有很好的稳定性,速度快,适用于进行全基因组范围内的基因-基因相互作用计算。展开更多
Altitude acclimatization is a human physiological process of adjusting to the decreased oxygen availability.Since several physiological processes are involved and their correlations are complicated,the analyses of sin...Altitude acclimatization is a human physiological process of adjusting to the decreased oxygen availability.Since several physiological processes are involved and their correlations are complicated,the analyses of single traits are insufficient in revealing the complex mechanism of high-altitude acclimatization.In this study,we examined these physiological responses as the composite phenotypes that are represented by a linear combination of physiological traits.We developed a strategy that combines both spectral clustering and partial least squares path modeling(PLSPM)to define composite phenotypes based on a cohort study of 883 Chinese Han males.In addition,we captured 14 composite phenotypes from 28 physiological traits of high-altitude acclimatization.Using these composite phenotypes,we applied k-means clustering to reveal hidden population physiological heterogeneity in high-altitude acclimatization.Furthermore,we employed multivariate linear regression to systematically model(Models 1 and 2)oxygen saturation(SpO_(2))changes in high-altitude acclimatization and evaluated model fitness performance.Composite phenotypes based on Model 2 fit better than single trait-based Model 1 in all measurement indices.This new strategy of using composite phenotypes may be potentially employed as a general strategy for complex traits research such as genetic loci discovery and analyses of phenomics.展开更多
TSC renal cystic disease is poorly understood and has no approved treatment.In a new principal cell-targeted murine model of Tsc cystic disease,the renal cystic epithelium is mostly composed of type A intercalated cel...TSC renal cystic disease is poorly understood and has no approved treatment.In a new principal cell-targeted murine model of Tsc cystic disease,the renal cystic epithelium is mostly composed of type A intercalated cells with an intact Tsc2 gene confirmed by sequencing,although these cells exhibit a Tsc-mutant disease phenotype.We used a newly derived targeted murine model in lineage tracing and extracellular vesicle(EV)characterization experiments and a cell culture model in EV characterization and cellular induction experiments to understand TSC cystogenesis.Using lineage tracing experiments,we found principal cells undergo clonal expansion but contribute very few cells to the cyst.We determined that cystic kidneys contain more interstitial EVs than noncystic kidneys,excrete fewer EVs in urine,and contain EVs in cyst fluid.Moreover,the loss of Tsc2 gene in EV-producing cells greatly changes the effect of EVs on renal tubular epithelium,such that the epithelium develops increased secretory and proliferative pathway activity.We demonstate that the mTORC1 pathway activity is independent form the EV production,and that the EV effects for a single cell line can vary significantly.TSC cystogenesis involves significant contribution from genetically intact cells conscripted to the mutant phenotype by mutant cell derived EVs.展开更多
基金supported by the National Natural Science Foundation of China(32370723,32000410)。
文摘Despite considerable advances in extracting crucial insights from bio-omics data to unravel the intricate mechanisms underlying complex traits,the absence of a universal multi-modal computational tool with robust interpretability for accurate phenotype prediction and identification of trait-associated genes remains a challenge.This study introduces the dual-extraction modeling(DEM)approach,a multi-modal deep-learning architecture designed to extract representative features from heterogeneous omics datasets,enabling the prediction of complex trait phenotypes.Through comprehensive benchmarking experiments,we demonstrate the efficacy of DEM in classification and regression prediction of complex traits.DEM consistently exhibits superior accuracy,robustness,generalizability,and flexibility.Notably,we establish its effectiveness in predicting pleiotropic genes that influence both flowering time and rosette leaf number,underscoring its commendable interpretability.In addition,we have developed user-friendly software to facilitate seamless utilization of DEM’s functions.In summary,this study presents a state-of-the-art approach with the ability to effectively predict qualitative and quantitative traits and identify functional genes,confirming its potential as a valuable tool for exploring the genetic basis of complex traits.
基金Hendrech and Eiran Gotwert FundWellcome, Grant/Award Number: 085906/Z/08/Z, 075491/Z/04 and 090532/Z/09/Z+6 种基金Tel-Aviv UniversityIsraeli Science foundation, Grant/Award Number: 429/09, 961/15 and 1085/18Binational Science Foundation, Grant/Award Number: 2015077German Israeli Science Foundation, Grant/Award Number: I-63-410.20-2017Israeli Cancer Research FundCancer Research Counsel-UK Cancer Biology Research Center
文摘The Collaborative Cross(CC)mouse model is a next‐generation mouse genetic reference population(GRP)designated for a high‐resolution quantitative trait loci(QTL)mapping of complex traits during health and disease.The CC lines were generated from reciprocal crosses of eight divergent mouse founder strains composed of five classical and three wild‐derived strains.Complex traits are defined to be controlled by variations within multiple genes and the gene/environment interactions.In this article,we introduce and present variety of protocols and results of studying the host response to infectious and chronic diseases,including type 2 diabetes and metabolic diseases,body composition,immune response,colorectal cancer,susceptibility to Aspergillus fumigatus,Klebsiella pneumoniae,Pseudomonas aeruginosa,sepsis,and mixed infections of Porphyromonas gingivalis and Fusobacterium nucleatum,which were conducted at our laboratory using the CC mouse population.These traits are observed at multiple levels of the body systems,including metabolism,body weight,immune profile,susceptibility or resistance to the development and progress of infectious or chronic diseases.Herein,we present full protocols and step‐by‐step methods,implemented in our laboratory for the phenotypic and genotypic characterization of the different CC lines,mapping the gene underlying the host response to these infections and chronic diseases.The CC mouse model is a unique and powerful GRP for dissecting the host genetic architectures underlying complex traits,including chronic and infectious diseases.
基金funded by the Natural Science Foundation of China(31961133002,31901553,and 31771879)the National Key Research and Development Program of China(2020YFE0202300)+3 种基金the Science and Technology Major Program of Hubei Province(2021ABA011)the Swedish Research Council for Environment,Agricultural Sciences,and Spatial Planning(2019-01600)the Key Science and Technology Project of the China National Tobacco Corporation(110202101040 JY-17)the Jilin Scientific and Technological Development Program(20190201290JC).
文摘Phenotypic plasticity is the ability of a given genotype to produce multiple phenotypes in response to changing environmental conditions.Understanding the genetic basis of phenotypic plasticity and establishing a predictive model is highly relevant to future agriculture under a changing climate.Here we report findings on the genetic basis of phenotypic plasticity for 23 complex traits using a diverse maize population planted at five sites with distinct environmental conditions.We found that latituderelated environmental factors were the main drivers of across-site variation in flowering time traits but not in plant architecture or yield traits.For the 23 traits,we detected 109 quantitative trait loci(QTLs),29 for mean values,66 for plasticity,and 14 for both parameters,and 80%of the QTLs interacted with latitude.The effects of several QTLs changed in magnitude or sign,driving variation in phenotypic plasticity.We experimentally validated one plastic gene,ZmTPS14.1,whose effect was likely mediated by the compensation effect of ZmSPL6 from a downstream pathway.By integrating genetic diversity,environmental variation,and their interaction into a joint model,we could provide site-specific predictions with increased accuracy by as much as 9.9%,2.2%,and 2.6%for days to tassel,plant height,and ear weight,respectively.This study revealed a complex genetic architecture involving multiple alleles,pleiotropy,and genotype-byenvironment interaction that underlies variation in the mean and plasticity of maize complex traits.It provides novel insights into the dynamic genetic architecture of agronomic traits in response to changing environments,paving a practical way toward precision agriculture.
文摘A spate of high-powered genome-wide association studies (GWAS) have recently identified numerous single-nucleotide polymorphisms (SNPs) robustly linked with complex disease. Despite interrogating the majority of common human variation, these SNPs only account for a small proportion of the phenotypic variance, which suggests genetic factors are acting in concert with non-genetic factors. Although environmental measures are logical covariants for genotype-phenotype investigations, another non-genetic intermediary exists: epigenetics. Epigenetics is the analysis of somatically-acquired and, in some cases, transgenerationally inherited epigenetic modifications that regulate gene expression, and offers to bridge the gap between genetics and environment to understand phenotype. The most widely studied epigenetic mark is DNA methylation. Aberrant methylation at gene promoters is strongly implicated in disease etiology, most notably cancer. This review will highlight the importance of DNA methylation as an epigenetic regulator, outline techniques to characterize the DNA methylome and present the idea of reverse phenotyping, where multiple layers of analysis are integrated at the individual level to create personalized digital phenotypes and, at a phenotype level, to identify novel molecular signatures of disease.
文摘全基因组关联分析(genomewide association study,GWAS)是应用人类基因组中数以百万计的单核苷酸多态性(single nucleotide polymorphism,SNP)为标记进行病例-对照关联分析,以期发现影响复杂性疾病发生的遗传特征的一种新策略。近年来,随着人类基因组计划和基因组单倍体图谱计划的实施,人们已通过GWAS方法发现并鉴定了大量与人类性状或复杂性疾病关联的遗传变异,为进一步了解控制人类复杂性疾病发生的遗传特征提供了重要的线索。然而,由于造成复杂性疾病/性状的因素较多,而且GWAS研究系统较为复杂,因此目前GWAS本身亦存在诸多的问题。本文将从研究方式、研究对象、遗传标记,以及统计分析等方面,探讨GWAS的研究现状以及存在的潜在问题,并展望GWAS今后的发展方向。
文摘利用高密度单核苷酸多态(Single nucleotide polymorphism,SNP)标记在全基因组范围内检测影响复杂疾病/性状的染色体区段或基因,已经成为目前遗传学领域新的突破点之一。在全基因组关联研究(Genome-wide association study,GWAS)取得大量成果之后,研究者们对在全基因范围内研究交互作用产生了极大的热情。近几年,对交互作用的研究,无论是在方法的研发、实际的应用以及统计学上的交互向生物学上的交互转化,还是在信息组学的整合,都呈现快速发展的趋势。已有很多策略和方法被尝试用于进行全基因组交互作用分析,这些研究推动了对复杂疾病/性状遗传机制的进一步认识。基于目前全基因组交互分析所采用的各类数据处理方法的理论与算法的异同,文章拟对目前使用较为广泛的回归类方法、机器学习方法、贝叶斯模型法、SNP筛选类方法和基于并行程序的方法等5类方法加以评述,着重介绍了这些方法的算法原理、计算效率以及差别之处,以期能够为相关领域的研究者提供参考。
文摘在过去的5年中,全基因组关联研究(Genome-wide association study,GWAS)方法已被证明是研究复杂疾病和性状遗传易感变异的一种有效手段。目前,各国科学家在多种复杂疾病和性状中开展了大量的GWAS,对肿瘤、糖尿病、心脏病、神经精神疾病、自身免疫及免疫相关疾病等复杂疾病以及一些常见性状(如身高、体重、血脂、色素等)的遗传易感基因研究取得了重大成果。截止到2010年9月11日,运用GWAS开展了对近200种复杂疾病/性状的研究,发现了3000多个疾病相关的遗传变异。文章就GWAS的发展及其在复杂疾病/性状中的应用做一综述。
文摘2005年至今,全基因组关联研究(Genome-wide association study,GWAS)发现了大量复杂疾病/性状相关变异。近来,科学家们关注的焦点又集中在了如何利用GWAS数据进行深入分析,期待发现更多复杂疾病/性状的易感基因。一些新的策略和方法已经被尝试应用到复杂疾病/性状GWAS的后续研究中,例如深入分析GWAS数据;鉴定新的复杂疾病/性状易感基因/位点;国际合作和Meta分析;易感区域精细定位及测序;多种疾病共同易感基因研究;以及基因型填补,基于通路的关联分析,基因-基因、基因-环境交互作用和上位研究等。这些策略和方法的应用弥补了经典GWAS的一些不足之处,进一步推动了人类对复杂疾病/性状遗传机制的认识。文章对上述研究的策略、方法以及所面临的问题和挑战进行了综述,为读者描绘了GWAS后期工作的一个简要框架。
文摘过去20多年复杂疾病易感基因鉴定的主要方法是连锁分析和关联研究。因为连锁分析确定的数量性状位点通常很宽,加之对区域内大部分基因的功能以及基因功能和疾病之间联系的认识十分有限,所以从数量性状位点到基因的识别是一个挑战。近年来发展了一些利用公共数据库的信息预测疾病易感基因的计算生物学方法。文章简要介绍了DGP、GeneSeeker、Prioritizer、PROSPECTR and SUSPECTS及Endeavor5种计算生物学方法的基本原理,以2型糖尿病/肥胖和骨质疏松症易感基因的预测为例说明它们的应用方法,并讨论了这些方法的局限及应用前景。
基金supported by Shanghai Municipal Science and Technology Major Project(2017SHZDZX01)National Science Foundation of China(31330038)+5 种基金CAMS Innovation Fund for Medical Sciences(2019-I2M-5-066)Science and Technology Committee of Shanghai Municipality(16JC1400500)Ministry of Science and Technology(2015FY1117000)the 111 Project(B13016)Major Project of Special Development Funds of Zhangjiang National Independent Innovation Demonstration Zone(ZJ2019-ZD-004)supported by the Postdoctoral Science Foundation of China(2018M640333).
文摘Altitude acclimatization is a human physiological process of adjusting to the decreased oxygen availability.Since several physiological processes are involved and their correlations are complicated,the analyses of single traits are insufficient in revealing the complex mechanism of high-altitude acclimatization.In this study,we examined these physiological responses as the composite phenotypes that are represented by a linear combination of physiological traits.We developed a strategy that combines both spectral clustering and partial least squares path modeling(PLSPM)to define composite phenotypes based on a cohort study of 883 Chinese Han males.In addition,we captured 14 composite phenotypes from 28 physiological traits of high-altitude acclimatization.Using these composite phenotypes,we applied k-means clustering to reveal hidden population physiological heterogeneity in high-altitude acclimatization.Furthermore,we employed multivariate linear regression to systematically model(Models 1 and 2)oxygen saturation(SpO_(2))changes in high-altitude acclimatization and evaluated model fitness performance.Composite phenotypes based on Model 2 fit better than single trait-based Model 1 in all measurement indices.This new strategy of using composite phenotypes may be potentially employed as a general strategy for complex traits research such as genetic loci discovery and analyses of phenomics.
基金This work was supported by DoD(No.W81XWH-14-1-0343)(JJB)Federal Express Chair of Excellence(JJB),and Children’s Foundation Research Institute(JJB).
文摘TSC renal cystic disease is poorly understood and has no approved treatment.In a new principal cell-targeted murine model of Tsc cystic disease,the renal cystic epithelium is mostly composed of type A intercalated cells with an intact Tsc2 gene confirmed by sequencing,although these cells exhibit a Tsc-mutant disease phenotype.We used a newly derived targeted murine model in lineage tracing and extracellular vesicle(EV)characterization experiments and a cell culture model in EV characterization and cellular induction experiments to understand TSC cystogenesis.Using lineage tracing experiments,we found principal cells undergo clonal expansion but contribute very few cells to the cyst.We determined that cystic kidneys contain more interstitial EVs than noncystic kidneys,excrete fewer EVs in urine,and contain EVs in cyst fluid.Moreover,the loss of Tsc2 gene in EV-producing cells greatly changes the effect of EVs on renal tubular epithelium,such that the epithelium develops increased secretory and proliferative pathway activity.We demonstate that the mTORC1 pathway activity is independent form the EV production,and that the EV effects for a single cell line can vary significantly.TSC cystogenesis involves significant contribution from genetically intact cells conscripted to the mutant phenotype by mutant cell derived EVs.