To unravel the genetic mechanisms of disease and physiological traits,it requires comprehensive sequencing analysis of large sample size in Chinese populations.Here,we report the primary results of the Chinese Academy...To unravel the genetic mechanisms of disease and physiological traits,it requires comprehensive sequencing analysis of large sample size in Chinese populations.Here,we report the primary results of the Chinese Academy of Sciences Precision Medicine Initiative(CASPMI)project launched by the Chinese Academy of Sciences,including the de novo assembly of a northern Han reference genome(NH1.0)and whole genome analyses of 597 healthy people coming from most areas in China.Given the two existing reference genomes for Han Chinese(YH and HX1)were both from the south,we constructed NH1.0,a new reference genome from a northern individual,by combining the sequencing strategies of PacBio,10×Genomics,and Bionano mapping.Using this integrated approach,we obtained an N50 scaffold size of 46.63 Mb for the NH1.0 genome and performed a comparative genome analysis of NH1.0 with YH and HX1.In order to generate a genomic variation map of Chinese populations,we performed the whole-genome sequencing of 597 participants and identified 24.85 million(M)single nucleotide variants(SNVs),3.85 M small indels,and 106,382 structural variations.In the association analysis with collected phenotypes,we found that the T allele of rs1549293 in KAT8 significantly correlated with the waist circumference in northern Han males.Moreover,significant genetic diversity in MTHFR,TCN2,FADS1,and FADS2,which associate with circulating folate,vitamin B12,or lipid metabolism,was observed between northerners and southerners.Especially,for the homocysteine-increasing allele of rs1801133(MTHFR 677T),we hypothesize that there exists a “comfort”zone for a high frequency of 677T between latitudes of 35–45 degree North.Taken together,our results provide a high-quality northern Han reference genome and novel population-specific data sets of genetic variants for use in the personalized and precision medicine.展开更多
Postzygotic mutations are acquired in normal tissues throughout an individual’s lifetime and hold clues for identifying mutagenic factors.Here,we investigated postzygotic mutation spectra of healthy individuals using...Postzygotic mutations are acquired in normal tissues throughout an individual’s lifetime and hold clues for identifying mutagenic factors.Here,we investigated postzygotic mutation spectra of healthy individuals using optimized ultra-deep exome sequencing of the time-series samples from the same volunteer as well as the samples from different individuals.In blood,sperm,and muscle cells,we resolved three common types of mutational signatures.Signatures A and B represent clocklike mutational processes,and the polymorphisms of epigenetic regulation genes influence the proportion of signature B in mutation profiles.Notably,signature C,characterized by C>T transitions at GpCpN sites,tends to be a feature of diverse normal tissues.Mutations of this type are likely to occur early during embryonic development,supported by their relatively high allelic frequencies,presence in multiple tissues,and decrease in occurrence with age.Almost none of the public datasets for tumors feature this signature,except for 19.6%of samples of clear cell renal cell carcinoma with increased activation of the hypoxia-inducible factor 1(HIF-1)signaling pathway.Moreover,the accumulation of signature C in the mutation profile was accelerated in a human embryonic stem cell line with drug-induced activation of HIF-1α.Thus,embryonic hypoxia may explain this novel signature across multiple normal tissues.Our study suggests that hypoxic condition in an early stage of embryonic development is a crucial factor inducing C>T transitions at GpCpN sites;and individuals’genetic background may also influence their postzygotic mutation profiles.展开更多
基金supported by the grants of Key Program of the Chinese Academy of Sciences(Grant No.KJZD-EW-L14 awarded to CZ)the National Key R&D Program of China from the Ministry of Science and Technology of China(Grant No.2016YFB0201702 awarded to JX,as well as Grant Nos.2016YFC0901701 and 2018YFC0910700 awarded to XF)
文摘To unravel the genetic mechanisms of disease and physiological traits,it requires comprehensive sequencing analysis of large sample size in Chinese populations.Here,we report the primary results of the Chinese Academy of Sciences Precision Medicine Initiative(CASPMI)project launched by the Chinese Academy of Sciences,including the de novo assembly of a northern Han reference genome(NH1.0)and whole genome analyses of 597 healthy people coming from most areas in China.Given the two existing reference genomes for Han Chinese(YH and HX1)were both from the south,we constructed NH1.0,a new reference genome from a northern individual,by combining the sequencing strategies of PacBio,10×Genomics,and Bionano mapping.Using this integrated approach,we obtained an N50 scaffold size of 46.63 Mb for the NH1.0 genome and performed a comparative genome analysis of NH1.0 with YH and HX1.In order to generate a genomic variation map of Chinese populations,we performed the whole-genome sequencing of 597 participants and identified 24.85 million(M)single nucleotide variants(SNVs),3.85 M small indels,and 106,382 structural variations.In the association analysis with collected phenotypes,we found that the T allele of rs1549293 in KAT8 significantly correlated with the waist circumference in northern Han males.Moreover,significant genetic diversity in MTHFR,TCN2,FADS1,and FADS2,which associate with circulating folate,vitamin B12,or lipid metabolism,was observed between northerners and southerners.Especially,for the homocysteine-increasing allele of rs1801133(MTHFR 677T),we hypothesize that there exists a “comfort”zone for a high frequency of 677T between latitudes of 35–45 degree North.Taken together,our results provide a high-quality northern Han reference genome and novel population-specific data sets of genetic variants for use in the personalized and precision medicine.
基金supported by the grants from the Strategic Priority Research Program of Chinese Academy of Sciences(Grant No.XDB13020500)the National Natural Science Foundation of China(NSFC)(Grant Nos.91131905,31471199,and 91631304)+3 种基金the Key Research Program of Chinese Academy of Sciences(Grant No.KJZD-EW-L14 to CZ)the NSFC(Grant Nos.31440057 and 31701081 to WC)the 111 Project(Grant No.B13003 to WC and DZ)the Innovation Promotion Association of Chinese Academy of Sciences(Grant Nos.2016098 to DZ and 2019103 to AC)。
文摘Postzygotic mutations are acquired in normal tissues throughout an individual’s lifetime and hold clues for identifying mutagenic factors.Here,we investigated postzygotic mutation spectra of healthy individuals using optimized ultra-deep exome sequencing of the time-series samples from the same volunteer as well as the samples from different individuals.In blood,sperm,and muscle cells,we resolved three common types of mutational signatures.Signatures A and B represent clocklike mutational processes,and the polymorphisms of epigenetic regulation genes influence the proportion of signature B in mutation profiles.Notably,signature C,characterized by C>T transitions at GpCpN sites,tends to be a feature of diverse normal tissues.Mutations of this type are likely to occur early during embryonic development,supported by their relatively high allelic frequencies,presence in multiple tissues,and decrease in occurrence with age.Almost none of the public datasets for tumors feature this signature,except for 19.6%of samples of clear cell renal cell carcinoma with increased activation of the hypoxia-inducible factor 1(HIF-1)signaling pathway.Moreover,the accumulation of signature C in the mutation profile was accelerated in a human embryonic stem cell line with drug-induced activation of HIF-1α.Thus,embryonic hypoxia may explain this novel signature across multiple normal tissues.Our study suggests that hypoxic condition in an early stage of embryonic development is a crucial factor inducing C>T transitions at GpCpN sites;and individuals’genetic background may also influence their postzygotic mutation profiles.