In a study published in Nature on June 10, researchers from Dr. YANG Hui’s Lab at the CAS Institute of Neuroscience (ION), and collaborators from the CAS-MPG Partner Institute for Computational Biology (PICB) and Sic...In a study published in Nature on June 10, researchers from Dr. YANG Hui’s Lab at the CAS Institute of Neuroscience (ION), and collaborators from the CAS-MPG Partner Institute for Computational Biology (PICB) and Sichuan University demonstrated that DNA base editors generated tens of thousands of off-target RNA single nucleotide variants (SNVs) and these off-target SNVs could be eliminated by introducing point mutations to the deaminases, built-in enzymes that act to rewrite the DNA bases.展开更多
为了评判烟用接装纸内在质量稳定性,应用近红外光纤漫反射技术扫描烟用接装纸,对所得光谱进行标准正则变换(Standard Normal Variation,SNV)和一阶微分处理后,采用主成分分析(PrincipalComponent Analysis,PCA)法进行特征抽提,根据主成...为了评判烟用接装纸内在质量稳定性,应用近红外光纤漫反射技术扫描烟用接装纸,对所得光谱进行标准正则变换(Standard Normal Variation,SNV)和一阶微分处理后,采用主成分分析(PrincipalComponent Analysis,PCA)法进行特征抽提,根据主成分空间下的马氏距离建立校正集,对不同厂家烟用接装纸进行模式识别,并建立评价模型对相同厂家接装纸内在质量稳定性进行评判。结果表明,建立的校正集模型可有效识别不同厂家的烟用接装纸,评价模型对相同厂家接装纸内在质量稳定性的判别完全准确。展开更多
Single-nucleotide variants(SNVs)are crucial in disease development,but their accurate detection is challenging due to their low abundance and interference from wild-type targets.Although nucleic acid analogs like pept...Single-nucleotide variants(SNVs)are crucial in disease development,but their accurate detection is challenging due to their low abundance and interference from wild-type targets.Although nucleic acid analogs like peptide nucleic acids(PNAs)have been used for SNV detection,they often lack programmable sensitivity and specificity due to poorly calculated thermodynamics and kinetics.Here,we present a computational method for calculating the stacking energy of PNA and DNA hybrids,leveraging nearest neighbor parameters.Validation against experimental data from 16 sequences under varied hybridization conditions yielded good agreement using Bland-Altman analysis,with all data points falling within the confidence interval.Our findings indicate that PNA-DNA hybridization is thermodynamically more stable and exhibits kinetics 140-fold faster than DNA-DNA hybridization for identical sequences.Utilizing this computational framework,we designed PNA toehold probes,which were screened via simulations and experiments.This combined approach facilitated the identification of highly sensitive and specific PNA toehold probes for single point mutation detection via strand displacement reaction.Our results demonstrate the successful application of PNA toehold probes for detecting point mutations with high sensitivity and specificity,achieving a selective amplification of approximately 200-fold for variants with a variant allele frequency(VAF)of 0.5%using quantitative polymerase chain reaction.展开更多
Autism spectrum disorder(ASD)is a neurodevelopmental disorder with high genetic heritability but heterogeneity.Fully understanding its genetics requires whole-genome sequencing(WGS),but the ASD studies utilizing WGS d...Autism spectrum disorder(ASD)is a neurodevelopmental disorder with high genetic heritability but heterogeneity.Fully understanding its genetics requires whole-genome sequencing(WGS),but the ASD studies utilizing WGS data in Chinese population are limited.In this study,we present a WGS study for 334 individuals,including 112 ASD patients and their non-ASD parents.We identified 146 de novo variants in coding regions in 85 cases and 60 inherited variants in coding regions.By integrating these variants with an association model,we identified 33 potential risk genes(P<0.001)enriched in neuron and regulation related biological process.Besides the well-known ASD genes(SCN2A,NF1,SHANK3,CHD8 etc.),several high confidence genes were highlighted by a series of functional analyses,including CTNND1,DGKZ,LRP1,DDN,ZNF483,NR4A2,SMAD6,INTS1,and MRPL12,with more supported evidence from GO enrichment,expression and network analysis.We also integrated RNA-seq data to analyze the effect of the variants on the gene expression and found 12 genes in the individuals with the related variants had relatively biased expression.We further presented the clinical phenotypes of the proband carrying the risk genes in both our samples and Caucasian samples to show the effect of the risk genes on phenotype.Regarding variants in noncoding regions,a total of 74 de novo variants and 30 inherited variants were predicted as pathogenic with high confidence,which were mapped to specific genes or regulatory features.The number of de novo variants found in patient was significantly associated with the parents’ages at the birth of the child,and gender with trend.We also identified small de novo structural variants in ASD trios.The results in this study provided important evidence for understanding the genetic mechanism of ASD.展开更多
Gastric cancer(GC) is a highly heterogeneous disease with multiple cellular types and poor prognosis.However, the cellular evolution and molecular basis of GC at the individual intra-tumor level has not been well demo...Gastric cancer(GC) is a highly heterogeneous disease with multiple cellular types and poor prognosis.However, the cellular evolution and molecular basis of GC at the individual intra-tumor level has not been well demonstrated. We performed single-cell whole exome sequencing to detect somatic singlenucleotide variants(SNVs) and significantly mutated genes(SMGs) among 34 tumor cells and 9 normal cells from a patient with GC. The Complete Prediction for Protein Conformation(CPPC) approach directly predicting the folding conformation of the protein 3D structure with Protein Folding Shape Code, combined with functional experiments were used to confirm the characterization of mutated SMGs in GC cells. We identified 201 somatic SNVs, including 117 non-synonymous mutations in GC cells. Further analysis identified 24 significant mutated genes(SMGs) in single cells, for which a single amino acid change might affect protein conformation. Among them, two genes(CDC27 and FLG) that were mutated only in single cells but not in the corresponding tumor tissue, were recurrently present in another GC tissue cohort, and may play a potential role to promote carcinogenesis, as confirmed by functional characterization. Our findings showed a mutational landscape of GC at intra-tumor level for the first time and provided opportunities for understanding the heterogeneity and individualized target therapy for this disease.展开更多
The human digestive tract is home to trillions of microbes,and owing to the advent of next-generation sequencing at the turn of the century and the development of bioinformatics technology,researchers have been able t...The human digestive tract is home to trillions of microbes,and owing to the advent of next-generation sequencing at the turn of the century and the development of bioinformatics technology,researchers have been able to unravel the picture of the gut microbiome(1).展开更多
RNA base editing is a promising tool in precise molecular therapy. Currently, there are two widely used RNA base editors, REPAIRand RESCUE. REPAIR only facilitates A-to-I conversions, while RESCUE performs both A-to-I...RNA base editing is a promising tool in precise molecular therapy. Currently, there are two widely used RNA base editors, REPAIRand RESCUE. REPAIR only facilitates A-to-I conversions, while RESCUE performs both A-to-I and C-to-U conversions. Thus, RESCUEcan generate twice the number of mutations compared to REPAIR. However, transcription-wide impact due to RESCUE-induced offtarget single-nucleotide variants (SNVs) is not fully appreciated. Therefore, to determine the off-target effects of RESCUE-mediatedediting, we employed transcription-wide sequencing on cells edited by RESCUE. The SNVs showed different off-target effects onmRNA, circRNA, lncRNA, and miRNA expression patterns and their interacting networks. Our results illustrate the transcriptionwide impact of RESCUE-induced off-target SNVs and highlight the need for careful characterization of the off-target impact by thisediting platform.展开更多
Among multiple sclerosis(MS)susceptibility genes,the strongest non-human leukocyte antigen(HLA)signal in the Italian population maps to the TNFSF14 gene encoding LIGHT,a glycoprotein involved in dendritic cell(DC)matu...Among multiple sclerosis(MS)susceptibility genes,the strongest non-human leukocyte antigen(HLA)signal in the Italian population maps to the TNFSF14 gene encoding LIGHT,a glycoprotein involved in dendritic cell(DC)maturation.Through fine-mapping in a large Italian dataset(4,198 patients with MS and3,903 controls),we show that the TNFSF14 intronic SNP rs1077667 is the primarily MS-associated variant in the region.Expression quantitative trait locus(e QTL)analysis indicates that the MS risk allele is significantly associated with reduced TNFSF14 messenger RNA levels in blood cells,which is consistent with the allelic imbalance in RNA-Seq reads(P<0.0001).The MS risk allele is associated with reduced levels of TNFSF14 gene expression(P<0.01)in blood cells from 84 Italian patients with MS and 80 healthy controls(HCs).Interestingly,patients with MS are lower expressors of TNFSF14 compared to HC(P<0.007).Individuals homozygous for the MS risk allele display an increased percentage of LIGHT-positive peripheral blood myeloid DCs(CD11 c+,P=0.035)in 37 HCs,as well as in in vitro monocyte-derived DCs from 22 HCs(P=0.04).Our findings suggest that the intronic variant rs1077667 alters the expression of TNFSF14 in immune cells,which may play a role in MS pathogenesis.展开更多
BACKGROUND: Cancer immunotherapy uses one's own immune system to fight cancerous cells. As immune system is hard- wired to distinguish self and non-self, cancer immunotherapy is predicted to target cancerous cells s...BACKGROUND: Cancer immunotherapy uses one's own immune system to fight cancerous cells. As immune system is hard- wired to distinguish self and non-self, cancer immunotherapy is predicted to target cancerous cells specifically, therefore is less toxic than chemotherapy and radiation therapy, two major treatments for cancer. Cancer immunologists have spent decades to search for the specific targets in cancerous cells. METHODS: Due to the recent advances in high throughput sequencing and bioinformatics, evidence has merged that the neoantigens in cancerous cells are probably the cancer-specific targets that lead to the destruction of cancer. We will review the transplantable murine tumor models for cancer immunotherapy and the bioinformatics tools used to navigate mouse genome to identify tumor-rejecting neoantigens. RESULTS: Several groups have independently identified point mutations that can be recognized by T cells of host immune system. It is consistent with the note that the formation ofpeptide-MHC I-TCR complex is critical to activate T cells. Both anchor residue and TCR-facing residue mutations have been reported. While TCR-facing residue mutations may directly activate specific T cells, anchor residue mutations improve the binding of peptides to MHC I molecules, which increases the presentation of peptides and the T cell activation indirectly. CONCLUSIONS: Our work indicates that the affinity of neoepitopes for MHC I is not a predictor for anti-tumor immune responses in mice. Instead differential agretopic index (DAI), the numerical difference of epitope-MHC I affinities between the mutated and un-mutated sequences is a significant predictor. A similar bioinformatics pipeline has been developed to generate personalized vaccines to treat human ovarian cancer in a Phase I clinical trial.展开更多
Allele specific expression is essential for cellular programming and development and the diversity of cellular phenotypes. Traditional analysis methods utilize RNA and depend on single nucleotide polymorphisms,thus to...Allele specific expression is essential for cellular programming and development and the diversity of cellular phenotypes. Traditional analysis methods utilize RNA and depend on single nucleotide polymorphisms,thus to suffer from limited amount of materials for analysis. The rapid development of next-generation sequencing technologies provides more comprehensive and powerful approaches to analyze the genomic, epigenetic, and transcriptomic data, and further to detect and measure allele specific expressions. It will potentially enhance the understanding of the allele specific expressions, their complexities, and the effect on biological processes. In this paper, we extensively review the state-of-art enabling technologies and tools to analyze, detect, and measure allele specific expressions, compare their features, and point out the future trend of the methods.展开更多
文摘In a study published in Nature on June 10, researchers from Dr. YANG Hui’s Lab at the CAS Institute of Neuroscience (ION), and collaborators from the CAS-MPG Partner Institute for Computational Biology (PICB) and Sichuan University demonstrated that DNA base editors generated tens of thousands of off-target RNA single nucleotide variants (SNVs) and these off-target SNVs could be eliminated by introducing point mutations to the deaminases, built-in enzymes that act to rewrite the DNA bases.
文摘为了评判烟用接装纸内在质量稳定性,应用近红外光纤漫反射技术扫描烟用接装纸,对所得光谱进行标准正则变换(Standard Normal Variation,SNV)和一阶微分处理后,采用主成分分析(PrincipalComponent Analysis,PCA)法进行特征抽提,根据主成分空间下的马氏距离建立校正集,对不同厂家烟用接装纸进行模式识别,并建立评价模型对相同厂家接装纸内在质量稳定性进行评判。结果表明,建立的校正集模型可有效识别不同厂家的烟用接装纸,评价模型对相同厂家接装纸内在质量稳定性的判别完全准确。
基金support from the National Key R&D Program of China(2021YFF1200300)the National Natural Science Foundation of China(Nos.22174094,22274097)+1 种基金the Fundamental Research Funds for the Central Universities(YG2023QNA33)Young Leading Scientists Cultivation Plan supportedby ShanghaiMunicipal Education Commission(ZXWH1082101).
文摘Single-nucleotide variants(SNVs)are crucial in disease development,but their accurate detection is challenging due to their low abundance and interference from wild-type targets.Although nucleic acid analogs like peptide nucleic acids(PNAs)have been used for SNV detection,they often lack programmable sensitivity and specificity due to poorly calculated thermodynamics and kinetics.Here,we present a computational method for calculating the stacking energy of PNA and DNA hybrids,leveraging nearest neighbor parameters.Validation against experimental data from 16 sequences under varied hybridization conditions yielded good agreement using Bland-Altman analysis,with all data points falling within the confidence interval.Our findings indicate that PNA-DNA hybridization is thermodynamically more stable and exhibits kinetics 140-fold faster than DNA-DNA hybridization for identical sequences.Utilizing this computational framework,we designed PNA toehold probes,which were screened via simulations and experiments.This combined approach facilitated the identification of highly sensitive and specific PNA toehold probes for single point mutation detection via strand displacement reaction.Our results demonstrate the successful application of PNA toehold probes for detecting point mutations with high sensitivity and specificity,achieving a selective amplification of approximately 200-fold for variants with a variant allele frequency(VAF)of 0.5%using quantitative polymerase chain reaction.
基金supported by the National Program for Brain Science and Brain-like Intelligence Technology of China (2021ZD0200800)Beijing Municipal Science and Technology Commission (Z181100001518005)+1 种基金the National Natural Science Foundation of China (31401139, 32170613, 81671358, 81873803)the Natural Science Foundation of Beijing Municipality (7232225)
文摘Autism spectrum disorder(ASD)is a neurodevelopmental disorder with high genetic heritability but heterogeneity.Fully understanding its genetics requires whole-genome sequencing(WGS),but the ASD studies utilizing WGS data in Chinese population are limited.In this study,we present a WGS study for 334 individuals,including 112 ASD patients and their non-ASD parents.We identified 146 de novo variants in coding regions in 85 cases and 60 inherited variants in coding regions.By integrating these variants with an association model,we identified 33 potential risk genes(P<0.001)enriched in neuron and regulation related biological process.Besides the well-known ASD genes(SCN2A,NF1,SHANK3,CHD8 etc.),several high confidence genes were highlighted by a series of functional analyses,including CTNND1,DGKZ,LRP1,DDN,ZNF483,NR4A2,SMAD6,INTS1,and MRPL12,with more supported evidence from GO enrichment,expression and network analysis.We also integrated RNA-seq data to analyze the effect of the variants on the gene expression and found 12 genes in the individuals with the related variants had relatively biased expression.We further presented the clinical phenotypes of the proband carrying the risk genes in both our samples and Caucasian samples to show the effect of the risk genes on phenotype.Regarding variants in noncoding regions,a total of 74 de novo variants and 30 inherited variants were predicted as pathogenic with high confidence,which were mapped to specific genes or regulatory features.The number of de novo variants found in patient was significantly associated with the parents’ages at the birth of the child,and gender with trend.We also identified small de novo structural variants in ASD trios.The results in this study provided important evidence for understanding the genetic mechanism of ASD.
基金supported by the National Key Research and Development Program of China (2017YFC1308900)Beijing Municipal Commission of Health and Family Planning Project (PXM2018_026279_000005)+1 种基金National High-tech R&D Program of China (2012AA02A203, No.2012AA02A504)Beijing talent fund
文摘Gastric cancer(GC) is a highly heterogeneous disease with multiple cellular types and poor prognosis.However, the cellular evolution and molecular basis of GC at the individual intra-tumor level has not been well demonstrated. We performed single-cell whole exome sequencing to detect somatic singlenucleotide variants(SNVs) and significantly mutated genes(SMGs) among 34 tumor cells and 9 normal cells from a patient with GC. The Complete Prediction for Protein Conformation(CPPC) approach directly predicting the folding conformation of the protein 3D structure with Protein Folding Shape Code, combined with functional experiments were used to confirm the characterization of mutated SMGs in GC cells. We identified 201 somatic SNVs, including 117 non-synonymous mutations in GC cells. Further analysis identified 24 significant mutated genes(SMGs) in single cells, for which a single amino acid change might affect protein conformation. Among them, two genes(CDC27 and FLG) that were mutated only in single cells but not in the corresponding tumor tissue, were recurrently present in another GC tissue cohort, and may play a potential role to promote carcinogenesis, as confirmed by functional characterization. Our findings showed a mutational landscape of GC at intra-tumor level for the first time and provided opportunities for understanding the heterogeneity and individualized target therapy for this disease.
基金This study was supported by grants from the National Natural Science Foundation of China(No.82273406)the Natural Science Foundation of Jiangsu Province(No.BK20211378).
文摘The human digestive tract is home to trillions of microbes,and owing to the advent of next-generation sequencing at the turn of the century and the development of bioinformatics technology,researchers have been able to unravel the picture of the gut microbiome(1).
基金supported by grants from the Ministry of Agriculture of China(2016ZX08009003-006 and 2011ZX08006-001)the National Key Laboratory Open Fund Project(2020SKLAB6-24)+1 种基金the ZJU-Hangzhou Global Scientific and Technological Innovation Center,Zhejiang University(02020200-K02013008)the National Natural Science Foundation of China(32071347).
文摘RNA base editing is a promising tool in precise molecular therapy. Currently, there are two widely used RNA base editors, REPAIRand RESCUE. REPAIR only facilitates A-to-I conversions, while RESCUE performs both A-to-I and C-to-U conversions. Thus, RESCUEcan generate twice the number of mutations compared to REPAIR. However, transcription-wide impact due to RESCUE-induced offtarget single-nucleotide variants (SNVs) is not fully appreciated. Therefore, to determine the off-target effects of RESCUE-mediatedediting, we employed transcription-wide sequencing on cells edited by RESCUE. The SNVs showed different off-target effects onmRNA, circRNA, lncRNA, and miRNA expression patterns and their interacting networks. Our results illustrate the transcriptionwide impact of RESCUE-induced off-target SNVs and highlight the need for careful characterization of the off-target impact by thisediting platform.
基金supported by the Italian Foundation of Multiple Sclerosis(FISM,2011/R/142015/R/10,2019/R-Multi/033)by the Italian Ministry of Health(RF-2016-02361294)the AGING Project for Department of Excellence at the Department of Translational Medicine(DIMET),Universitàdel Piemonte Orientale,Novara,Italy+1 种基金supported by Consorzio Interuniversitario di Biotecnologie(CIB)partially supported by Multiple MS project(Horizon 2020 European Grant 733161),Stockholm。
文摘Among multiple sclerosis(MS)susceptibility genes,the strongest non-human leukocyte antigen(HLA)signal in the Italian population maps to the TNFSF14 gene encoding LIGHT,a glycoprotein involved in dendritic cell(DC)maturation.Through fine-mapping in a large Italian dataset(4,198 patients with MS and3,903 controls),we show that the TNFSF14 intronic SNP rs1077667 is the primarily MS-associated variant in the region.Expression quantitative trait locus(e QTL)analysis indicates that the MS risk allele is significantly associated with reduced TNFSF14 messenger RNA levels in blood cells,which is consistent with the allelic imbalance in RNA-Seq reads(P<0.0001).The MS risk allele is associated with reduced levels of TNFSF14 gene expression(P<0.01)in blood cells from 84 Italian patients with MS and 80 healthy controls(HCs).Interestingly,patients with MS are lower expressors of TNFSF14 compared to HC(P<0.007).Individuals homozygous for the MS risk allele display an increased percentage of LIGHT-positive peripheral blood myeloid DCs(CD11 c+,P=0.035)in 37 HCs,as well as in in vitro monocyte-derived DCs from 22 HCs(P=0.04).Our findings suggest that the intronic variant rs1077667 alters the expression of TNFSF14 in immune cells,which may play a role in MS pathogenesis.
文摘BACKGROUND: Cancer immunotherapy uses one's own immune system to fight cancerous cells. As immune system is hard- wired to distinguish self and non-self, cancer immunotherapy is predicted to target cancerous cells specifically, therefore is less toxic than chemotherapy and radiation therapy, two major treatments for cancer. Cancer immunologists have spent decades to search for the specific targets in cancerous cells. METHODS: Due to the recent advances in high throughput sequencing and bioinformatics, evidence has merged that the neoantigens in cancerous cells are probably the cancer-specific targets that lead to the destruction of cancer. We will review the transplantable murine tumor models for cancer immunotherapy and the bioinformatics tools used to navigate mouse genome to identify tumor-rejecting neoantigens. RESULTS: Several groups have independently identified point mutations that can be recognized by T cells of host immune system. It is consistent with the note that the formation ofpeptide-MHC I-TCR complex is critical to activate T cells. Both anchor residue and TCR-facing residue mutations have been reported. While TCR-facing residue mutations may directly activate specific T cells, anchor residue mutations improve the binding of peptides to MHC I molecules, which increases the presentation of peptides and the T cell activation indirectly. CONCLUSIONS: Our work indicates that the affinity of neoepitopes for MHC I is not a predictor for anti-tumor immune responses in mice. Instead differential agretopic index (DAI), the numerical difference of epitope-MHC I affinities between the mutated and un-mutated sequences is a significant predictor. A similar bioinformatics pipeline has been developed to generate personalized vaccines to treat human ovarian cancer in a Phase I clinical trial.
文摘Allele specific expression is essential for cellular programming and development and the diversity of cellular phenotypes. Traditional analysis methods utilize RNA and depend on single nucleotide polymorphisms,thus to suffer from limited amount of materials for analysis. The rapid development of next-generation sequencing technologies provides more comprehensive and powerful approaches to analyze the genomic, epigenetic, and transcriptomic data, and further to detect and measure allele specific expressions. It will potentially enhance the understanding of the allele specific expressions, their complexities, and the effect on biological processes. In this paper, we extensively review the state-of-art enabling technologies and tools to analyze, detect, and measure allele specific expressions, compare their features, and point out the future trend of the methods.