AIM: To develop a prognostic gene set that can predict patient overall survival status based on the whole genome expression analysis. METHODS: Using Illumina HumanWG-6 BeadChip followed by semi-supervised analysis, we...AIM: To develop a prognostic gene set that can predict patient overall survival status based on the whole genome expression analysis. METHODS: Using Illumina HumanWG-6 BeadChip followed by semi-supervised analysis, we analyzed the expression of 47 296 transcripts in two batches of gastric cancer patients who underwent surgical resection. Thirty-nine samples in the first batch were used as the training set to discover candidate markers correlated to overall survival, and thirty-three samples in the second batch were used for validation. RESULTS: A panel of ten genes were identified as prognostic marker in the first batch samples and classified patients into a lowand a high-risk group with significantly different survival times (P = 0.000047). This prognostic marker was then verified in an independent validation sample batch (P = 0.0009). By comparing with the traditional Tumor-node-metastasis (TNM) staging system, this ten-gene prognostic marker showed consistent prognosis results. It was the only independent prognostic value by multivariate Cox regression analysis (P = 0.007). Interestingly, six of these ten genes are ribosomal proteins, suggesting a possible association between the deregulation of ribosome related gene expression and the poor prognosis. CONCLUSION: A ten-gene marker correlated with overall prognosis, including 6 ribosomal proteins, was identified and verified, which may complement the predictive value of TNM staging system.展开更多
DNA composition dynamics across genomes of diverse taxonomy is a major subject of genome analyses. DNA composition changes are characteristics of both replication and repair machineries. We investigated 3,611,007 sing...DNA composition dynamics across genomes of diverse taxonomy is a major subject of genome analyses. DNA composition changes are characteristics of both replication and repair machineries. We investigated 3,611,007 single nucleotide polymorphisms (SNPs) generated by comparing two sequenced rice genomes from distant inbred lines (subspecies), including those from 242,811 introns and 45,462 protein-coding sequences (CDSs). Neighboring-nucleotide effects (NNEs) of these SNPs are diverse, depending on structural content-based classifications (genomewide, intronic, and CDS) and sequence context-based categories (A/C, A/G, A/T, C/G, C/T, and G/T substitutions) of the analyzed SNPs. Strong and evident NNEs and nucleotide proportion biases surrounding the analyzed SNPs were observed in 1-3 bp sequences on both sides of an SNP. Strong biases were observed around neighboring nucleotides of protein-coding SNPs, which exhibit a periodicity of three in nucleotide content, constrained by a combined effect of codon-related rules and DNA repair mechanisms. Unlike a previous finding in the human genome, we found negative correlation between GC contents of chromosomes and the magnitude of corresponding bias of nucleotide C at -1 site and G at +1 site. These results will further our understanding of the mutation mechanism in rice as well as its evolutionary implications.展开更多
基金Supported by the National 863 Program (SQ2009AA02-XK1482570 and 2006AA02A402)Beijing Municipal Committeeof Science and Technology (D0905001040631) Beijing Capi-tal Development Foundation of Health Bureau (2007-2051)
文摘AIM: To develop a prognostic gene set that can predict patient overall survival status based on the whole genome expression analysis. METHODS: Using Illumina HumanWG-6 BeadChip followed by semi-supervised analysis, we analyzed the expression of 47 296 transcripts in two batches of gastric cancer patients who underwent surgical resection. Thirty-nine samples in the first batch were used as the training set to discover candidate markers correlated to overall survival, and thirty-three samples in the second batch were used for validation. RESULTS: A panel of ten genes were identified as prognostic marker in the first batch samples and classified patients into a lowand a high-risk group with significantly different survival times (P = 0.000047). This prognostic marker was then verified in an independent validation sample batch (P = 0.0009). By comparing with the traditional Tumor-node-metastasis (TNM) staging system, this ten-gene prognostic marker showed consistent prognosis results. It was the only independent prognostic value by multivariate Cox regression analysis (P = 0.007). Interestingly, six of these ten genes are ribosomal proteins, suggesting a possible association between the deregulation of ribosome related gene expression and the poor prognosis. CONCLUSION: A ten-gene marker correlated with overall prognosis, including 6 ribosomal proteins, was identified and verified, which may complement the predictive value of TNM staging system.
基金The work was supported partly by the National High-Tech Research and Development Program(863 program,No.2003AA231050)the National Natural Science Foundation of China(No.10371126).
文摘DNA composition dynamics across genomes of diverse taxonomy is a major subject of genome analyses. DNA composition changes are characteristics of both replication and repair machineries. We investigated 3,611,007 single nucleotide polymorphisms (SNPs) generated by comparing two sequenced rice genomes from distant inbred lines (subspecies), including those from 242,811 introns and 45,462 protein-coding sequences (CDSs). Neighboring-nucleotide effects (NNEs) of these SNPs are diverse, depending on structural content-based classifications (genomewide, intronic, and CDS) and sequence context-based categories (A/C, A/G, A/T, C/G, C/T, and G/T substitutions) of the analyzed SNPs. Strong and evident NNEs and nucleotide proportion biases surrounding the analyzed SNPs were observed in 1-3 bp sequences on both sides of an SNP. Strong biases were observed around neighboring nucleotides of protein-coding SNPs, which exhibit a periodicity of three in nucleotide content, constrained by a combined effect of codon-related rules and DNA repair mechanisms. Unlike a previous finding in the human genome, we found negative correlation between GC contents of chromosomes and the magnitude of corresponding bias of nucleotide C at -1 site and G at +1 site. These results will further our understanding of the mutation mechanism in rice as well as its evolutionary implications.