The types of mutations and their corresponding frequencies are difficult to measure in complex genomes. In this study, a high-throughput method was developed to identify spontaneous loss-of-function alleles for the re...The types of mutations and their corresponding frequencies are difficult to measure in complex genomes. In this study, a high-throughput method was developed to identify spontaneous loss-of-function alleles for the resistance gene N and the transgenic avirulence gene P50 in allotetraploid tobacco. A total of 2134 loss- of-function alleles of the N gene were identified after screening 14 million F1 hybrids. Analysis of these mutants revealed striking evolutionary patterns for genes in polyploids. Only 14 of the loss-of-function mutations were caused by spontaneous point mutations or indels, while the others were caused by home- ologous recombination (with a frequency of 1/12 000) or chromosome loss (1/15 000). Loss of the chromosome with the PS0 insertion occurred at a similar frequency (1/13 000), and the frequency of spon- taneous segmental deletion in this chromosome was 1/16 000. Both homeologous recombination and chromosome loss considerably decreased the viability of the mutants. Our data suggest that the high mutation rate in polyploids is probably due to the occurrence of homeologous recombination and the toler- ance of large mutations such as chromosome loss in polyploid genomes. Frequent mutations tend to drive polyploids to extinction unless a novel mutation helps the polyploid to effectively compete with diploids or find a new ecological niche.展开更多
Many of our major crop species are polyploids, containing more than one genome or set of chromosomes. Polyploid crops present unique challenges, including difficulties in genome assembly, in discriminating between mul...Many of our major crop species are polyploids, containing more than one genome or set of chromosomes. Polyploid crops present unique challenges, including difficulties in genome assembly, in discriminating between multiple gene and sequence copies, and in genetic mapping, hindering use of genomic data for genetics and breeding. Polyploid genomes may also be more prone to containing structural variation, such as loss of gene copies or sequences(presence–absence variation) and the presence of genes or sequences in multiple copies(copynumber variation). Although the two main types of genomic structural variation commonly identified are presence–absence variation and copy-number variation, we propose that homeologous exchanges constitute a third major form of genomic structural variation in polyploids. Homeologous exchanges involve the replacement of one genomic segment by a similar copy from another genome or ancestrally duplicated region, and are known to be extremely common in polyploids. Detecting all kinds of genomic structural variation is challenging, but recent advances such as optical mapping and long-read sequencing offer potential strategies to help identify structural variants even in complex polyploid genomes. All three major types of genomic structural variation(presence–absence, copy-number, and homeologous exchange) are now known to influence phenotypes in crop plants, with examples of flowering time, frost tolerance, and adaptive and agronomic traits. In this review,we summarize the challenges of genome analysis in polyploid crops, describe the various types of genomic structural variation and the genomics technologies and data that can be used to detect them, and collate information produced to date related to the impact of genomic structural variation on crop phenotypes. We highlight the importance of genomic structural variation for the future genetic improvement of polyploid crops.展开更多
The plant hormone abscisic acid (ABA) regulates many important physiological and developmental processes in plants. The objective of this study was to clone the ABA 8′-hydroxylase gene in common wheat. In the prese...The plant hormone abscisic acid (ABA) regulates many important physiological and developmental processes in plants. The objective of this study was to clone the ABA 8′-hydroxylase gene in common wheat. In the present study, we used the eDNA sequence of barley HvCYP707A1 gene (GenBank accession no. AB239299) as a probe for BLAST search against the common wheat (Triticum aestivum L.) EST database in GenBank. All wheat ESTs sharing high similarity with the reference gene were subjected to contig assembly. Primers were designed based on the constructed contigs to clone the wheat CYP707A1 gene, designated as TaCYP707A1. The genomic DNA sequence of TaCYPTO7A1 gene comprised five exons and four introns, with a size of 2225 bp. The corresponding cDNA sequence of TaCYP707A1 was 1737 bp, containing an open reading frame (ORF) of 1431 bp, a 42-bp 5′-untranslated region (UTR) and a 264-bp 3′UTR, with 94.9% of identical sequences to HvCYP707A1 gene (AB239299). The neighbor joining tree indicated that the deduced amino acid sequences of TaCYP707A1 gene was highly similar to those of barley and rice. The TaCYP707A1 gene was located on chromosome 6BL using a set of Chinese Spring nullisomic-tetrasomic lines and ditelosomic line 6BS. These results will be of high importance in understanding of molecular mechanism of ABA catabolism.展开更多
Population genetics studies of allopolyploid species lag behind those of diploid species because of practical difficulties in analysis of homeologs-duplicated gene copies originating from hybridized parental species. ...Population genetics studies of allopolyploid species lag behind those of diploid species because of practical difficulties in analysis of homeologs-duplicated gene copies originating from hybridized parental species. Pool-Seq, i.e. massive parallel sequencing of pooled individuals, has high potential for detecting nucleotide polymorphisms within and among multiple populations;however, its use has been limited to diploid species. We applied Pool-Seq to an allopolyploid species by developing a bioinformatic pipeline that assigns reads to each homeolog as well as to each polymorphic allele within each homeolog. We simultaneously sequenced eight genes from twenty individuals from each of 24 populations, and found over 100 polymorphic sites in each homeolog. For two sites, we estimated allele frequencies using the number of reads and then validated these estimations by making individual-based estimations. Pool-Seq using our bioinformatic pipeline allows efficient evaluation of nucleotide polymorphisms in a large number of individuals, even in allopolyploid species.展开更多
Understanding the relationship between genotype and phenotype is a major biological question and being able to predict phenotypes based on molecular genotypes is integral to molecular breeding. Whole- genome duplicati...Understanding the relationship between genotype and phenotype is a major biological question and being able to predict phenotypes based on molecular genotypes is integral to molecular breeding. Whole- genome duplications have shaped the history of all flowering plants and present challenges to elucidating the relationship between genotype and phenotype, especially in neopolyploid species. Although single nucleotide polymorphisms (SNPs) have become popular tools for genetic mapping, discovery and appli- cation of SNPs in polyploids has been difficult. Here, we summarize common experimental approaches to SNP calling, highlighting recent polyploid successes. To examine the impact of software choice on these analyses, we called SNPs among five peanut genotypes using different alignment programs (BWA-mem and Bowtie 2) and variant callers (SAMtools, GATK, and Freebayes). Alignments produced by Bowtie 2 and BWA-mem and analyzed in SAMtools shared 24.5% concordant SNPs, and SAMtools, GATK, and Freebayes shared 1.4% concordant SNPs. A subsequent analysis of simulated Brassica napus chromosome 1A and 1C genotypes demonstrated that, of the three software programs, SAMtools performed with the highest sensitivity and specificity on Bowtie 2 alignments. These results, however, are likely to vary among species, and we therefore propose a series of best practices for SNP calling in polyploids.展开更多
Interspecific hybridization is a driving force in evolution and speciation of higher plants. Interspecific hybridization often induces immediate and saltational changes in gene expression, a phenomenon collectively te...Interspecific hybridization is a driving force in evolution and speciation of higher plants. Interspecific hybridization often induces immediate and saltational changes in gene expression, a phenomenon collectively termed "transcriptome shock". Although transcriptome shock has been reported in various plant and animal taxa, the extent and pattern of shock-induced expression changes are often highly idiosyncratic, and hence entails additional investigations. Here, we produced a set of interspecific F1 triploid hybrid plants between Oryza sativa, ssp. japonica (2n=2x=24, genome AA) and the tetraploid form of O. punctata (2n=4x =48, genome, BBCC), and conducted RNA-seq transcriptome profiling of the hybrids and their exact parental plants. We analyzed both homeolog expression bias and overall gene expression level difference in the hybrids relative to the in silico "hybrids" (parental mixtures). We found that approximately 16% (2,541) of the 16,112 expressed genes in leaf tissue of the F1 hybrids showed nonadditive expression, which were specifically enriched in photosynthesis-related pathways. Interestingly, changes in the maternal homeolog expression, including non-stochastic silencing, were the major causes for altered homeolog expression partitioning in the F1 hybrids. Our findings have provided further insights into the tran- scriptome response to interspecific hybridization and heterosis.展开更多
文摘The types of mutations and their corresponding frequencies are difficult to measure in complex genomes. In this study, a high-throughput method was developed to identify spontaneous loss-of-function alleles for the resistance gene N and the transgenic avirulence gene P50 in allotetraploid tobacco. A total of 2134 loss- of-function alleles of the N gene were identified after screening 14 million F1 hybrids. Analysis of these mutants revealed striking evolutionary patterns for genes in polyploids. Only 14 of the loss-of-function mutations were caused by spontaneous point mutations or indels, while the others were caused by home- ologous recombination (with a frequency of 1/12 000) or chromosome loss (1/15 000). Loss of the chromosome with the PS0 insertion occurred at a similar frequency (1/13 000), and the frequency of spon- taneous segmental deletion in this chromosome was 1/16 000. Both homeologous recombination and chromosome loss considerably decreased the viability of the mutants. Our data suggest that the high mutation rate in polyploids is probably due to the occurrence of homeologous recombination and the toler- ance of large mutations such as chromosome loss in polyploid genomes. Frequent mutations tend to drive polyploids to extinction unless a novel mutation helps the polyploid to effectively compete with diploids or find a new ecological niche.
基金supported by the Deutsche Forschungsgemeinschaft(MA6473/1-1,MA6473/2-1)
文摘Many of our major crop species are polyploids, containing more than one genome or set of chromosomes. Polyploid crops present unique challenges, including difficulties in genome assembly, in discriminating between multiple gene and sequence copies, and in genetic mapping, hindering use of genomic data for genetics and breeding. Polyploid genomes may also be more prone to containing structural variation, such as loss of gene copies or sequences(presence–absence variation) and the presence of genes or sequences in multiple copies(copynumber variation). Although the two main types of genomic structural variation commonly identified are presence–absence variation and copy-number variation, we propose that homeologous exchanges constitute a third major form of genomic structural variation in polyploids. Homeologous exchanges involve the replacement of one genomic segment by a similar copy from another genome or ancestrally duplicated region, and are known to be extremely common in polyploids. Detecting all kinds of genomic structural variation is challenging, but recent advances such as optical mapping and long-read sequencing offer potential strategies to help identify structural variants even in complex polyploid genomes. All three major types of genomic structural variation(presence–absence, copy-number, and homeologous exchange) are now known to influence phenotypes in crop plants, with examples of flowering time, frost tolerance, and adaptive and agronomic traits. In this review,we summarize the challenges of genome analysis in polyploid crops, describe the various types of genomic structural variation and the genomics technologies and data that can be used to detect them, and collate information produced to date related to the impact of genomic structural variation on crop phenotypes. We highlight the importance of genomic structural variation for the future genetic improvement of polyploid crops.
基金supported by the National Basic Research Program of China(2009CB118300)the National 863 Program of China(2006AA10Z1A7and2006AA100102)the International Collaboration Project from the Ministry of Agriculture of China(2006-G2)
文摘The plant hormone abscisic acid (ABA) regulates many important physiological and developmental processes in plants. The objective of this study was to clone the ABA 8′-hydroxylase gene in common wheat. In the present study, we used the eDNA sequence of barley HvCYP707A1 gene (GenBank accession no. AB239299) as a probe for BLAST search against the common wheat (Triticum aestivum L.) EST database in GenBank. All wheat ESTs sharing high similarity with the reference gene were subjected to contig assembly. Primers were designed based on the constructed contigs to clone the wheat CYP707A1 gene, designated as TaCYP707A1. The genomic DNA sequence of TaCYPTO7A1 gene comprised five exons and four introns, with a size of 2225 bp. The corresponding cDNA sequence of TaCYP707A1 was 1737 bp, containing an open reading frame (ORF) of 1431 bp, a 42-bp 5′-untranslated region (UTR) and a 264-bp 3′UTR, with 94.9% of identical sequences to HvCYP707A1 gene (AB239299). The neighbor joining tree indicated that the deduced amino acid sequences of TaCYP707A1 gene was highly similar to those of barley and rice. The TaCYP707A1 gene was located on chromosome 6BL using a set of Chinese Spring nullisomic-tetrasomic lines and ditelosomic line 6BS. These results will be of high importance in understanding of molecular mechanism of ABA catabolism.
文摘Population genetics studies of allopolyploid species lag behind those of diploid species because of practical difficulties in analysis of homeologs-duplicated gene copies originating from hybridized parental species. Pool-Seq, i.e. massive parallel sequencing of pooled individuals, has high potential for detecting nucleotide polymorphisms within and among multiple populations;however, its use has been limited to diploid species. We applied Pool-Seq to an allopolyploid species by developing a bioinformatic pipeline that assigns reads to each homeolog as well as to each polymorphic allele within each homeolog. We simultaneously sequenced eight genes from twenty individuals from each of 24 populations, and found over 100 polymorphic sites in each homeolog. For two sites, we estimated allele frequencies using the number of reads and then validated these estimations by making individual-based estimations. Pool-Seq using our bioinformatic pipeline allows efficient evaluation of nucleotide polymorphisms in a large number of individuals, even in allopolyploid species.
文摘Understanding the relationship between genotype and phenotype is a major biological question and being able to predict phenotypes based on molecular genotypes is integral to molecular breeding. Whole- genome duplications have shaped the history of all flowering plants and present challenges to elucidating the relationship between genotype and phenotype, especially in neopolyploid species. Although single nucleotide polymorphisms (SNPs) have become popular tools for genetic mapping, discovery and appli- cation of SNPs in polyploids has been difficult. Here, we summarize common experimental approaches to SNP calling, highlighting recent polyploid successes. To examine the impact of software choice on these analyses, we called SNPs among five peanut genotypes using different alignment programs (BWA-mem and Bowtie 2) and variant callers (SAMtools, GATK, and Freebayes). Alignments produced by Bowtie 2 and BWA-mem and analyzed in SAMtools shared 24.5% concordant SNPs, and SAMtools, GATK, and Freebayes shared 1.4% concordant SNPs. A subsequent analysis of simulated Brassica napus chromosome 1A and 1C genotypes demonstrated that, of the three software programs, SAMtools performed with the highest sensitivity and specificity on Bowtie 2 alignments. These results, however, are likely to vary among species, and we therefore propose a series of best practices for SNP calling in polyploids.
基金supported by the National Natural Science Foundation of China (NSFC#30990243)the State Key Basic Research and Development Plan of China (2013CBA01404)+1 种基金the Program for Introducing Talents to Universities (#B07017)a Graduate Student Innovation Fund (12SSXT130)
文摘Interspecific hybridization is a driving force in evolution and speciation of higher plants. Interspecific hybridization often induces immediate and saltational changes in gene expression, a phenomenon collectively termed "transcriptome shock". Although transcriptome shock has been reported in various plant and animal taxa, the extent and pattern of shock-induced expression changes are often highly idiosyncratic, and hence entails additional investigations. Here, we produced a set of interspecific F1 triploid hybrid plants between Oryza sativa, ssp. japonica (2n=2x=24, genome AA) and the tetraploid form of O. punctata (2n=4x =48, genome, BBCC), and conducted RNA-seq transcriptome profiling of the hybrids and their exact parental plants. We analyzed both homeolog expression bias and overall gene expression level difference in the hybrids relative to the in silico "hybrids" (parental mixtures). We found that approximately 16% (2,541) of the 16,112 expressed genes in leaf tissue of the F1 hybrids showed nonadditive expression, which were specifically enriched in photosynthesis-related pathways. Interestingly, changes in the maternal homeolog expression, including non-stochastic silencing, were the major causes for altered homeolog expression partitioning in the F1 hybrids. Our findings have provided further insights into the tran- scriptome response to interspecific hybridization and heterosis.