Brassica rapa comprises several important cultivated vegetables and oil crops.Current reference genome assemblies of Brassica rapa are quite fragmented and not highly contiguous,thereby limiting extensive genetic and ...Brassica rapa comprises several important cultivated vegetables and oil crops.Current reference genome assemblies of Brassica rapa are quite fragmented and not highly contiguous,thereby limiting extensive genetic and genomic analyses.Here,we report an improved assembly of the B.rapa genome(v3.0)using single-molecule sequencing,optical mapping,and chromosome conformation capture technologies(Hi-C).Relative to the previous reference genomes,our assembly features a contig N50 size of 1.45 Mb,representing a~30-fold improvement.We also identified a new event that occurred in the B.rapa genome~1.2 million years ago,when a long terminal repeat retrotransposon(LTR-RT)expanded.Further analysis refined the relationship of genome blocks and accurately located the centromeres in the B.rapa genome.The B.rapa genome v3.0 will serve as an important community resource for future genetic and genomic studies in B.rapa.This resource will facilitate breeding efforts in B.rapa,as well as comparative genomic analysis with other Brassica species.展开更多
Since the publication of this article,the authors have noticed that the total gene models(45,985),tandem arrays(2077),tandem genes(4963),redundancy removed(43,099),syntenic genes(39,858),nonsyntenic genes(3241),genes ...Since the publication of this article,the authors have noticed that the total gene models(45,985),tandem arrays(2077),tandem genes(4963),redundancy removed(43,099),syntenic genes(39,858),nonsyntenic genes(3241),genes on chromosomes(45,411),genes on scaffolds(574)of B.rapa reference genome v3.0 were mistaken in the article.展开更多
Non-heading Chinese cabbage(NHCC)is an important leafy vegetable cultivated worldwide.Here,we report the first high-quality,chromosome-level genome of NHCC001 based on PacBio,Hi-C,and Illumina sequencing data.The asse...Non-heading Chinese cabbage(NHCC)is an important leafy vegetable cultivated worldwide.Here,we report the first high-quality,chromosome-level genome of NHCC001 based on PacBio,Hi-C,and Illumina sequencing data.The assembled NHCC001 genome is 405.33 Mb in size with a contig N50 of 2.83 Mb and a scaffold N50 of 38.13 Mb.Approximately 53%of the assembled genome is composed of repetitive sequences,among which long terminal repeats(LTRs,20.42%of the genome)are the most abundant.Using Hi-C data,97.9%(396.83 Mb)of the sequences were assigned to 10 pseudochromosomes.Genome assessment showed that this B.rapa NHCC001 genome assembly is of better quality than other currently available B.rapa assemblies and that it contains 48,158 protein-coding genes,99.56%of which are annotated in at least one functional database.Comparative genomic analysis confirmed that B.rapa NHCC001 underwent a whole-genome triplication(WGT)event shared with other Brassica species that occurred after the WGD events shared with Arabidopsis.Genes related to ascorbic acid metabolism showed little variation among the three B.rapa subspecies.The numbers of genes involved in glucosinolate biosynthesis and catabolism were higher in NHCC001 than in Chiifu and Z1,due primarily to tandem duplication.The newly assembled genome will provide an important resource for research on B.rapa,especially B.rapa ssp.chinensis.展开更多
Flower type is an important and extremely complicated trait of chrysanthemum.The corolla tube merged degree(CTMD)and the relative number of ray florets(RNRF)are the two key factors affecting chrysanthemum flower type....Flower type is an important and extremely complicated trait of chrysanthemum.The corolla tube merged degree(CTMD)and the relative number of ray florets(RNRF)are the two key factors affecting chrysanthemum flower type.However,few reports have clarified the inheritance of these two complex traits,which limits directed breeding for flower-type improvement.In this study,305 F1 hybrids were obtained from two parents with obvious differences in CTMD and RNRF performance.Using specific-locus amplified fragment sequencing(SLAF-seq)technology,we constructed a high-density genetic linkage map with an average map distance of 0.76 cM.Three major QTLs controlling CTMD and four major QTLs underlying RNRF were repeatedly detected in the 2 years.Moreover,the synteny between the genetic map and other Compositae species was investigated,and weak collinearity was observed.In QTL regions with a high degree of genomic collinearity,eight annotated genes were probed in the Helianthus annuus L.and Lactuca sativa L.var.ramosa Hort.genomes.Furthermore,20 and 11 unigenes were identified via BLAST searches between the SNP markers of the QTL regions and the C.vestitum and C.lavandulifolium transcriptomes,respectively.These results lay a foundation for molecular marker-assisted breeding and candidate gene exploration in chrysanthemum without a reference assembly.展开更多
Olive(Olea europaea L.)is internationally renowned for its high-end product,extra virgin olive oil.An incomplete genome of O.europaea was previously obtained using shotgun sequencing in 2016.To further explore the gen...Olive(Olea europaea L.)is internationally renowned for its high-end product,extra virgin olive oil.An incomplete genome of O.europaea was previously obtained using shotgun sequencing in 2016.To further explore the genetic and breeding utilization of olive,an updated draft genome of olive was obtained using Oxford Nanopore third-generation sequencing and Hi-C technology.Seven different assembly strategies were used to assemble the fi nal genome of 1.30 Gb,with contig and scaffold N50 sizes of4.67 Mb and 42.60 Mb,respectively.This greatly increased the quality of the olive genome.We assembled 1.1 Gb of sequences of the total olive genome to 23 pseudochromosomes by Hi-C,and 53,518 protein-coding genes were predicted in the current assembly.Comparative genomics analyses,including gene family expansion and contraction,whole-genome replication,phylogenetic analysis,and positive selection,were performed.Based on the obtained high-quality olive genome,a total of nine gene families with 202 genes were identi fi ed in the oleuropein biosynthesis pathway,which is twice the number ofgenes identi fi ed from the previous data.This new accession of the olive genome is of suf fi cient quality for genome-wide studies on gene function in olive and has provided a foundation for the molecular breeding of olive species.展开更多
Zanthoxylum bungeanum is an important spice and medicinal plant that is unique for its accumulation of abundant secondary metabolites,which create a characteristic aroma and tingling sensation in the mouth.Owing to th...Zanthoxylum bungeanum is an important spice and medicinal plant that is unique for its accumulation of abundant secondary metabolites,which create a characteristic aroma and tingling sensation in the mouth.Owing to the high proportion of repetitive sequences,high heterozygosity,and increased chromosome number of Z.bungeanum,the assembly of its chromosomal pseudomolecules is extremely challenging.Here,we present a genome sequence for Z.bungeanum,with a dramatically expanded size of 4.23 Gb,assembled into 68 chromosomes.This genome is approximately tenfold larger than that of its close relative Citrus sinensis.After the divergence of Zanthoxylum and Citrus,the lineage-specific whole-genome duplication event q-WGD approximately 26.8 million years ago(MYA)and the recent transposable element(TE)burst~6.41 MYA account for the substantial genome expansion in Z.bungeanum.The independent Zanthoxylum-specific WGD event was followed by numerous fusion/fission events that shaped the genomic architecture.Integrative genomic and transcriptomic analyses suggested that prominent speciesspecific gene family expansions and changes in gene expression have shaped the biosynthesis of sanshools,terpenoids,and anthocyanins,which contribute to the special flavor and appearance of Z.bungeanum.In summary,the reference genome provides a valuable model for studying the impact of WGDs with recent TE activity on gene gain and loss and genome reconstruction and provides resources to accelerate Zanthoxylum improvement.展开更多
Chayote(Sechium edule)is an agricultural crop in the Cucurbitaceae family that is rich in bioactive components.To enhance genetic research on chayote,we used Nanopore third-generation sequencing combined with Hi-C dat...Chayote(Sechium edule)is an agricultural crop in the Cucurbitaceae family that is rich in bioactive components.To enhance genetic research on chayote,we used Nanopore third-generation sequencing combined with Hi-C data to assemble a draft chayote genome.A chromosome-level assembly anchored on 14 chromosomes(N50 contig and scaffold sizes of 8.40 and 46.56 Mb,respectively)estimated the genome size as 606.42 Mb,which is large for the Cucurbitaceae,with 65.94%(401.08 Mb)ofthe genome comprising repetitive sequences;28,237 protein-coding genes were predicted.Comparative genome analysis indicated that chayote and snake gourd diverged from sponge gourd and that a whole-genome duplication(WGD)event occurred in chayote at 25±4 Mya.Transcriptional and metabolic analysis revealed genes involved in fruit texture,pigment,fl avor,fl avonoids,antioxidants,and plant hormones during chayote fruit development.The analysis of the genome,transcriptome,and metabolome provides insights into chayote evolution and lays the groundwork for future research on fruit and tuber development and genetic improvements in chayote.展开更多
Cerasus serrulata is a flowering cherry germplasm resource for ornamental purposes.In this work,we present a de novo chromosome-scale genome assembly of C.serrulata by the use of Nanopore and Hi-C sequencing technolog...Cerasus serrulata is a flowering cherry germplasm resource for ornamental purposes.In this work,we present a de novo chromosome-scale genome assembly of C.serrulata by the use of Nanopore and Hi-C sequencing technologies.The assembled C.serrulata genome is 265.40 Mb across 304 contigs and 67 scaffolds,with a contig N50 of 1.56 Mb and a scaffold N50 of 31.12 Mb.It contains 29,094 coding genes,27,611(94.90%)of which are annotated in at least one functional database.Synteny analysis indicated that C.serrulata and C.avium have 333 syntenic blocks composed of 14,072 genes.Blocks on chromosome 01 of C.serrulata are distributed on all chromosomes of C.avium,implying that chromosome 01 is the most ancient or active of the chromosomes.The comparative genomic analysis confirmed that C.serrulata has 740 expanded gene families,1031 contracted gene families,and 228 rapidly evolving gene families.By the use of 656 single-copy orthologs,a phylogenetic tree composed of 10 species was constructed.The present C.serrulata species diverged from Prunus yedoensis~17.34 million years ago(Mya),while the divergence of C.serrulata and C.avium was estimated to have occurred∼21.44 Mya.In addition,a total of 148 MADS-box family gene members were identified in C.serrulata,accompanying the loss of the AGL32 subfamily and the expansion of the SVP subfamily.The MYB and WRKY gene families comprising 372 and 66 genes could be divided into seven and eight subfamilies in C.serrulata,respectively,based on clustering analysis.Nine hundred forty-one plant disease-resistance genes(R-genes)were detected by searching C.serrulata within the PRGdb.This research provides high-quality genomic information about C.serrulata as well as insights into the evolutionary history of Cerasus species.展开更多
Saline lakes represent a particularly interesting aquatic environment for harboring high microbial diversity.However,the microbial diversity in different states and locations of saline lake is often overlooked.We stud...Saline lakes represent a particularly interesting aquatic environment for harboring high microbial diversity.However,the microbial diversity in different states and locations of saline lake is often overlooked.We studied for the first time the diversity and relative composition of the microbial communities in the Chagan Lake,NE China,and investigated the dif ferences in microbial species and physical and chemical factors in different geographical localities of the lake.After extracting the total DNA of the sample,we tested the library with the established library,sequenced the qualified library with Illumina HiSeq 2500,and studied the bacterial diversity by 16 S rRNA targeted metagenomics analysis.Results reveal that the highest microbial abundance in Chagan Lake at genus level was Proteobacterium followed by Actinomycetes and Bacteroides.In addition,we compared the microbial composition within the lake using alpha-and beta-diversity indices,showing that both diversity and evenness were the highest in the middle of the lake and lowest in the west of lake areas,and in the upper,middle,and lower depth of water columns,the low water column had the highest species number in the whole water environment,but the dif ference was not significant.We believe that physicochemical factors contribute to the formation of microbial community composition and diversity.In aquaculture industry,it is impossible apply horticulture for making full use of the spatial dif ferences in the microbial composition of the water.Therefore,combining cultured aquatic animal with the most suitable microbial species is a good way to boost the breeding ef fect for greater economic value.展开更多
Drought stress limits the growth and development of grapevines,thereby reducing productivity,but the mechanisms by which grapevines respond to drought stress remain largely uncharacterized.Here,we characterized a grou...Drought stress limits the growth and development of grapevines,thereby reducing productivity,but the mechanisms by which grapevines respond to drought stress remain largely uncharacterized.Here,we characterized a group A bZIP gene from“Kyoho”grapevine,VlbZIP30,which was shown to be induced by abscisic acid(ABA)and dehydration stress.Overexpression of VlbZIP30 in transgenic Arabidopsis thaliana enhanced dehydration tolerance.Transcriptome analysis revealed that a major proportion of ABA-responsive and/or drought-responsive genes are transcriptionally regulated by VlbZIP30 during ABA or mannitol treatment at the cotyledon greening stage.We identified an A.thaliana G-box motif(CACGTG)and a potential grapevine G-box motif(MCACGTGK)in the promoters of the 39 selected A.thaliana genes upregulated in the transgenic plants and in the 35 grapevine homologs,respectively.Subsequently,using two grapevine-related databases,we found that 74%(23/31)and 84%(21/25)of the detected grapevine genes were significantly upregulated by ABA and drought stress,respectively,suggesting that these genes are involved in ABA or dehydration stress and may be regulated by VlbZIP30 in grapevine.We propose that VlbZIP30 functions as a positive regulator of dehydration-responsive signaling in the ABA core signaling pathway.展开更多
Strawberry(Fragaria x ananassa)is an allopolyploid species with diverse and complex transcripts.The regulatory mechanisms of fruit development and maturation have been extensively studied;however,little is known about...Strawberry(Fragaria x ananassa)is an allopolyploid species with diverse and complex transcripts.The regulatory mechanisms of fruit development and maturation have been extensively studied;however,little is known about the signaling mechanisms that direct this process in octoploid strawberry(Fragaria x ananassa).Here,we used long-read sequencing(LRS)technology and RNA-seq analysis to investigate the diversity and complexity of the polyploid transcriptome and differentially expressed transcripts along four successive fruit developmental stages of cultivated strawberry.We obtained a reference transcriptome with 119,897 unique full-length isoforms,including 2017 new isoforms and 2510 long noncoding RNAs.Based on the genome of the plausible progenitor(Fragaria vesca),20,229 alternative splicing(AS)events were identified.Using this transcriptome,we found 17,485 differentially expressed transcripts during strawberry fruit development,including 527 transcription factors(TFs)belonging to 41 families.The expression profiles of all members of the auxin,ABA pathway,and anthocyanin biosynthesis gene families were also examined,and many of them were highly expressed at the ripe fruit stage,strongly indicating that the role of those genes is in the regulation of fruit ripening.We produce a high-quality reference transcriptome for octoploid strawberry,including much of the full-length transcript diversity,to help understand the regulatory mechanisms of fruit development and maturation of polyploid species,particularly via elucidation of the biochemical pathways involved in auxin,ABA,and anthocyanin biosynthesis.展开更多
Stevia(Stevia rebaudiana Bertoni)is well known for its very sweet steviol glycosides(SGs)consisting of a common tetracyclic diterpenoid steviol backbone and a variable glycone.Steviol glycosides are 150–300 times swe...Stevia(Stevia rebaudiana Bertoni)is well known for its very sweet steviol glycosides(SGs)consisting of a common tetracyclic diterpenoid steviol backbone and a variable glycone.Steviol glycosides are 150–300 times sweeter than sucrose and are used as natural zero-calorie sweeteners.However,the most promising compounds are biosynthesized in small amounts.Based on Illumina,PacBio,and Hi-C sequencing,we constructed a chromosome-level assembly of Stevia covering 1416 Mb with a contig N50 value of 616.85 kb and a scaffold N50 value of 106.55 Mb.More than four-fifths of the Stevia genome consisted of repetitive elements.We annotated 44,143 high-confidence protein-coding genes in the high-quality genome.Genome evolution analysis suggested that Stevia and sunflower diverged~29.4 million years ago(Mya),shortly after the whole-genome duplication(WGD)event(WGD-2,~32.1 Mya)that occurred in their common ancestor.Comparative genomic analysis revealed that the expanded genes in Stevia were mainly enriched for biosynthesis of specialized metabolites,especially biosynthesis of terpenoid backbones,and for further oxidation and glycosylation of these compounds.We further identified all candidate genes involved in SG biosynthesis.Collectively,our current findings on the Stevia reference genome will be very helpful for dissecting the evolutionary history of Stevia and for discovering novel genes contributing to SG biosynthesis and other important agronomic traits in future breeding programs.展开更多
Snake gourd(Trichosanthes anguina L.),which belongs to the Cucurbitaceae family,is a popular ornamental and food crop species with medicinal value and is grown in many parts of the world.Although progress has been mad...Snake gourd(Trichosanthes anguina L.),which belongs to the Cucurbitaceae family,is a popular ornamental and food crop species with medicinal value and is grown in many parts of the world.Although progress has been made in its genetic improvement,the organization,composition,and evolution of the snake gourd genome remain largely unknown.Here,we report a high-quality genome assembly for snake gourd,comprising 202 contigs,with a total size of 919.8 Mb and an N50 size of 20.1 Mb.These findings indicate that snake gourd has one of the largest genomes of Cucurbitaceae species sequenced to date.The snake gourd genome assembly harbors 22,874 protein-coding genes and 80.0%of the genome consists of repetitive sequences.Phylogenetic analysis reveals that snake gourd is closely related to sponge gourd but diverged from their common ancestor~33–47 million years ago.The genome sequence reported here serves as a valuable resource for snake gourd genetic research and comparative genomic studies in Cucurbitaceae and other plant species.In addition,fruit transcriptome analysis reveals the candidate genes related to quality traits during snake gourd fruit development and provides a basis for future research on snake gourd fruit development and ripening at the transcript level.展开更多
Over the last several hundred years,donkeys have adapted to high-altitude conditions on the Tibetan Plateau.Interestingly,the kiang,a closely related equid species,also inhabits this region.Previous reports have demon...Over the last several hundred years,donkeys have adapted to high-altitude conditions on the Tibetan Plateau.Interestingly,the kiang,a closely related equid species,also inhabits this region.Previous reports have demonstrated the importance of specific genes and adaptive introgression in divergent lineages for adaptation to hypoxic conditions on the Tibetan Plateau.Here,we assessed whether donkeys and kiangs adapted to the Tibetan Plateau via the same or different biological pathways and whether adaptive introgression has occurred.We assembled a de novo genome from a kiang individual and analyzed the genomes of five kiangs and 93 donkeys(including 24 from the Tibetan Plateau).Our analyses suggested the existence of a strong hard selective sweep at the EPAS1 locus in kiangs.In Tibetan donkeys,however,another gene,i.e.,EGLN1,was likely involved in their adaptation to high altitude.In addition,admixture analysis found no evidence for interspecific gene flow between kiangs and Tibetan donkeys.Our findings indicate that despite the short evolutionary time scale since the arrival of donkeys on the Tibetan Plateau,as well as the existence of a closely related species already adapted to hypoxia,Tibetan donkeys did not acquire adaptation via admixture but instead evolved adaptations via a different biological pathway.展开更多
Paper mulberry(Broussonetia papyrifera)is a well-known woody tree historically used for Cai Lun papermaking,one of the four great inventions of ancient China.More recently,Paper mulberry has also been used as forage t...Paper mulberry(Broussonetia papyrifera)is a well-known woody tree historically used for Cai Lun papermaking,one of the four great inventions of ancient China.More recently,Paper mulberry has also been used as forage to address the shortage of feedstuff because of its digestible crude fiber and high protein contents.In this study,we obtained a chromosome-scale genome assembly for Paper mulberry using integrated approaches,including Illumina and PacBio sequencing platform as well as Hi-C,optical,and genetic maps.The assembled Paper mulberry genome consists of 386.83 Mb,which is close to the estimated size,and 99.25%(383.93 Mb)of the assembly was assigned to 13 pseudochromosomes.Comparative genomic analysis revealed the expansion and contraction in the flavonoid and lignin biosynthetic gene families,respectively,accounting for the enhanced flavonoid and decreased lignin biosynthesis in Paper mulberry.Moreover,the increased ratio of syringyl-lignin to guaiacyl-lignin in Paper mulberry underscores its suitability for use in medicine,forage,papermaking,and barkcloth making.We also identified the rootassociated microbiota of Paper mulberry and found that Pseudomonas and Rhizobia were enriched in its roots and may provide the source of nitrogen for its stems and leaves via symbiotic nitrogen fixation.Collectively,these results suggest that Paper mulberry might have undergone adaptive evolution and recruited nitrogen-fixing microbes to promote growth by enhancing flavonoid production and altering lignin monomer composition.Our study provides significant insights into genetic basis of the usefulness of Paper mulberry in papermaking and barkcloth making,and as forage.These insights will facilitate further domestication and selection as well as industrial utilization of Paper mulberry worldwide.展开更多
N6-Methyladenine (6mA)DNA methylation has recently been implicated as a potential new epigenetic marker in eukaryotes,including the dioot modelArabidopsis thaliana.However,the conservation and divergence of 6mA distri...N6-Methyladenine (6mA)DNA methylation has recently been implicated as a potential new epigenetic marker in eukaryotes,including the dioot modelArabidopsis thaliana.However,the conservation and divergence of 6mA distribution patterns and functions in plants remain elusive.Here we report high-quality 6mA methylomes at single-nucleotide resolution in rice based on substantially improved genome sequences of two rice cultivars,Nipponbare (Nip;Japonica)and 93-11 (Indica).Analysis of 6mA genomic distribution and its association with transcription suggest that 6mA distribution and function is rather conserved between rice and Arabidopsis.We found that 6mA levels are positively correlated with the expression of key stressrelated genes,which may be responsible for the difference in stress tolerance between Nip and 93-11. Moreover,we showed that mutations in DDM1 cause defects in plant growth and decreased 6mA level. Our results reveal that 6mA is a conserved DNA modification that is positively associated with gene expression and contributes to key agronomic traits in plants.展开更多
Crop weediness,especially that of weedy rice(Oryza sativa f.spontanea),remains mysterious.Weedy rice possesses robust ecological adaptability;however,how this strain originated and gradually formed proprietary genetic...Crop weediness,especially that of weedy rice(Oryza sativa f.spontanea),remains mysterious.Weedy rice possesses robust ecological adaptability;however,how this strain originated and gradually formed proprietary genetic features remains unclear?Here,we demonstrate that weedy rice at Asian high latitudes(WRAH)is phylogenetically well defined and possesses unselected genomic characteristics in many divergence regions between weedy and cultivated rice.We also identified novel quantitative trait loci underlying weedy-specific traits,and revealed that a genome block on the end of chromosome 1 is associated with rice weediness.To identify the genomic modifications underlying weedy rice evolution,we generated the first de novo assembly of a high-quality weedy rice genome(WR04-6),and conducted a comparative genomics study between WR04-6 with other rice reference genomes.Multiple lines of evidence,including the results of demographic scenario comparisons,suggest that differentiation between weedy rice and cultivated rice was initiated by genetic improvement of cultivated rice and that the essence of weediness arose through semi-domestication.A plant height model further implied that the origin of WRAH can be modeled as an evolutionary game and indicated that strategy-based selection driven by fitness shaped its genomic diversity.展开更多
Chinese cabbage is the most consumed leafy crop in East Asian countries.However,premature bolting induced by continuous low temperatures severely decreases the yield and quality of the Chinese cabbage, and therefore r...Chinese cabbage is the most consumed leafy crop in East Asian countries.However,premature bolting induced by continuous low temperatures severely decreases the yield and quality of the Chinese cabbage, and therefore restricts its planting season and geographic distribution.In the past 40years,spring Chinese cabbage with strong winterness has been selected to meet the market demand.Here,we report a genome variation map of Chinese cabbage generated from the resequencing data of 194 geographically diverse accessions of three ecotypes.In-depth analyses of the selection sweeps and genome-wide patterns revealed that spring Chinese cabbage was selected from a specific population of autumn Chinese cabbage around the area of Shandong peninsula in northern China.We identified 23 genomic loci that underwent intensive selection,and further demonstrated by gene expression and haplotype analyses that the incorporation of elite alleles of VERNALISATION INSENTIVE 3.1(BrVIN3.1)and FLOWER LOCUS C 1(BrFLC1)is a determinant genetic source of variation during selection.Moreover,we showed that the quantitative response of BrVIN3.1 to cold due to the sequence variations in the cis elements of the BrVlN3.1 promoter significantly contributes to bolting-time variation in Chinese cabbage.Collectively, our study provides valuable insights into the genetic basis of spring Chinese cabbage selection and will facilitate the breeding of bolting-resistant Varieties by molecular-marker-assisted selection,transgenic or gene editingapproaches.展开更多
Rapeseed (Brassica napus),an important oilseed crop,has adapted to diverse climate zones and latitudes by forming three main ecotype groups,namely winter,semiwinter,and spring types. However,genetic variations underly...Rapeseed (Brassica napus),an important oilseed crop,has adapted to diverse climate zones and latitudes by forming three main ecotype groups,namely winter,semiwinter,and spring types. However,genetic variations underlying the divergence of these ecotypes are largely unknown. Here,we report the global pattern of genetic polymorphisms in rapeseed determined by resequencing a worldwide collection of 991 germplasm accessions.A total of 5.56 and 5.53 million singlenucleotide polymorphisms (SNPs)as Well as 1.86 and 1.92 million InDels were identified by mapping reads to the reference genomes of "Darmor-bzh"and "Tapidor,"respectively.We generated a map of allelic drift paths that shows splits and mixtures of the main populations,and revealed an asymmetric evolution of the two subgenomes of B.napus by calculating the genetic diversity and linkage disequilibrium parameters.Selective-sweep analysis revealed genetic changes in genes orthologous to those regulating various aspects of plant development and response to stresses.A genome-wide association study identified SNPs in the promoter regions of FLOWERING LOCUS T and FLOWERING LOCUS C orthologs that corresponded to the different rapeseed ecotype groups. Our study provides important insights into the genomic footprints of rapeseed evolution and flowering-time divergence among three ecotype groups,and will facilitate screening of molecular markers for accelerating rapeseed breeding.展开更多
基金This work is supported by the National Natural Science Foundation of China(NSFC grants 31630068 and 31722048)the National Program on Key Research Project(2016YFD0100307)+1 种基金the Science and Technology Innovation Program of the Chinese Academy of Agricultural Sciences,the Key Laboratory of Biology and Genetic Improvement of Horticultural Crops,Ministry of Agriculture,P.R.China,the Prospect of Shandong Seed Project,China(Shandong Gov.(2015)Reference No.212)the China Scholarship Council(CSC)(No.201503250081).
文摘Brassica rapa comprises several important cultivated vegetables and oil crops.Current reference genome assemblies of Brassica rapa are quite fragmented and not highly contiguous,thereby limiting extensive genetic and genomic analyses.Here,we report an improved assembly of the B.rapa genome(v3.0)using single-molecule sequencing,optical mapping,and chromosome conformation capture technologies(Hi-C).Relative to the previous reference genomes,our assembly features a contig N50 size of 1.45 Mb,representing a~30-fold improvement.We also identified a new event that occurred in the B.rapa genome~1.2 million years ago,when a long terminal repeat retrotransposon(LTR-RT)expanded.Further analysis refined the relationship of genome blocks and accurately located the centromeres in the B.rapa genome.The B.rapa genome v3.0 will serve as an important community resource for future genetic and genomic studies in B.rapa.This resource will facilitate breeding efforts in B.rapa,as well as comparative genomic analysis with other Brassica species.
文摘Since the publication of this article,the authors have noticed that the total gene models(45,985),tandem arrays(2077),tandem genes(4963),redundancy removed(43,099),syntenic genes(39,858),nonsyntenic genes(3241),genes on chromosomes(45,411),genes on scaffolds(574)of B.rapa reference genome v3.0 were mistaken in the article.
基金supported by the National Key Research and Development Program(2016YFD0101701)the National Vegetable Industry Technology System(CARS-23-A-06)+1 种基金the National Natural Science Foundation of China(31872106)the National Key Programs for Bok Choy Breeding of China(111821301354052283).
文摘Non-heading Chinese cabbage(NHCC)is an important leafy vegetable cultivated worldwide.Here,we report the first high-quality,chromosome-level genome of NHCC001 based on PacBio,Hi-C,and Illumina sequencing data.The assembled NHCC001 genome is 405.33 Mb in size with a contig N50 of 2.83 Mb and a scaffold N50 of 38.13 Mb.Approximately 53%of the assembled genome is composed of repetitive sequences,among which long terminal repeats(LTRs,20.42%of the genome)are the most abundant.Using Hi-C data,97.9%(396.83 Mb)of the sequences were assigned to 10 pseudochromosomes.Genome assessment showed that this B.rapa NHCC001 genome assembly is of better quality than other currently available B.rapa assemblies and that it contains 48,158 protein-coding genes,99.56%of which are annotated in at least one functional database.Comparative genomic analysis confirmed that B.rapa NHCC001 underwent a whole-genome triplication(WGT)event shared with other Brassica species that occurred after the WGD events shared with Arabidopsis.Genes related to ascorbic acid metabolism showed little variation among the three B.rapa subspecies.The numbers of genes involved in glucosinolate biosynthesis and catabolism were higher in NHCC001 than in Chiifu and Z1,due primarily to tandem duplication.The newly assembled genome will provide an important resource for research on B.rapa,especially B.rapa ssp.chinensis.
基金performed under the National Natural Science Foundation of China(No.31530064)the National Key Research and Development Plan(No.2018YFD1000405)+1 种基金the Beijing Science and Technology Project(No.Z191100008519002)the Major Research Achievement Cultivation Project of Beijing Forestry University(No.2017CGP012).
文摘Flower type is an important and extremely complicated trait of chrysanthemum.The corolla tube merged degree(CTMD)and the relative number of ray florets(RNRF)are the two key factors affecting chrysanthemum flower type.However,few reports have clarified the inheritance of these two complex traits,which limits directed breeding for flower-type improvement.In this study,305 F1 hybrids were obtained from two parents with obvious differences in CTMD and RNRF performance.Using specific-locus amplified fragment sequencing(SLAF-seq)technology,we constructed a high-density genetic linkage map with an average map distance of 0.76 cM.Three major QTLs controlling CTMD and four major QTLs underlying RNRF were repeatedly detected in the 2 years.Moreover,the synteny between the genetic map and other Compositae species was investigated,and weak collinearity was observed.In QTL regions with a high degree of genomic collinearity,eight annotated genes were probed in the Helianthus annuus L.and Lactuca sativa L.var.ramosa Hort.genomes.Furthermore,20 and 11 unigenes were identified via BLAST searches between the SNP markers of the QTL regions and the C.vestitum and C.lavandulifolium transcriptomes,respectively.These results lay a foundation for molecular marker-assisted breeding and candidate gene exploration in chrysanthemum without a reference assembly.
基金The National Key R&D Program of China(2018YFD1000603-3)Fundamental Research Funds for the Central Non-profit Research Institution of Chinese Academy of Forestry(CAFYBB2018QB001)。
文摘Olive(Olea europaea L.)is internationally renowned for its high-end product,extra virgin olive oil.An incomplete genome of O.europaea was previously obtained using shotgun sequencing in 2016.To further explore the genetic and breeding utilization of olive,an updated draft genome of olive was obtained using Oxford Nanopore third-generation sequencing and Hi-C technology.Seven different assembly strategies were used to assemble the fi nal genome of 1.30 Gb,with contig and scaffold N50 sizes of4.67 Mb and 42.60 Mb,respectively.This greatly increased the quality of the olive genome.We assembled 1.1 Gb of sequences of the total olive genome to 23 pseudochromosomes by Hi-C,and 53,518 protein-coding genes were predicted in the current assembly.Comparative genomics analyses,including gene family expansion and contraction,whole-genome replication,phylogenetic analysis,and positive selection,were performed.Based on the obtained high-quality olive genome,a total of nine gene families with 202 genes were identi fi ed in the oleuropein biosynthesis pathway,which is twice the number ofgenes identi fi ed from the previous data.This new accession of the olive genome is of suf fi cient quality for genome-wide studies on gene function in olive and has provided a foundation for the molecular breeding of olive species.
基金This research was financially supported by the National Key R&D Program of China(2018YFD1000605)the Tianjin Science Fund for Distinguished Young Scholars(18JCJQJC48300).
文摘Zanthoxylum bungeanum is an important spice and medicinal plant that is unique for its accumulation of abundant secondary metabolites,which create a characteristic aroma and tingling sensation in the mouth.Owing to the high proportion of repetitive sequences,high heterozygosity,and increased chromosome number of Z.bungeanum,the assembly of its chromosomal pseudomolecules is extremely challenging.Here,we present a genome sequence for Z.bungeanum,with a dramatically expanded size of 4.23 Gb,assembled into 68 chromosomes.This genome is approximately tenfold larger than that of its close relative Citrus sinensis.After the divergence of Zanthoxylum and Citrus,the lineage-specific whole-genome duplication event q-WGD approximately 26.8 million years ago(MYA)and the recent transposable element(TE)burst~6.41 MYA account for the substantial genome expansion in Z.bungeanum.The independent Zanthoxylum-specific WGD event was followed by numerous fusion/fission events that shaped the genomic architecture.Integrative genomic and transcriptomic analyses suggested that prominent speciesspecific gene family expansions and changes in gene expression have shaped the biosynthesis of sanshools,terpenoids,and anthocyanins,which contribute to the special flavor and appearance of Z.bungeanum.In summary,the reference genome provides a valuable model for studying the impact of WGDs with recent TE activity on gene gain and loss and genome reconstruction and provides resources to accelerate Zanthoxylum improvement.
基金the Beijing Municipal Science and Technology Commission(Z191100008619004 and Z191100004019010)the Key Project"Science and Technology Boost the Feonomy 2020",the Special Innovation Ability Construction Fund of Beijing Academy of Agricultural and Forestry Sciences(20180404 and 20200427)+4 种基金the China Agriculture Research System Project(CARS-23)the National Key Research and Ripening Program of China,Collaborative Innovation Center of Beijing Academy of Agricultural and Forestry Sciences(201915)the Young Investigat or Fund of Beijing Academy of Agricultural and Forestry Sciences(202016)the National Natural Scienee Foundation of China(31772022)the Natural Scienee Foundation of Beijing(6182016).
文摘Chayote(Sechium edule)is an agricultural crop in the Cucurbitaceae family that is rich in bioactive components.To enhance genetic research on chayote,we used Nanopore third-generation sequencing combined with Hi-C data to assemble a draft chayote genome.A chromosome-level assembly anchored on 14 chromosomes(N50 contig and scaffold sizes of 8.40 and 46.56 Mb,respectively)estimated the genome size as 606.42 Mb,which is large for the Cucurbitaceae,with 65.94%(401.08 Mb)ofthe genome comprising repetitive sequences;28,237 protein-coding genes were predicted.Comparative genome analysis indicated that chayote and snake gourd diverged from sponge gourd and that a whole-genome duplication(WGD)event occurred in chayote at 25±4 Mya.Transcriptional and metabolic analysis revealed genes involved in fruit texture,pigment,fl avor,fl avonoids,antioxidants,and plant hormones during chayote fruit development.The analysis of the genome,transcriptome,and metabolome provides insights into chayote evolution and lays the groundwork for future research on fruit and tuber development and genetic improvements in chayote.
基金supported by grants from the Science Foundation of Jiangsu Province,China(BE2017037)the Forestry Science and Technology Innovation and Popularization Project in Jiangsu Province,China(LYKJ[2017]14,LYKJ[2018]29,LYKJ[2018]43).
文摘Cerasus serrulata is a flowering cherry germplasm resource for ornamental purposes.In this work,we present a de novo chromosome-scale genome assembly of C.serrulata by the use of Nanopore and Hi-C sequencing technologies.The assembled C.serrulata genome is 265.40 Mb across 304 contigs and 67 scaffolds,with a contig N50 of 1.56 Mb and a scaffold N50 of 31.12 Mb.It contains 29,094 coding genes,27,611(94.90%)of which are annotated in at least one functional database.Synteny analysis indicated that C.serrulata and C.avium have 333 syntenic blocks composed of 14,072 genes.Blocks on chromosome 01 of C.serrulata are distributed on all chromosomes of C.avium,implying that chromosome 01 is the most ancient or active of the chromosomes.The comparative genomic analysis confirmed that C.serrulata has 740 expanded gene families,1031 contracted gene families,and 228 rapidly evolving gene families.By the use of 656 single-copy orthologs,a phylogenetic tree composed of 10 species was constructed.The present C.serrulata species diverged from Prunus yedoensis~17.34 million years ago(Mya),while the divergence of C.serrulata and C.avium was estimated to have occurred∼21.44 Mya.In addition,a total of 148 MADS-box family gene members were identified in C.serrulata,accompanying the loss of the AGL32 subfamily and the expansion of the SVP subfamily.The MYB and WRKY gene families comprising 372 and 66 genes could be divided into seven and eight subfamilies in C.serrulata,respectively,based on clustering analysis.Nine hundred forty-one plant disease-resistance genes(R-genes)were detected by searching C.serrulata within the PRGdb.This research provides high-quality genomic information about C.serrulata as well as insights into the evolutionary history of Cerasus species.
基金Supported by the National Natural Sciences Foundational of China(No.30972191)the 948 Program from the Ministry of Agriculture of China(No.2014Z34)
文摘Saline lakes represent a particularly interesting aquatic environment for harboring high microbial diversity.However,the microbial diversity in different states and locations of saline lake is often overlooked.We studied for the first time the diversity and relative composition of the microbial communities in the Chagan Lake,NE China,and investigated the dif ferences in microbial species and physical and chemical factors in different geographical localities of the lake.After extracting the total DNA of the sample,we tested the library with the established library,sequenced the qualified library with Illumina HiSeq 2500,and studied the bacterial diversity by 16 S rRNA targeted metagenomics analysis.Results reveal that the highest microbial abundance in Chagan Lake at genus level was Proteobacterium followed by Actinomycetes and Bacteroides.In addition,we compared the microbial composition within the lake using alpha-and beta-diversity indices,showing that both diversity and evenness were the highest in the middle of the lake and lowest in the west of lake areas,and in the upper,middle,and lower depth of water columns,the low water column had the highest species number in the whole water environment,but the dif ference was not significant.We believe that physicochemical factors contribute to the formation of microbial community composition and diversity.In aquaculture industry,it is impossible apply horticulture for making full use of the spatial dif ferences in the microbial composition of the water.Therefore,combining cultured aquatic animal with the most suitable microbial species is a good way to boost the breeding ef fect for greater economic value.
基金This work was supported by the National Natural Science Foundation of China(31572110)as well as the Program for Innovative Research Team of Grape Germplasm Resources and Breeding(2013KCT-25).
文摘Drought stress limits the growth and development of grapevines,thereby reducing productivity,but the mechanisms by which grapevines respond to drought stress remain largely uncharacterized.Here,we characterized a group A bZIP gene from“Kyoho”grapevine,VlbZIP30,which was shown to be induced by abscisic acid(ABA)and dehydration stress.Overexpression of VlbZIP30 in transgenic Arabidopsis thaliana enhanced dehydration tolerance.Transcriptome analysis revealed that a major proportion of ABA-responsive and/or drought-responsive genes are transcriptionally regulated by VlbZIP30 during ABA or mannitol treatment at the cotyledon greening stage.We identified an A.thaliana G-box motif(CACGTG)and a potential grapevine G-box motif(MCACGTGK)in the promoters of the 39 selected A.thaliana genes upregulated in the transgenic plants and in the 35 grapevine homologs,respectively.Subsequently,using two grapevine-related databases,we found that 74%(23/31)and 84%(21/25)of the detected grapevine genes were significantly upregulated by ABA and drought stress,respectively,suggesting that these genes are involved in ABA or dehydration stress and may be regulated by VlbZIP30 in grapevine.We propose that VlbZIP30 functions as a positive regulator of dehydration-responsive signaling in the ABA core signaling pathway.
文摘Strawberry(Fragaria x ananassa)is an allopolyploid species with diverse and complex transcripts.The regulatory mechanisms of fruit development and maturation have been extensively studied;however,little is known about the signaling mechanisms that direct this process in octoploid strawberry(Fragaria x ananassa).Here,we used long-read sequencing(LRS)technology and RNA-seq analysis to investigate the diversity and complexity of the polyploid transcriptome and differentially expressed transcripts along four successive fruit developmental stages of cultivated strawberry.We obtained a reference transcriptome with 119,897 unique full-length isoforms,including 2017 new isoforms and 2510 long noncoding RNAs.Based on the genome of the plausible progenitor(Fragaria vesca),20,229 alternative splicing(AS)events were identified.Using this transcriptome,we found 17,485 differentially expressed transcripts during strawberry fruit development,including 527 transcription factors(TFs)belonging to 41 families.The expression profiles of all members of the auxin,ABA pathway,and anthocyanin biosynthesis gene families were also examined,and many of them were highly expressed at the ripe fruit stage,strongly indicating that the role of those genes is in the regulation of fruit ripening.We produce a high-quality reference transcriptome for octoploid strawberry,including much of the full-length transcript diversity,to help understand the regulatory mechanisms of fruit development and maturation of polyploid species,particularly via elucidation of the biochemical pathways involved in auxin,ABA,and anthocyanin biosynthesis.
基金supported by the National Key Research and Development Program of China(2018YFD1000300)National Crop Germplasm Resources Protection of Ministry of Agriculture of China(NB2018-2130135-07)。
文摘Since the publication of this article,the authors have noticed that the part of acknowledgement is missing from article.Here is the acknowledgement.
基金the National Natural Science Foundation of China(31701497 and 31601371)the Natural Science Foundation of Jiangsu Province(BK20160600 and BK20180312),the Jiangsu Key Laboratory for the Research and Utilization of Plant Resources(JSPKLB201801 and JSPKLB201832).
文摘Stevia(Stevia rebaudiana Bertoni)is well known for its very sweet steviol glycosides(SGs)consisting of a common tetracyclic diterpenoid steviol backbone and a variable glycone.Steviol glycosides are 150–300 times sweeter than sucrose and are used as natural zero-calorie sweeteners.However,the most promising compounds are biosynthesized in small amounts.Based on Illumina,PacBio,and Hi-C sequencing,we constructed a chromosome-level assembly of Stevia covering 1416 Mb with a contig N50 value of 616.85 kb and a scaffold N50 value of 106.55 Mb.More than four-fifths of the Stevia genome consisted of repetitive elements.We annotated 44,143 high-confidence protein-coding genes in the high-quality genome.Genome evolution analysis suggested that Stevia and sunflower diverged~29.4 million years ago(Mya),shortly after the whole-genome duplication(WGD)event(WGD-2,~32.1 Mya)that occurred in their common ancestor.Comparative genomic analysis revealed that the expanded genes in Stevia were mainly enriched for biosynthesis of specialized metabolites,especially biosynthesis of terpenoid backbones,and for further oxidation and glycosylation of these compounds.We further identified all candidate genes involved in SG biosynthesis.Collectively,our current findings on the Stevia reference genome will be very helpful for dissecting the evolutionary history of Stevia and for discovering novel genes contributing to SG biosynthesis and other important agronomic traits in future breeding programs.
基金supported by grants from the Beijing Municipal Science and Technology Commission(Z191100008619004 and Z191100004019010)the Key Project of“Science and Technology Boost the Economy 2020,”the Special Innovation Ability Construction Fund of the Beijing Academy of Agricultural and Forestry Sciences(20180705 and 20200427)+5 种基金the China Agriculture Research System Project(CARS-23)the Collaborative Innovation Center of Beijing Academy of Agricultural and Forestry Sciences(201915)the Young Investigator Fund of the Beijing Academy of Agricultural and Forestry Sciences(202016)the National Natural Science Foundation of China(31772022)the Natural Science Foundation of Beijing(6182016)USDA National Institute of Food and Agriculture Specialty Crop Research Initiative(2015-51181-24285).
文摘Snake gourd(Trichosanthes anguina L.),which belongs to the Cucurbitaceae family,is a popular ornamental and food crop species with medicinal value and is grown in many parts of the world.Although progress has been made in its genetic improvement,the organization,composition,and evolution of the snake gourd genome remain largely unknown.Here,we report a high-quality genome assembly for snake gourd,comprising 202 contigs,with a total size of 919.8 Mb and an N50 size of 20.1 Mb.These findings indicate that snake gourd has one of the largest genomes of Cucurbitaceae species sequenced to date.The snake gourd genome assembly harbors 22,874 protein-coding genes and 80.0%of the genome consists of repetitive sequences.Phylogenetic analysis reveals that snake gourd is closely related to sponge gourd but diverged from their common ancestor~33–47 million years ago.The genome sequence reported here serves as a valuable resource for snake gourd genetic research and comparative genomic studies in Cucurbitaceae and other plant species.In addition,fruit transcriptome analysis reveals the candidate genes related to quality traits during snake gourd fruit development and provides a basis for future research on snake gourd fruit development and ripening at the transcript level.
基金supported by the National Natural Science Foundation of China (31621062)Strategic Priority Research Program of the Chinese Academy of Sciences (XDA2004010302)+4 种基金Second Tibetan Plateau Scientific Expedition and Research (STEP)Program (2019QZKK05010703)supported by the National Natural Science Foundation of China (91731304, 31822048)Strategic Priority Research Program of the Chinese Academy of Sciences (XDB13020600)Qinghai Department of Science and Technology Major ProjectState Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan,Yunnan University(2018KF001)supported by the Animal Branch of the Germplasm Bank of Wild Species,Chinese Academy of Sciences (Large Research Infrastructure Funding)
文摘Over the last several hundred years,donkeys have adapted to high-altitude conditions on the Tibetan Plateau.Interestingly,the kiang,a closely related equid species,also inhabits this region.Previous reports have demonstrated the importance of specific genes and adaptive introgression in divergent lineages for adaptation to hypoxic conditions on the Tibetan Plateau.Here,we assessed whether donkeys and kiangs adapted to the Tibetan Plateau via the same or different biological pathways and whether adaptive introgression has occurred.We assembled a de novo genome from a kiang individual and analyzed the genomes of five kiangs and 93 donkeys(including 24 from the Tibetan Plateau).Our analyses suggested the existence of a strong hard selective sweep at the EPAS1 locus in kiangs.In Tibetan donkeys,however,another gene,i.e.,EGLN1,was likely involved in their adaptation to high altitude.In addition,admixture analysis found no evidence for interspecific gene flow between kiangs and Tibetan donkeys.Our findings indicate that despite the short evolutionary time scale since the arrival of donkeys on the Tibetan Plateau,as well as the existence of a closely related species already adapted to hypoxia,Tibetan donkeys did not acquire adaptation via admixture but instead evolved adaptations via a different biological pathway.
基金the National Natural Science Foundation of China(31770360,31870247)the Poverty Relief Project of the Chinese Academy of Sciences(KFJ-FP-24)the Huimin Technology Demonstration Project of the Natio nal Modem Agricultural Science and Technology Achievements City(Z151100001015008).
文摘Paper mulberry(Broussonetia papyrifera)is a well-known woody tree historically used for Cai Lun papermaking,one of the four great inventions of ancient China.More recently,Paper mulberry has also been used as forage to address the shortage of feedstuff because of its digestible crude fiber and high protein contents.In this study,we obtained a chromosome-scale genome assembly for Paper mulberry using integrated approaches,including Illumina and PacBio sequencing platform as well as Hi-C,optical,and genetic maps.The assembled Paper mulberry genome consists of 386.83 Mb,which is close to the estimated size,and 99.25%(383.93 Mb)of the assembly was assigned to 13 pseudochromosomes.Comparative genomic analysis revealed the expansion and contraction in the flavonoid and lignin biosynthetic gene families,respectively,accounting for the enhanced flavonoid and decreased lignin biosynthesis in Paper mulberry.Moreover,the increased ratio of syringyl-lignin to guaiacyl-lignin in Paper mulberry underscores its suitability for use in medicine,forage,papermaking,and barkcloth making.We also identified the rootassociated microbiota of Paper mulberry and found that Pseudomonas and Rhizobia were enriched in its roots and may provide the source of nitrogen for its stems and leaves via symbiotic nitrogen fixation.Collectively,these results suggest that Paper mulberry might have undergone adaptive evolution and recruited nitrogen-fixing microbes to promote growth by enhancing flavonoid production and altering lignin monomer composition.Our study provides significant insights into genetic basis of the usefulness of Paper mulberry in papermaking and barkcloth making,and as forage.These insights will facilitate further domestication and selection as well as industrial utilization of Paper mulberry worldwide.
基金The Ministry of Science and Technology of the People's Republic of China (2016YFD0101001)to X.G.,National Natural Science Foundation of China (31871606,31671670)to X.G.,Recruitment program of Global Youth Expert of China to X.G.,and Fundamental Research funds for Central Non-profit Scientific Institution (1610392017001)to X.G.
文摘N6-Methyladenine (6mA)DNA methylation has recently been implicated as a potential new epigenetic marker in eukaryotes,including the dioot modelArabidopsis thaliana.However,the conservation and divergence of 6mA distribution patterns and functions in plants remain elusive.Here we report high-quality 6mA methylomes at single-nucleotide resolution in rice based on substantially improved genome sequences of two rice cultivars,Nipponbare (Nip;Japonica)and 93-11 (Indica).Analysis of 6mA genomic distribution and its association with transcription suggest that 6mA distribution and function is rather conserved between rice and Arabidopsis.We found that 6mA levels are positively correlated with the expression of key stressrelated genes,which may be responsible for the difference in stress tolerance between Nip and 93-11. Moreover,we showed that mutations in DDM1 cause defects in plant growth and decreased 6mA level. Our results reveal that 6mA is a conserved DNA modification that is positively associated with gene expression and contributes to key agronomic traits in plants.
基金the National Natural Science Foundation of China(grant nos.U1708231 and 31271687)the National Key R&D Program of China(grant no.2017YFD0100501).
文摘Crop weediness,especially that of weedy rice(Oryza sativa f.spontanea),remains mysterious.Weedy rice possesses robust ecological adaptability;however,how this strain originated and gradually formed proprietary genetic features remains unclear?Here,we demonstrate that weedy rice at Asian high latitudes(WRAH)is phylogenetically well defined and possesses unselected genomic characteristics in many divergence regions between weedy and cultivated rice.We also identified novel quantitative trait loci underlying weedy-specific traits,and revealed that a genome block on the end of chromosome 1 is associated with rice weediness.To identify the genomic modifications underlying weedy rice evolution,we generated the first de novo assembly of a high-quality weedy rice genome(WR04-6),and conducted a comparative genomics study between WR04-6 with other rice reference genomes.Multiple lines of evidence,including the results of demographic scenario comparisons,suggest that differentiation between weedy rice and cultivated rice was initiated by genetic improvement of cultivated rice and that the essence of weediness arose through semi-domestication.A plant height model further implied that the origin of WRAH can be modeled as an evolutionary game and indicated that strategy-based selection driven by fitness shaped its genomic diversity.
文摘Chinese cabbage is the most consumed leafy crop in East Asian countries.However,premature bolting induced by continuous low temperatures severely decreases the yield and quality of the Chinese cabbage, and therefore restricts its planting season and geographic distribution.In the past 40years,spring Chinese cabbage with strong winterness has been selected to meet the market demand.Here,we report a genome variation map of Chinese cabbage generated from the resequencing data of 194 geographically diverse accessions of three ecotypes.In-depth analyses of the selection sweeps and genome-wide patterns revealed that spring Chinese cabbage was selected from a specific population of autumn Chinese cabbage around the area of Shandong peninsula in northern China.We identified 23 genomic loci that underwent intensive selection,and further demonstrated by gene expression and haplotype analyses that the incorporation of elite alleles of VERNALISATION INSENTIVE 3.1(BrVIN3.1)and FLOWER LOCUS C 1(BrFLC1)is a determinant genetic source of variation during selection.Moreover,we showed that the quantitative response of BrVIN3.1 to cold due to the sequence variations in the cis elements of the BrVlN3.1 promoter significantly contributes to bolting-time variation in Chinese cabbage.Collectively, our study provides valuable insights into the genetic basis of spring Chinese cabbage selection and will facilitate the breeding of bolting-resistant Varieties by molecular-marker-assisted selection,transgenic or gene editingapproaches.
基金the National Key Basic Research Project (no. 2015CB150205)Natural Science Foundation of China,China (no. 31671597,31370313,31670283)+1 种基金Sino-German Science Center for Research Promotion,China (GZ 1099)Jiangsu Collaborative Innovation Center for Modern Crop Production,China,and the Singapore National Research Foundation Investigatorship Program,Singapore (NRF-NRFI2016-02).
文摘Rapeseed (Brassica napus),an important oilseed crop,has adapted to diverse climate zones and latitudes by forming three main ecotype groups,namely winter,semiwinter,and spring types. However,genetic variations underlying the divergence of these ecotypes are largely unknown. Here,we report the global pattern of genetic polymorphisms in rapeseed determined by resequencing a worldwide collection of 991 germplasm accessions.A total of 5.56 and 5.53 million singlenucleotide polymorphisms (SNPs)as Well as 1.86 and 1.92 million InDels were identified by mapping reads to the reference genomes of "Darmor-bzh"and "Tapidor,"respectively.We generated a map of allelic drift paths that shows splits and mixtures of the main populations,and revealed an asymmetric evolution of the two subgenomes of B.napus by calculating the genetic diversity and linkage disequilibrium parameters.Selective-sweep analysis revealed genetic changes in genes orthologous to those regulating various aspects of plant development and response to stresses.A genome-wide association study identified SNPs in the promoter regions of FLOWERING LOCUS T and FLOWERING LOCUS C orthologs that corresponded to the different rapeseed ecotype groups. Our study provides important insights into the genomic footprints of rapeseed evolution and flowering-time divergence among three ecotype groups,and will facilitate screening of molecular markers for accelerating rapeseed breeding.