Funding: Supported by the National Natural Science Foundation of China (31801856 to X.S.), the Hebei Province Higher Education Youth Talents Program (BJ2018016 to X.S.), the China-Hebei 100 Scholars Supporting Project (E2013100003 to X.W.), and the Natural Science Foundation of Hebei (C2017209103 to X.S.).
Abstract: Coriander (Coriandrum sativum L.), also known as cilantro, is a globally important vegetable and spice crop. Its genome and that of carrot are models for studying the evolution of the Apiaceae family. Here, we developed the Coriander Genomics Database (CGDB, http://cgdb.bio2db.com/) to collect, store, and integrate the genomic, transcriptomic, metabolic, functional annotation, and repeat sequence data of coriander and carrot to serve as a central online platform for Apiaceae and other related plants. Using these data sets in the CGDB, we found that seven transcription factor (TF) families showed significantly greater numbers of members in the coriander genome than in the carrot genome. The highest coriander-to-carrot ratio, 3.15, was for MADS TFs, followed by those for tubby protein (TUB) and heat shock factors. As a demonstration of CGDB applications, we identified 17 TUB family genes and conducted systematic comparative and evolutionary analyses. RNA-seq data deposited in the CGDB also suggest dose compensation effects of gene expression in coriander. CGDB allows bulk downloading, significance searches, genome browser analyses, and BLAST searches for comparisons between coriander and other plants regarding genomics, gene families, gene collinearity, gene expression, and the metabolome. A detailed user manual and contact information are also available to provide support to the scientific research community and address scientific questions. CGDB will be continuously updated, and new data will be integrated for comparative and functional genomic analysis in Apiaceae and other related plants.
Abstract: RNA editing is known to occur in human cells and can change the information transmitted from DNA to RNA and proteins. Most previous studies have focused on the editing of mRNAs. Here we report that several kinds of RNA, including miRNA, rRNA, mRNA, miscRNA, and unknown RNA, exhibited base editing in a human fetal liver. Several editing types were observed. Our data reveal that RNA editing may occur in different species of RNA.
Funding: National Natural Science Foundation of China (31501246, 31771841, 31801401), the Natural Science Foundation of Guangdong Province (2017A030311007), the Modern Agro-industry Technology Research System (CARS-14), the Science and Technology Planning Project of Guangdong Province (2015B020231006, 2015A020209051, 2016B020201003, 2016LM3161, 2016LM3164, 2014A020208060, and S2013020012647), the International Science & Technology Cooperation Program of Guangdong Province (2013B050800021), the Agricultural Science and Technology Program of Guangdong (2013B020301014), and the teamwork projects funded by the Natural Science Foundation of Guangdong Province (no. 2017A030312004).
Abstract: Cultivated peanut (Arachis hypogaea) is an allotetraploid crop planted in Asia, Africa, and America for edible oil and protein. To explore the origins and consequences of tetraploidy, we sequenced the allotetraploid A. hypogaea genome and compared it with the related diploid Arachis duranensis and Arachis ipaensis genomes. We annotated 39 888 A-subgenome genes and 41 526 B-subgenome genes in allotetraploid peanut. The A. hypogaea subgenomes have evolved asymmetrically, with the B subgenome resembling the ancestral state and the A subgenome undergoing more gene disruption, loss, conversion, and transposable element proliferation, and having reduced gene expression during seed development despite lacking genome-wide expression dominance. Genomic and transcriptomic analyses identified more than 2 500 oil metabolism-related genes and revealed that most of them show altered expression early in seed development while their expression ceases during desiccation, presenting a comprehensive map of peanut lipid biosynthesis. The availability of these genomic resources will facilitate a better understanding of the complex genome architecture, agronomically and economically important genes, and genetic improvement of peanut.
Funding: This work was supported equally by the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB31000000), the National Natural Science Foundation of China (grant numbers 31590821 and 91731301 to J.L. and 32070669 to X.W.), and the National Key Research and Development Program of China (2017YFC0505203 to Z.X.), and also by the Fundamental Research Funds for the Central Universities (SCU2019D013 and 2020SCUNL207) and the National High-Level Talents Special Support Plan (10 Thousand People Plan).
Abstract: Evidence of whole-genome duplications (WGDs) and subsequent karyotype changes has been detected in most major lineages of living organisms on Earth. To clarify the complex resulting multi-layered patterns of gene collinearity in genome analyses, there is a need for convenient and accurate toolkits. To meet this need, we developed WGDI (Whole-Genome Duplication Integrated analysis), a Python-based command-line tool that facilitates comprehensive analysis of recursive polyploidization events and cross-species genome alignments. WGDI supports three main workflows (polyploid inference, hierarchical inference of genomic homology, and ancestral chromosome karyotyping) that can improve the detection of WGD and characterization of WGD-related events based on high-quality chromosome-level genomes. Significantly, it can extract complete synteny blocks and facilitate reconstruction of detailed karyotype evolution. This toolkit is freely available at GitHub (https://github.com/SunPengChuan/wgdi). As an example of its application, WGDI convincingly clarified karyotype evolution in Aquilegia coerulea and Vitis vinifera following WGDs and rejected the hypothesis that Aquilegia contributed as a parental lineage to the allopolyploid origin of core dicots.
Funding: Sponsored by the National Key R&D Program of China (2021YFD1200105), the National Natural Science Foundation of China (3210150112), the Sichuan Province Science and Technology Support Program (2021YFH0086), the Shanghai Sailing Program (20YF1422000), and the Startup Fund for Youngman Research at Shanghai Jiao Tong University (20X100040052).
Abstract: A hallmark of adaptive evolution is innovation in gene function, which is associated with the development of distinct roles for genes during plant evolution; however, assessing functional innovation over long periods of time is not trivial. Tartary buckwheat (Fagopyrum tataricum) originated in the Himalayan region and has been exposed to intense UV-B radiation for a long time, making it an ideal species for studying novel UV-B response mechanisms in plants. Here, we developed a workflow to obtain a co-functional network of UV-B responses using data from more than 10,000 samples in more than 80 projects with multi-species and multi-omics data. Dissecting the entire network revealed that flavonoid biosynthesis was most significantly related to the UV-B response. Importantly, we found that the regulatory factor MYB4R1, which resides at the core of the network, has undergone neofunctionalization. In vitro and in vivo experiments demonstrated that MYB4R1 regulates flavonoid and anthocyanin accumulation in response to UV-B in buckwheat by binding to L-box motifs in the FtCHS, FtFLS, and FtUFGT promoters. We used deep learning to develop a visual discrimination model of buckwheat flavonoid content based on natural populations exposed to global UV-B radiation. Our study highlights the critical role of gene neofunctionalization in UV-B adaptation.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 61772189, 61772191) and the Hunan Provincial Natural Science Foundation of China (2019JJ40037).
Abstract: Cybersecurity has always been a focus of Internet research. A low-rate denial-of-service (LDoS) attack is an intelligent type of DoS attack that degrades the quality of network service by periodically sending high-speed but short-pulse attack traffic. Because the attack is well concealed and has a low average rate, traditional DoS detection methods struggle to detect it effectively. Existing LDoS detection methods generally suffer from a high false positive rate (FPR) and a high false negative rate (FNR). A cloud model-based LDoS attack detection method is proposed, and an SVM-based classifier is used to train on and classify the feature parameters. The detection method is verified and tested on the NS2 simulation platform and in a test-bed network environment. Compared with existing research results, the proposed method requires fewer samples and has lower FPR and FNR.
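The two error rates named in this abstract can be computed from labelled detection results as in the minimal sketch below. It is not the authors' evaluation code: the function name `fpr_fnr` and the 0/1 label encoding (0 for normal traffic, 1 for LDoS attack traffic) are assumptions made for illustration.

```python
from typing import Sequence, Tuple


def fpr_fnr(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (FPR, FNR) for binary labels: 0 = normal, 1 = attack (assumed encoding)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # attacks detected
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # attacks missed
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # normal flagged as attack
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # normal passed as normal

    # FPR = FP / (FP + TN); FNR = FN / (FN + TP). Guard against empty classes.
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    fnr = fn / (fn + tp) if (fn + tp) else 0.0
    return fpr, fnr


# Illustrative labels only, not data from the study.
truth = [1, 1, 1, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 1, 0, 0, 0, 0]
fpr, fnr = fpr_fnr(truth, predicted)
```

With these illustrative labels, one of five normal samples is flagged and one of three attacks is missed, so the FPR is 0.2 and the FNR is 1/3.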
Funding: The State Key Basic Research and Development Plan of China (2003CB715900) and the National Natural Science Foundation of China (90408015, 30121003, and 30430030).
Abstract: Cereal genes are classified into two distinct classes according to the guanine-cytosine (GC) content at the third codon sites (GC3). Natural selection and mutation bias have been proposed to affect the GC content. However, there has been controversy about the cause of GC variation. Here, we characterized the GC content of 1 092 paralogs and other single-copy genes in the duplicated chromosomal regions of the rice genome (ssp. indica) and classified the paralogs into GC3-rich and GC3-poor groups. By referring to out-group sequences from Arabidopsis and maize, we confirmed that the average synonymous substitution rate of the GC3-rich genes is significantly lower than that of the GC3-poor genes. Furthermore, we explored the other possible factors corresponding to the GC variation, including the length of coding sequences, the number of exons in each gene, the number of genes in each family, the location of genes on chromosomes, and the protein functions. Consequently, we propose that natural selection rather than mutation bias was the primary cause of the GC variation.
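GC3, the quantity used for the classification in this abstract, is the fraction of G or C bases at third codon positions. The sketch below shows one way to compute it for an in-frame coding sequence. The function names and the 0.5 cutoff for "GC3-rich" are illustrative assumptions; the abstract does not state the threshold the authors used.

```python
def gc3_content(cds: str) -> float:
    """Fraction of G/C at third codon positions of an in-frame coding sequence."""
    seq = cds.upper()
    usable = len(seq) - len(seq) % 3   # ignore a trailing partial codon
    third_bases = seq[2:usable:3]      # bases at positions 3, 6, 9, ...
    if not third_bases:
        raise ValueError("sequence is shorter than one codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)


def classify_gc3(cds: str, threshold: float = 0.5) -> str:
    """Label a gene GC3-rich or GC3-poor; the threshold is a hypothetical cutoff."""
    return "GC3-rich" if gc3_content(cds) >= threshold else "GC3-poor"


# Codons ATG, GCC, AAA have third bases G, C, A, so two of three are G or C.
example_gc3 = gc3_content("ATGGCCAAA")
example_label = classify_gc3("ATGGCCAAA")
```

The sketch counts every complete codon, including any stop codon present; a real analysis might exclude stop codons or ambiguous bases first.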
Funding: The Ministry of Science and Technology of the People's Republic of China (Grant No. 2016YFD0101001), the China National Science Foundation (Grant Nos. 31371282 to XW, 31510333 to JW, and 31661143009 to XW), the Natural Science Foundation of Hebei Province (Grant No. C2015209069 to JW), and the Tangshan Key Laboratory Project to XW.
Abstract: Lycophytes and seed plants constitute the typical vascular plants. Lycophytes have been thought to have undergone no paleo-polyploidization, although such events are known to have been critical for the fast expansion of seed plants. Here, genomic analyses including homologous gene dot plot analysis detected multiple paleo-polyploidization events, with one occurring approximately 13–15 million years ago (MYA) and another about 125–142 MYA, during the evolution of the genome of Selaginella moellendorffii, a model lycophyte. In addition, comparative analysis of reconstructed ancestral genomes of lycophytes and angiosperms suggested that lycophytes were affected by more paleo-polyploidization events than seed plants. Results from the present genomic analyses indicate that paleo-polyploidization has contributed to the successful establishment of both lineages of vascular plants, lycophytes and seed plants.
Funding: Supported by the High-performance Computing Platform of China Agricultural University.
Abstract: Repetitive DNA sequences are highly dynamic components of the genome. Most repetitive sequences are located in intergenic regions, but some are also located in coding sequences (CDSs) or pseudogenes (Hartl 2000). The functions of repetitive sequences can be related to human genetic diseases, bacterial virulence, adaptive evolution, structural aberrations, transcriptional activity, and many other aspects (Van Belkum et al. 1998; Jin et al. 2004; Kashi and King 2006).