"Synthetic"allopolyploids recreated by interspecific hybridization play an important role in providing novel genomic variation for crop improvement.Such synthetic allopolyploids often undergo rapid genomic s..."Synthetic"allopolyploids recreated by interspecific hybridization play an important role in providing novel genomic variation for crop improvement.Such synthetic allopolyploids often undergo rapid genomic structural variation(SV).However,how such SV arises,is inherited and fixed,and how it affects important traits,has rarely been comprehensively and quantitively studied in advanced generation synthetic lines.A better understanding of these processes will aid breeders in knowing how to best utilize synthetic allopolyploids in breeding programs.Here,we analyzed three genetic mapping populations(735 DH lines)derived from crosses between advanced synthetic and conventional Brassica napus(rapeseed)lines,using whole-genome sequencing to determine genome composition.We observed high tolerance of large structural variants,particularly toward the telomeres,and preferential selection for balanced homoeologous exchanges(duplication/deletion events between the A and C genomes resulting in retention of gene/chromosome dosage between homoeologous chromosome pairs),including stable events involving whole chromosomes("pseudoeuploidy").Given the experimental design(all three populations shared a common parent),we were able to observe that parental SV was regularly inherited,showed genetic hitchhiking effects on segregation,and was one of the major factors inducing adjacent novel and larger SV.Surprisingly,novel SV occurred at low frequencies with no significant impacts on observed fertility and yield-related traits in the advanced generation synthetic lines.However,incorporating genome-wide SV in linkage mapping explained significantly more genetic variance for traits.Our results provide a framework for detecting and understanding the occurrence and inheritance of genomic SV in breeding programs,and support the use of synthetic parents as an important source of novel trait variation.展开更多
The genus Trigonotis comprises nearly 60 species mainly distributed in East and Southeast Asia.China has the largest number of Trigonotis species in the world,with a total of 44 species,of which 38 are endemic.Nutlet ...The genus Trigonotis comprises nearly 60 species mainly distributed in East and Southeast Asia.China has the largest number of Trigonotis species in the world,with a total of 44 species,of which 38 are endemic.Nutlet morphology is useful for the taxonomic delimitation of Trigonotis.However,there are still controversial circumscriptions of nutlet shape in some species.In previous studies,interspecies phylogenetic relationships were inferred using few DNA markers and very few taxa,which possibly led to erroneous or incomplete conclusions.In this study,the nutlet morphology of 39 Trigonotis taxa and the characteristics of 34 complete chloroplast genomes(29 taxa)were investigated and analyzed.Then,the phylogenetic relationships were discussed within this genus based on complete chloroplast genomes.To the best of our knowledge,this study is the first comprehensive analysis of nutlet morphology and complete chloroplast genome of Trigonotis.Based on nutlet morphology,Trigonotis can be divided into two groups:Group 1,hemispherical or oblique tetrahedron with carpopodiums,and Group 2,inverted tetrahedron without carpopodiums.The chloroplast genome of Trigonotis exhibited a typical quadripartite structure,including 84-86 protein-coding,37 transfer RNA,and 8 ribosomal RNA genes,with a total length of 147,247-148,986 bp.Genes in the junctions were well conserved in Trigonotis,similar to those in other Boraginaceae s.str.species.Furthermore,Trigonotis chloroplast genomes showed relatively high diversity,with more conserved genic regions than intergenic regions;in addition,we detected 14 hot spots(Pi>0.005)in non-coding regions.Phylogenetic analyses based on chloroplast genome data identified highly resolved relationships between Trigonotis species.Specifically,Trigonotis was divided into two clades with strong support:one clade included species with hemispherical or oblique tetrahedron nutlets with carpopodiums and bracts,whereas the other clade included species with inverted tetrahedron nutlets without carpopodiums or bracts.Our results may inform future taxonomic,phylogenetic,and evolutionary studies on Boraginaceae.展开更多
Previous research reveals that the genome structures of rhizobial type strains and reference strains can reflect their phylogenetic relationships. In order to further explore the potential application of genome struct...Previous research reveals that the genome structures of rhizobial type strains and reference strains can reflect their phylogenetic relationships. In order to further explore the potential application of genome structure as a phylogenetic marker in rhizobial natural taxonomy, this study analyzed the genome structures of 29 unclassified nodule bacteria isolated from the root nodules of leguminous trees, Robinia sp., Dalbergia spp., and A lbizia spp. and 7 rhizobial reference strains by I-CeuI cleavage, then clustered these bacteria phylogenetically based on their genome structures and compared these clusters with those based on numerical taxonomy and 16S rDNA PCR-RFLP. Eleven phylogenetic clusters were obtained, The clusters were in large part consistent with those based on numerical taxonomy and 16S rDNA PCR-RFLP. Also there are inconsistent clusters based on the above three methods. But results are completely consistent with 16S rRNA clusters. This suggested that the genome structure clustering method can be used to lastly identify root nodule isolates and detect their phylogenetic relationships. The credibility and repeatability of the results, together with the simplicity and possibility to analyze a large number of strains in a short time of the method, indicates the broad potential application of genome structure as phylogenetic marker to categorize rhizobial isolates and should in the future facilitate biodiversity studies.展开更多
Seahorses have evolved many unique biological traits,including a male brood pouch,the absence of caudal and pelvic fins,and the lack of spleen and gut-associated lymphatic tissue.The mitogenactivated protein kinases(M...Seahorses have evolved many unique biological traits,including a male brood pouch,the absence of caudal and pelvic fins,and the lack of spleen and gut-associated lymphatic tissue.The mitogenactivated protein kinases(MAPKs)are known to be involved in various important biological processes including growth,differentiation,immunity,and stress responses.Therefore,we hypothesized that the adaptive evolution and expression of the MAPK gene family in seahorse may differ from those of other teleost species.We identified positive selection sites in the erk2,erk5,jnk1,and p38αMAPK genes of the lined seahorse Hippocampus erectus and tiger-tailed seahorse Hippocampus comes.A novel expression profile of MAPK cascade genes was found in seahorse larvae during the first day after birth based on the RNA-seq data of H.erectus,which refl ected vital signs of immune response to its parental immune system.The expression patterns of the four positively selected MAPK genes were analyzed following the bacterial challenge of Vibrio fortis,revealing their upregulation pattern in brood pouch and other immune tissues.This study enriched our knowledge of the evolution of the H.erectus MAPK subfamilies,and could help better understanding the functional role of MAPKs in teleosts.展开更多
Arcobacter is an emerging foodborne pathogen worldwide.In this study,the prevalence,antimicrobial susceptibility and genetic characteristics of Arcobacter from different sources were investigated.Eighteen A.butzleri i...Arcobacter is an emerging foodborne pathogen worldwide.In this study,the prevalence,antimicrobial susceptibility and genetic characteristics of Arcobacter from different sources were investigated.Eighteen A.butzleri isolates were obtained from 60 raw chicken meat samples(16/60,27%)and 150 patients with diarrhea(2/150,1.3%).The resistance ratios to nalidixic acid,ciprofloxacin,clindamycin,chloramphenicol,and florfenicol were 83.33%(15/18),38.89%(7/18),38.89%(7/18),33.33%(6/18)and 33.33%(6/18),respectively.We performed whole genome sequencing of the 18 isolates,and we predicted antibiotic resistance genes and virulence factors by using assembled genomes through blastx analysis.Two resistance genes,blaOXA-464 and tet(H),and the C254T mutation in gyrA,were identified in the genomes of some resistant isolates.Furthermore,virulence genes,such as flgG,flhA,flhB,fliI,fliP,motA,cadF,cjl349,ciaB,mviN,pldA and tlyA,were found in all strains,whereas hecA,hecB and iroE were found in only some strains.Phylogenetic tree analysis of A.butzleri isolates on the basis of the core-genome single nucleotide polymorphisms showed that two isolates from patients with diarrhea clustered together,separately from the isolates from raw chicken and the chicken strains.This study is the first comprehensive analysis of Arcobacter isolated in Beijing.展开更多
Five genes encoding heat shock proteins(HSPs), Cchsp40, Cchsp60, Cchsp70, Cchsc70 and Cchsp90, were cloned and sequenced from Cotesia chilonis using RT-PCR and RACE. The c DNA sequences of Cchsp40, Cchsp60, Cchsp70,...Five genes encoding heat shock proteins(HSPs), Cchsp40, Cchsp60, Cchsp70, Cchsc70 and Cchsp90, were cloned and sequenced from Cotesia chilonis using RT-PCR and RACE. The c DNA sequences of Cchsp40, Cchsp60, Cchsp70, Cchsc70 and Cchsp90 were 1 265, 2 551, 2 094, 2 297 and 2 635 bp in length, respectively, with a molecular weight(MW) of 39.1, 60.6, 71.45, 70.19 and 82.92 k Da, respectively. The predicted amino acid sequences of these proteins showed high similarities with published HSPs of other insects in Hymenoptera. Analysis of genomic DNAs indicated that Cchsp40, Cchsp60, Cchsp70 and Cchsp90 lacked introns, but Cchsc70 contained an intron. The results also suggested that CcH SP40 in C. chilonis was the Type II HSP40, Cc HSP60 was a member of the mitochondrial HSP60 family, and Cc HSP90 was a part of cytoplasmic HSP90 A family. Expression patterns varied in the five Cchsps in response to temperature. Expression of Cchsp40 and Cchsp60 was induced significantly by cold but not heat stress. Cchsp70 and Cchsc70 showed similar response to the thermal stress and could be induced by both cold and heat, but their expression levels were consistently lower than that of Cchsp40 and Cchsp60. Cchsp90 could be induced by heat stress and mild cold, but not cold stress. In addition, the results demonstrated Cchsc70 might be constitutive and inducible protein that was expressed during normal cell functioning and also up-regulated in response to stressful stimuli while Cchsp70 was solely inducible protein induced by temperature changes. Overall, results generated from this study could significantly advance the understanding of Cchsps in response to temperature and provide important biological information for C. chilonis insects that reared under different temperatures.展开更多
The cultivated soybean(Glycine max(L.) Merr.) was distinguished from its wild progenitor Glycine soja Sieb.& Zucc.in growth period structure,by a shorter vegetative phase(V),a prolonged reproductive phase(R) ...The cultivated soybean(Glycine max(L.) Merr.) was distinguished from its wild progenitor Glycine soja Sieb.& Zucc.in growth period structure,by a shorter vegetative phase(V),a prolonged reproductive phase(R) and hence a larger R/V ratio.However,the genetic basis of the domestication of soybean from wild materials is unclear.Here,a panel of 123 cultivated and 97 wild accessions were genotyped using a set of 24 presence/absence variants(PAVs) while at the same time the materials were phenotyped with respect to flowering and maturity times at two trial sites located at very different latitudes.The major result of this study showed that variation at PAVs is informative for assessing patterns of genetic diversity in Glycine spp.The genotyping was largely consistent with the taxonomic status,although a few accessions were intermediate between the two major clades identified.Allelic diversity was much higher in the wild germplasm than in the cultivated materials.A significant domestication signal was detected at 11 of the PAVs at 0.01 level.In particular,this study has provided information for revealing the genetic basis of photoperiodism which was a prominent feature for the domestication of soybean.A significant marker-trait association with R/V ratio was detected at 14 of the PAVs,but stripping out population structure reduced this to three.These results will provide markers information for further finding of R/V related genes that can help to understand the domestication process and introgress novel genes in wild soybean to broaden the genetic base of modern soybean cultivars.展开更多
F-type lectins(Fucolectins)are carbohydrate-binding proteins and play important roles in innate immune responses against pathogenic microbial invasion.In our previous research,we found that two homologous Fucolectin g...F-type lectins(Fucolectins)are carbohydrate-binding proteins and play important roles in innate immune responses against pathogenic microbial invasion.In our previous research,we found that two homologous Fucolectin genes,AjFTL-1 and AjFTL-2,exhibited different expression profiles after lipopolysaccharides(LPS)challenge in Apostichopus japonicus.However,the transcriptional regulation mechanism of these two genes remains largely unknown.In this study,the 5'flanking regions of Aj FTL-1 and Aj FTL-2 genes were cloned and the promoter activities were studied in epithelioma papulosum cyprinid(EPC)cell system.First,in silico analysis indicated that these two promoters both contain numerous putative transcription factor binding sites including NF-κB,CREB,and CREBP1,and both contain a TATA box.Additionally,luciferase assay and progressive 5'truncation analysis revealed that AjFTL-1 and AjFTL-2 both possess high promoter activities in EPC cells.Moreover,the luciferase activity of AjFTL-1 promoter was significantly regulated by peptidoglycan(PGN)and mannan(MAN),while AjFTL-2 promoter was prominently regulated by LPS and MAN,indicating Aj FTL-1 and Aj FTL-2 genes showed different transcriptional regulation pattern under different immune stimulation.More importantly,analyses of the functional promoter regions revealed the presence of two potential NF-κB binding sites(-769 bp to-761 bp,-185 bp to-172 bp)in Aj FTL-1 and one potential binding site(-530 to-517 bp)in Aj FTL-2.Different truncated reporter vectors and expression vector co-transfection revealed that transcription factor NF-κB/Rel could significantly increase the AjFTL-2 promoter activity,but not AjFTL-1 promoter activity.These findings indicated that in marine invertebrates,different Fucolectin members differ in transcription regulations and expression patterns,and might play different roles in immune defenses during pathogen infection.展开更多
In this study, by analysis of genome structures of E. coli, the relationships between the genomic types of E. coli and the associated diseases were investigated. Samples of sputum, urine and other excretions from pati...In this study, by analysis of genome structures of E. coli, the relationships between the genomic types of E. coli and the associated diseases were investigated. Samples of sputum, urine and other excretions from patients with different infective diseases were collected. And 62 E. coli strains were isolated from these samples. Intact bacterial genomic DNA was cleaved with I-CeuI, separated by pulsed field gel electrophoresis and then typed on the basis of cleavage map. The results showed that 7 I-CeuI sites were found in all the genome structures of the 62 E. coli, indicating that there were 7 rrn operons in the genomes. The size of genome ranged from 4500 kb to 5000 kb. According to the genome structures, 62 E. coli strains were divided into 30 genome types. It was concluded that genome structures of E. coli isolated from the patients with different infective diseases varied to some extent, suggesting that some genome types of E. coli were closely related to some infective diseases.展开更多
In December 2019,a group of people in Wuhan city of Hubei province of China were found to be affected by an infection called dark etiology pneumonia.The outbreak of this pneumonia infection was declared a deadly disea...In December 2019,a group of people in Wuhan city of Hubei province of China were found to be affected by an infection called dark etiology pneumonia.The outbreak of this pneumonia infection was declared a deadly disease by the China Center for Disease Control and Prevention on January 9,2020,named Novel Coronavirus 2019(nCoV-2019).This nCoV-2019 is now known as COVID-19.There is a big list of infections of this coronavirus which is present in the form of a big family.This virus can cause several diseases that usually develop with a serious problem.According to the World Health Organization(WHO),2019-nCoV has been placed as the modern generation of Severe Acute Respiratory Syndrome(SARS)and Middle East Respiratory Syndrome(MERS)coronaviruses,so COVID-19 can repeatedly change its internal genome structure to extend its existence.Understanding and accurately predicting the mutational properties of the genome structure of COVID-19 can form a good leadership role in preventing and fighting against coronavirus.In this research paper,an analytical approach has been presented which is based on the k-means cluster technique of machine learning to find the clusters over the mutational properties of the COVID-19 viruses’complete genome.This method would be able to act as a promising tool to monitor and track pathogenic infections in their stable and local genetics/hereditary varieties.This paper identifies five main clusters of mutations with k=5 as best in most cases in the coronavirus that could help scientists and researchers develop disease control vaccines for the transformation of coronaviruses.展开更多
Upland cotton has the highest yield,and accounts for >95% of world cotton production.Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural,functional,and evoluti...Upland cotton has the highest yield,and accounts for >95% of world cotton production.Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural,functional,and evolutionary studies of the species.Here,we employed GeneTrek and展开更多
Since the completion of the rice genome sequencing project in 2005,we have entered the era of rice genomics,which is still in its ascendancy.Rice genomics studies can be classified into three stages:structural genomic...Since the completion of the rice genome sequencing project in 2005,we have entered the era of rice genomics,which is still in its ascendancy.Rice genomics studies can be classified into three stages:structural genomics,functional genomics,and quantitative genomics.Structural genomics refers primarily to genome sequencing for the construction of a complete map of rice genome sequence.This is fundamental for rice genetics and molecular biology research.Functional genomics aims to decode the functions of rice genes.Quantitative genomics is large-scale sequence-and statistics-based research to define the quantitative traits and genetic features of rice populations.Rice genomics has been a transformative influence on rice biological research and contributes significantly to rice breeding,making rice a good model plant for studying crop sciences.展开更多
In order to evaluate to what extent will genomics and in silico related technologies improve overall drug discovery process, we analyzed three studies comparing cost, time and attrition rate at each step of the dr...In order to evaluate to what extent will genomics and in silico related technologies improve overall drug discovery process, we analyzed three studies comparing cost, time and attrition rate at each step of the drug discovery process, between standard pharmaceutical and genomics based approaches.展开更多
Genomic structural variations (SVs), particularly insertions, deletions and inversions, can contribute to the heterogeneity of millions of nucleotides within a genome, and are likely to make an important contributio...Genomic structural variations (SVs), particularly insertions, deletions and inversions, can contribute to the heterogeneity of millions of nucleotides within a genome, and are likely to make an important contribution to biological diversity and phenotypic variation (Alkan et al., 2011; Bickhart and Liu, 2014). With the rapid development of the next-generation sequencing technologies and the new assembly methodolo- gies, the multiple de novo assemblies of genomes within a species allow researchers to explore more detailed SV maps (Li et al., 2011). Compared with the traditional read depth algorithm using the whole-genome resequencing approach and array-based technologies (Baker, 2012; Wang et al., 2012;展开更多
Common wheat (Triticum aestivum L.) is one of the most important crops, and intra-specific wheat hybrids have obvious heterosis in yield and protein quality. Therefore, utilization of hybrid wheat varieties offers a...Common wheat (Triticum aestivum L.) is one of the most important crops, and intra-specific wheat hybrids have obvious heterosis in yield and protein quality. Therefore, utilization of hybrid wheat varieties offers an effective way to increase yield and nutrition. Cytoplasmic male sterility (CMS) systems are a useful genetic tool for hybrid crop breeding, and are ideal models for studying the genetic interaction and cooperative function of mitochondrial and nuclear genomes in plants (Schnable and Wise, 1998; Hanson and Bentolila, 2004).展开更多
The 14-3-3 protein family is among the most extensively studied, yet still largely mysterious protein families in mammals to date. As they are well recognized for their roles in apoptosis, cell cycle regulation, and p...The 14-3-3 protein family is among the most extensively studied, yet still largely mysterious protein families in mammals to date. As they are well recognized for their roles in apoptosis, cell cycle regulation, and proliferation in healthy cells, aberrant 14-3-3 expression has unsurprisingly emerged as instrumentalin the development of many cancers and in prognosis. Interestingly, while the seven known 14-3-3 isoforms in humans have many similar functions across cell types, evidence of isoform-specific functions and localization has been observed in both healthy and diseased cells The strikingly high similarity among 14-3-3 isoforms has made it difficult to delineate isoform-specific functions and for isoform-specific targeting. Here, we review our knowledge of 14-3-3 interactome(s) generated by high- throughput techniques, bioinformatics, structural genomics and chemical genornics and point out that integrating the information with molecular dynamics (MD) simulations may bring us new opportunity to the design of isoform-specific inhibitors, which can not only be used as powerful research tools for delineating distinct interactomes of individual 14-3-3 isoforms, but also can serve as potential new anti-cancer drugs that selectively target aberrant 14-3-3 isoform.展开更多
Plants produce a remarkable diversity of structurally and functionally diverse natural chemicals that serve as adaptive compounds throughout their life cycles.However,unlocking this metabolic diversity is significantl...Plants produce a remarkable diversity of structurally and functionally diverse natural chemicals that serve as adaptive compounds throughout their life cycles.However,unlocking this metabolic diversity is significantly impeded by the size,complexity,and abundant repetitive elements of typical plant genomes.As genome sequencing becomes routine,we anticipate that links between metabolic diversity and genetic variation will be strengthened.In addition,an ever-increasing number of plant genomes have revealed that biosynthetic gene clusters are not only a hallmark of microbes and fungi;gene clusters for various classes of compounds have also been found in plants,and many are associated with important agronomic traits.We present recent examples of plant metabolic diversification that have been discovered through the exploration and exploitation of various genomic and pan-genomic data.We also draw attention to the fundamental genomic and pan-genomic basis of plant chemodiversity and discuss challenges and future perspectives for investigating metabolic diversity in the coming pan-genomics era.展开更多
Methoprene-tolerant (Met) gene has been found to be involved in juvenile hormone (JH) action in insects. Herein, we isolated a silkworm (Bombyx mori) homolog of Met gene from Drosophila melanogaster using bio-in...Methoprene-tolerant (Met) gene has been found to be involved in juvenile hormone (JH) action in insects. Herein, we isolated a silkworm (Bombyx mori) homolog of Met gene from Drosophila melanogaster using bio-informatics analysis and rapid amplification of cDNA ends - polymerase chain reaction method, and defined it as BmMet. The full-length cDNA ofBmMet gene consists of 1 917 nucleotides and includes a 1 368 bp of open reading frame for a deduced protein of 455 amino acids. All deduced protein sequences from Met genes in B. mori and other surveyed insects contain four typical domains ofbHLH, PAS-A, PAS-B and PAC, highlighting a high sequence conservation of Met genes during insect evolution. Also, genomic structure and phylogenic analysis suggested that Met in both B. mori and Drosophila species may originate from an ancestor gene with gce, another member of bHLH-PAS family, via gene duplication. In addition, BmMet was detected in all surveyed tissues and throughout the whole life of silkworm at transcriptional levels. Furthermore, silkworm individuals with RNAi silencing of BmMet gene in the early stage of the fourth instar larvae could molt normally and pupate successfully. This result was different from the observation in T. castaneum but similar to that in D. melanogaster after Met knockdown, revealing that the action mode of Met in B. mori and D. melanogaster should be divergent with that in other insect species.展开更多
Background: In eukaryotic genome, chromatin is not randomly distributed in cell nuclei, but instead is organized into higher-order structures. Emerging evidence indicates that these higher-order chromatin structures ...Background: In eukaryotic genome, chromatin is not randomly distributed in cell nuclei, but instead is organized into higher-order structures. Emerging evidence indicates that these higher-order chromatin structures play important roles in regulating genome functions such as transcription and DNA replication. With the advancement in 3C (chromosome conformation capture) based technologies, Hi-C has been widely used to investigate genome-wide long- range chromatin interactions during cellular differentiation and oncogenesis. Since the first publication of Hi-C assay in 2009, lots of bioinformatic tools have been implemented for processing Hi-C data from mapping raw reads to normalizing contact matrix and high interpretation, either providing a whole workflow pipeline or focusing on a particular process. Results: This article reviews the general Hi-C data processing workflow and the currently popular Hi-C data processing tools. We highlight on how these tools are used for a full interpretation of Hi-C results. Conclusions: Hi-C assay is a powerful tool to investigate the higher-order chromatin structure. Continued development of novel methods for Hi-C data analysis will be necessary for better understanding the regulatory function of genome organization.展开更多
This article addresses how the functionalities of the cellular machinery of a bacterium might have constrained the genomic arrangement of its genes during evolution and how we can study such problems using computation...This article addresses how the functionalities of the cellular machinery of a bacterium might have constrained the genomic arrangement of its genes during evolution and how we can study such problems using computational approaches, taking full advantage of the rapidly increasing pool of the sequenced bacterial genomes, potentially leading to a much improved understanding of why a bacterial genome is organized in the way it is. This article discusses a number of challenging computational problems in elucidating the genomic structures at multiple levels and the information that is encoded through these genomic structures, gearing towards the ultimate understanding of the governing rules of bacterial genome organization.展开更多
基金supported by the National Natural Science Foundation of China(NSFC,31970564,32000397,32171982)the Fundamental Research Funds for the Central Universities(2662023PY004)。
文摘"Synthetic"allopolyploids recreated by interspecific hybridization play an important role in providing novel genomic variation for crop improvement.Such synthetic allopolyploids often undergo rapid genomic structural variation(SV).However,how such SV arises,is inherited and fixed,and how it affects important traits,has rarely been comprehensively and quantitively studied in advanced generation synthetic lines.A better understanding of these processes will aid breeders in knowing how to best utilize synthetic allopolyploids in breeding programs.Here,we analyzed three genetic mapping populations(735 DH lines)derived from crosses between advanced synthetic and conventional Brassica napus(rapeseed)lines,using whole-genome sequencing to determine genome composition.We observed high tolerance of large structural variants,particularly toward the telomeres,and preferential selection for balanced homoeologous exchanges(duplication/deletion events between the A and C genomes resulting in retention of gene/chromosome dosage between homoeologous chromosome pairs),including stable events involving whole chromosomes("pseudoeuploidy").Given the experimental design(all three populations shared a common parent),we were able to observe that parental SV was regularly inherited,showed genetic hitchhiking effects on segregation,and was one of the major factors inducing adjacent novel and larger SV.Surprisingly,novel SV occurred at low frequencies with no significant impacts on observed fertility and yield-related traits in the advanced generation synthetic lines.However,incorporating genome-wide SV in linkage mapping explained significantly more genetic variance for traits.Our results provide a framework for detecting and understanding the occurrence and inheritance of genomic SV in breeding programs,and support the use of synthetic parents as an important source of novel trait variation.
基金This project was funded by the Science and Technology Basic Work,Ministry of Science and Technology,China(2013FY112100 to Q.L.)National Natural Science Foundation of China(31700175 to Z.W.)Fostering Project for Young Teachers of Zhengzhou University,China(JC21343014 to Z.W.).
文摘The genus Trigonotis comprises nearly 60 species mainly distributed in East and Southeast Asia.China has the largest number of Trigonotis species in the world,with a total of 44 species,of which 38 are endemic.Nutlet morphology is useful for the taxonomic delimitation of Trigonotis.However,there are still controversial circumscriptions of nutlet shape in some species.In previous studies,interspecies phylogenetic relationships were inferred using few DNA markers and very few taxa,which possibly led to erroneous or incomplete conclusions.In this study,the nutlet morphology of 39 Trigonotis taxa and the characteristics of 34 complete chloroplast genomes(29 taxa)were investigated and analyzed.Then,the phylogenetic relationships were discussed within this genus based on complete chloroplast genomes.To the best of our knowledge,this study is the first comprehensive analysis of nutlet morphology and complete chloroplast genome of Trigonotis.Based on nutlet morphology,Trigonotis can be divided into two groups:Group 1,hemispherical or oblique tetrahedron with carpopodiums,and Group 2,inverted tetrahedron without carpopodiums.The chloroplast genome of Trigonotis exhibited a typical quadripartite structure,including 84-86 protein-coding,37 transfer RNA,and 8 ribosomal RNA genes,with a total length of 147,247-148,986 bp.Genes in the junctions were well conserved in Trigonotis,similar to those in other Boraginaceae s.str.species.Furthermore,Trigonotis chloroplast genomes showed relatively high diversity,with more conserved genic regions than intergenic regions;in addition,we detected 14 hot spots(Pi>0.005)in non-coding regions.Phylogenetic analyses based on chloroplast genome data identified highly resolved relationships between Trigonotis species.Specifically,Trigonotis was divided into two clades with strong support:one clade included species with hemispherical or oblique tetrahedron nutlets with carpopodiums and bracts,whereas the other clade included species with inverted tetrahedron nutlets without carpopodiums or bracts.Our results may inform future taxonomic,phylogenetic,and evolutionary studies on Boraginaceae.
基金985 Foundation Grant from Peking University Health Science Center
文摘Previous research reveals that the genome structures of rhizobial type strains and reference strains can reflect their phylogenetic relationships. In order to further explore the potential application of genome structure as a phylogenetic marker in rhizobial natural taxonomy, this study analyzed the genome structures of 29 unclassified nodule bacteria isolated from the root nodules of leguminous trees, Robinia sp., Dalbergia spp., and A lbizia spp. and 7 rhizobial reference strains by I-CeuI cleavage, then clustered these bacteria phylogenetically based on their genome structures and compared these clusters with those based on numerical taxonomy and 16S rDNA PCR-RFLP. Eleven phylogenetic clusters were obtained, The clusters were in large part consistent with those based on numerical taxonomy and 16S rDNA PCR-RFLP. Also there are inconsistent clusters based on the above three methods. But results are completely consistent with 16S rRNA clusters. This suggested that the genome structure clustering method can be used to lastly identify root nodule isolates and detect their phylogenetic relationships. The credibility and repeatability of the results, together with the simplicity and possibility to analyze a large number of strains in a short time of the method, indicates the broad potential application of genome structure as phylogenetic marker to categorize rhizobial isolates and should in the future facilitate biodiversity studies.
基金Supported by the Shandong Province Science and Technology Support Program for Outstanding Youth of Colleges and Universities(No.2020KJF007)the Shandong Province Science and Technology Research Program for Colleges and Universities(No.J18KA146)+3 种基金the Yantai Foundation for Development of Science and Technology(Nos.2020LJRC120,2019CXJJ040)the Weihai Foundation for Development of Science and Technology(No.2017GNS10)the Key Special Project for Introduced Talents Team of Southern Marine Science and Engineering Guangdong Laboratory(Guangzhou)(No.GML2019ZD0407)the Guangdong Basic and Applied Basic Research Foundation(No.2019A1515110199)。
文摘Seahorses have evolved many unique biological traits,including a male brood pouch,the absence of caudal and pelvic fins,and the lack of spleen and gut-associated lymphatic tissue.The mitogenactivated protein kinases(MAPKs)are known to be involved in various important biological processes including growth,differentiation,immunity,and stress responses.Therefore,we hypothesized that the adaptive evolution and expression of the MAPK gene family in seahorse may differ from those of other teleost species.We identified positive selection sites in the erk2,erk5,jnk1,and p38αMAPK genes of the lined seahorse Hippocampus erectus and tiger-tailed seahorse Hippocampus comes.A novel expression profile of MAPK cascade genes was found in seahorse larvae during the first day after birth based on the RNA-seq data of H.erectus,which refl ected vital signs of immune response to its parental immune system.The expression patterns of the four positively selected MAPK genes were analyzed following the bacterial challenge of Vibrio fortis,revealing their upregulation pattern in brood pouch and other immune tissues.This study enriched our knowledge of the evolution of the H.erectus MAPK subfamilies,and could help better understanding the functional role of MAPKs in teleosts.
基金supported by National Key Research and Development Program of China [grant No.:2021YFC2301000]the Academic Commission of Shunyi District Center for Disease Control and Prevention,Beijing,China。
文摘Arcobacter is an emerging foodborne pathogen worldwide.In this study,the prevalence,antimicrobial susceptibility and genetic characteristics of Arcobacter from different sources were investigated.Eighteen A.butzleri isolates were obtained from 60 raw chicken meat samples(16/60,27%)and 150 patients with diarrhea(2/150,1.3%).The resistance ratios to nalidixic acid,ciprofloxacin,clindamycin,chloramphenicol,and florfenicol were 83.33%(15/18),38.89%(7/18),38.89%(7/18),33.33%(6/18)and 33.33%(6/18),respectively.We performed whole genome sequencing of the 18 isolates,and we predicted antibiotic resistance genes and virulence factors by using assembled genomes through blastx analysis.Two resistance genes,blaOXA-464 and tet(H),and the C254T mutation in gyrA,were identified in the genomes of some resistant isolates.Furthermore,virulence genes,such as flgG,flhA,flhB,fliI,fliP,motA,cadF,cjl349,ciaB,mviN,pldA and tlyA,were found in all strains,whereas hecA,hecB and iroE were found in only some strains.Phylogenetic tree analysis of A.butzleri isolates on the basis of the core-genome single nucleotide polymorphisms showed that two isolates from patients with diarrhea clustered together,separately from the isolates from raw chicken and the chicken strains.This study is the first comprehensive analysis of Arcobacter isolated in Beijing.
基金funded by the National Key R&D Program of China (2017YFD0200400)the National Basic Research Program of China (973 Program, 2013CB127604)
文摘Five genes encoding heat shock proteins(HSPs), Cchsp40, Cchsp60, Cchsp70, Cchsc70 and Cchsp90, were cloned and sequenced from Cotesia chilonis using RT-PCR and RACE. The c DNA sequences of Cchsp40, Cchsp60, Cchsp70, Cchsc70 and Cchsp90 were 1 265, 2 551, 2 094, 2 297 and 2 635 bp in length, respectively, with a molecular weight(MW) of 39.1, 60.6, 71.45, 70.19 and 82.92 k Da, respectively. The predicted amino acid sequences of these proteins showed high similarities with published HSPs of other insects in Hymenoptera. Analysis of genomic DNAs indicated that Cchsp40, Cchsp60, Cchsp70 and Cchsp90 lacked introns, but Cchsc70 contained an intron. The results also suggested that CcH SP40 in C. chilonis was the Type II HSP40, Cc HSP60 was a member of the mitochondrial HSP60 family, and Cc HSP90 was a part of cytoplasmic HSP90 A family. Expression patterns varied in the five Cchsps in response to temperature. Expression of Cchsp40 and Cchsp60 was induced significantly by cold but not heat stress. Cchsp70 and Cchsc70 showed similar response to the thermal stress and could be induced by both cold and heat, but their expression levels were consistently lower than that of Cchsp40 and Cchsp60. Cchsp90 could be induced by heat stress and mild cold, but not cold stress. In addition, the results demonstrated Cchsc70 might be constitutive and inducible protein that was expressed during normal cell functioning and also up-regulated in response to stressful stimuli while Cchsp70 was solely inducible protein induced by temperature changes. Overall, results generated from this study could significantly advance the understanding of Cchsps in response to temperature and provide important biological information for C. chilonis insects that reared under different temperatures.
基金supported by the Agricultural Science and Technology Innovation Program(ASTIP) of Chinese Academy of Agricultural Sciences and the Platform of National Crop Germplasm Resources of China(nos.2012-004 and 2013-004)
文摘The cultivated soybean(Glycine max(L.) Merr.) was distinguished from its wild progenitor Glycine soja Sieb.& Zucc.in growth period structure,by a shorter vegetative phase(V),a prolonged reproductive phase(R) and hence a larger R/V ratio.However,the genetic basis of the domestication of soybean from wild materials is unclear.Here,a panel of 123 cultivated and 97 wild accessions were genotyped using a set of 24 presence/absence variants(PAVs) while at the same time the materials were phenotyped with respect to flowering and maturity times at two trial sites located at very different latitudes.The major result of this study showed that variation at PAVs is informative for assessing patterns of genetic diversity in Glycine spp.The genotyping was largely consistent with the taxonomic status,although a few accessions were intermediate between the two major clades identified.Allelic diversity was much higher in the wild germplasm than in the cultivated materials.A significant domestication signal was detected at 11 of the PAVs at 0.01 level.In particular,this study has provided information for revealing the genetic basis of photoperiodism which was a prominent feature for the domestication of soybean.A significant marker-trait association with R/V ratio was detected at 14 of the PAVs,but stripping out population structure reduced this to three.These results will provide markers information for further finding of R/V related genes that can help to understand the domestication process and introgress novel genes in wild soybean to broaden the genetic base of modern soybean cultivars.
基金financially supported by the National Natural Science Foundation of China(No.31802331)the Natural Science Foundation of Zhejiang Province(Nos.LZ19C190001,LY20C190002)+1 种基金the Natural Science Foundation of Ningbo(No.2018A610340)the K.C.Wong Magna Fund in Ningbo University。
文摘F-type lectins(Fucolectins)are carbohydrate-binding proteins and play important roles in innate immune responses against pathogenic microbial invasion.In our previous research,we found that two homologous Fucolectin genes,AjFTL-1 and AjFTL-2,exhibited different expression profiles after lipopolysaccharides(LPS)challenge in Apostichopus japonicus.However,the transcriptional regulation mechanism of these two genes remains largely unknown.In this study,the 5'flanking regions of Aj FTL-1 and Aj FTL-2 genes were cloned and the promoter activities were studied in epithelioma papulosum cyprinid(EPC)cell system.First,in silico analysis indicated that these two promoters both contain numerous putative transcription factor binding sites including NF-κB,CREB,and CREBP1,and both contain a TATA box.Additionally,luciferase assay and progressive 5'truncation analysis revealed that AjFTL-1 and AjFTL-2 both possess high promoter activities in EPC cells.Moreover,the luciferase activity of AjFTL-1 promoter was significantly regulated by peptidoglycan(PGN)and mannan(MAN),while AjFTL-2 promoter was prominently regulated by LPS and MAN,indicating Aj FTL-1 and Aj FTL-2 genes showed different transcriptional regulation pattern under different immune stimulation.More importantly,analyses of the functional promoter regions revealed the presence of two potential NF-κB binding sites(-769 bp to-761 bp,-185 bp to-172 bp)in Aj FTL-1 and one potential binding site(-530 to-517 bp)in Aj FTL-2.Different truncated reporter vectors and expression vector co-transfection revealed that transcription factor NF-κB/Rel could significantly increase the AjFTL-2 promoter activity,but not AjFTL-1 promoter activity.These findings indicated that in marine invertebrates,different Fucolectin members differ in transcription regulations and expression patterns,and might play different roles in immune defenses during pathogen infection.
文摘In this study, by analysis of genome structures of E. coli, the relationships between the genomic types of E. coli and the associated diseases were investigated. Samples of sputum, urine and other excretions from patients with different infective diseases were collected. And 62 E. coli strains were isolated from these samples. Intact bacterial genomic DNA was cleaved with I-CeuI, separated by pulsed field gel electrophoresis and then typed on the basis of cleavage map. The results showed that 7 I-CeuI sites were found in all the genome structures of the 62 E. coli, indicating that there were 7 rrn operons in the genomes. The size of genome ranged from 4500 kb to 5000 kb. According to the genome structures, 62 E. coli strains were divided into 30 genome types. It was concluded that genome structures of E. coli isolated from the patients with different infective diseases varied to some extent, suggesting that some genome types of E. coli were closely related to some infective diseases.
基金This project was funded by the Deanship of Scientific Research(DSR),King Abdulaziz University,Jeddah,under Grant No.(D-111-611-1443)The authors,therefore,gratefully acknowledge DSR technical and financial support.
文摘In December 2019,a group of people in Wuhan city of Hubei province of China were found to be affected by an infection called dark etiology pneumonia.The outbreak of this pneumonia infection was declared a deadly disease by the China Center for Disease Control and Prevention on January 9,2020,named Novel Coronavirus 2019(nCoV-2019).This nCoV-2019 is now known as COVID-19.There is a big list of infections of this coronavirus which is present in the form of a big family.This virus can cause several diseases that usually develop with a serious problem.According to the World Health Organization(WHO),2019-nCoV has been placed as the modern generation of Severe Acute Respiratory Syndrome(SARS)and Middle East Respiratory Syndrome(MERS)coronaviruses,so COVID-19 can repeatedly change its internal genome structure to extend its existence.Understanding and accurately predicting the mutational properties of the genome structure of COVID-19 can form a good leadership role in preventing and fighting against coronavirus.In this research paper,an analytical approach has been presented which is based on the k-means cluster technique of machine learning to find the clusters over the mutational properties of the COVID-19 viruses’complete genome.This method would be able to act as a promising tool to monitor and track pathogenic infections in their stable and local genetics/hereditary varieties.This paper identifies five main clusters of mutations with k=5 as best in most cases in the coronavirus that could help scientists and researchers develop disease control vaccines for the transformation of coronaviruses.
文摘Upland cotton has the highest yield,and accounts for >95% of world cotton production.Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural,functional,and evolutionary studies of the species.Here,we employed GeneTrek and
基金supported by grants from the National Natural Science Foundation of China(31788103)the Chinese Academy of Sciences(XDB27010301)to B.H.
文摘Since the completion of the rice genome sequencing project in 2005,we have entered the era of rice genomics,which is still in its ascendancy.Rice genomics studies can be classified into three stages:structural genomics,functional genomics,and quantitative genomics.Structural genomics refers primarily to genome sequencing for the construction of a complete map of rice genome sequence.This is fundamental for rice genetics and molecular biology research.Functional genomics aims to decode the functions of rice genes.Quantitative genomics is large-scale sequence-and statistics-based research to define the quantitative traits and genetic features of rice populations.Rice genomics has been a transformative influence on rice biological research and contributes significantly to rice breeding,making rice a good model plant for studying crop sciences.
文摘In order to evaluate to what extent will genomics and in silico related technologies improve overall drug discovery process, we analyzed three studies comparing cost, time and attrition rate at each step of the drug discovery process, between standard pharmaceutical and genomics based approaches.
基金supported by the National High Technology Research and Development Program of China(863 Program)(No.2013AA102502)the National Natural Science Foundation of China(Nos.31372284 and 31402046)+1 种基金the Fund of Fok Ying-Tung Education Foundation(No.141117)the Fund for Distinguished Young Scientists of Sichuan Province(No. 2013JQ0013)
文摘Genomic structural variations (SVs), particularly insertions, deletions and inversions, can contribute to the heterogeneity of millions of nucleotides within a genome, and are likely to make an important contribution to biological diversity and phenotypic variation (Alkan et al., 2011; Bickhart and Liu, 2014). With the rapid development of the next-generation sequencing technologies and the new assembly methodolo- gies, the multiple de novo assemblies of genomes within a species allow researchers to explore more detailed SV maps (Li et al., 2011). Compared with the traditional read depth algorithm using the whole-genome resequencing approach and array-based technologies (Baker, 2012; Wang et al., 2012;
基金supported by the National Natural Science Foundation of China(No.30971844)the Fundamental Research Funds of Northwest A & F University(No. QN2011003)+1 种基金China Postdoctoral Science Foundation to Wang Junwei(No.20070410835)the Tang Zhong-Ying Breeding Funding Project of Northwest A & F University
文摘Common wheat (Triticum aestivum L.) is one of the most important crops, and intra-specific wheat hybrids have obvious heterosis in yield and protein quality. Therefore, utilization of hybrid wheat varieties offers an effective way to increase yield and nutrition. Cytoplasmic male sterility (CMS) systems are a useful genetic tool for hybrid crop breeding, and are ideal models for studying the genetic interaction and cooperative function of mitochondrial and nuclear genomes in plants (Schnable and Wise, 1998; Hanson and Bentolila, 2004).
文摘The 14-3-3 protein family is among the most extensively studied, yet still largely mysterious protein families in mammals to date. As they are well recognized for their roles in apoptosis, cell cycle regulation, and proliferation in healthy cells, aberrant 14-3-3 expression has unsurprisingly emerged as instrumentalin the development of many cancers and in prognosis. Interestingly, while the seven known 14-3-3 isoforms in humans have many similar functions across cell types, evidence of isoform-specific functions and localization has been observed in both healthy and diseased cells The strikingly high similarity among 14-3-3 isoforms has made it difficult to delineate isoform-specific functions and for isoform-specific targeting. Here, we review our knowledge of 14-3-3 interactome(s) generated by high- throughput techniques, bioinformatics, structural genomics and chemical genornics and point out that integrating the information with molecular dynamics (MD) simulations may bring us new opportunity to the design of isoform-specific inhibitors, which can not only be used as powerful research tools for delineating distinct interactomes of individual 14-3-3 isoforms, but also can serve as potential new anti-cancer drugs that selectively target aberrant 14-3-3 isoform.
基金The Z.L.laboratory is supported by a startup grant provided by Shanghai Jiao Tong University,School of Agriculture and Biology and the Shanghai Pujiang Program(20PJ1405900).
文摘Plants produce a remarkable diversity of structurally and functionally diverse natural chemicals that serve as adaptive compounds throughout their life cycles.However,unlocking this metabolic diversity is significantly impeded by the size,complexity,and abundant repetitive elements of typical plant genomes.As genome sequencing becomes routine,we anticipate that links between metabolic diversity and genetic variation will be strengthened.In addition,an ever-increasing number of plant genomes have revealed that biosynthetic gene clusters are not only a hallmark of microbes and fungi;gene clusters for various classes of compounds have also been found in plants,and many are associated with important agronomic traits.We present recent examples of plant metabolic diversification that have been discovered through the exploration and exploitation of various genomic and pan-genomic data.We also draw attention to the fundamental genomic and pan-genomic basis of plant chemodiversity and discuss challenges and future perspectives for investigating metabolic diversity in the coming pan-genomics era.
文摘Methoprene-tolerant (Met) gene has been found to be involved in juvenile hormone (JH) action in insects. Herein, we isolated a silkworm (Bombyx mori) homolog of Met gene from Drosophila melanogaster using bio-informatics analysis and rapid amplification of cDNA ends - polymerase chain reaction method, and defined it as BmMet. The full-length cDNA ofBmMet gene consists of 1 917 nucleotides and includes a 1 368 bp of open reading frame for a deduced protein of 455 amino acids. All deduced protein sequences from Met genes in B. mori and other surveyed insects contain four typical domains ofbHLH, PAS-A, PAS-B and PAC, highlighting a high sequence conservation of Met genes during insect evolution. Also, genomic structure and phylogenic analysis suggested that Met in both B. mori and Drosophila species may originate from an ancestor gene with gce, another member of bHLH-PAS family, via gene duplication. In addition, BmMet was detected in all surveyed tissues and throughout the whole life of silkworm at transcriptional levels. Furthermore, silkworm individuals with RNAi silencing of BmMet gene in the early stage of the fourth instar larvae could molt normally and pupate successfully. This result was different from the observation in T. castaneum but similar to that in D. melanogaster after Met knockdown, revealing that the action mode of Met in B. mori and D. melanogaster should be divergent with that in other insect species.
基金This work is supported by the National Basic Research Program of China (Nos. 2016YFA0100703 and 2015CB964800) and the National Natural Science Foundation of China (No. 31271354).
文摘Background: In eukaryotic genome, chromatin is not randomly distributed in cell nuclei, but instead is organized into higher-order structures. Emerging evidence indicates that these higher-order chromatin structures play important roles in regulating genome functions such as transcription and DNA replication. With the advancement in 3C (chromosome conformation capture) based technologies, Hi-C has been widely used to investigate genome-wide long- range chromatin interactions during cellular differentiation and oncogenesis. Since the first publication of Hi-C assay in 2009, lots of bioinformatic tools have been implemented for processing Hi-C data from mapping raw reads to normalizing contact matrix and high interpretation, either providing a whole workflow pipeline or focusing on a particular process. Results: This article reviews the general Hi-C data processing workflow and the currently popular Hi-C data processing tools. We highlight on how these tools are used for a full interpretation of Hi-C results. Conclusions: Hi-C assay is a powerful tool to investigate the higher-order chromatin structure. Continued development of novel methods for Hi-C data analysis will be necessary for better understanding the regulatory function of genome organization.
基金supported in part by the NSF of USA (Grant Nos. DBI-0354771, ITR-IIS-0407204, DBI-0542119, CCF0621700)NIHof USA (Grant Nos. 1R01GM075331 and 1R01GM081682)the grant for the BioEnergy Science Center
文摘This article addresses how the functionalities of the cellular machinery of a bacterium might have constrained the genomic arrangement of its genes during evolution and how we can study such problems using computational approaches, taking full advantage of the rapidly increasing pool of the sequenced bacterial genomes, potentially leading to a much improved understanding of why a bacterial genome is organized in the way it is. This article discusses a number of challenging computational problems in elucidating the genomic structures at multiple levels and the information that is encoded through these genomic structures, gearing towards the ultimate understanding of the governing rules of bacterial genome organization.