Mycoplasma ovipneumoniae, a kind of mycoplasma bacteria, commonly infects the respiratory tract causing respiratory disease in sheep and goats worldwide. Here, the complete genome sequence of M. ovipneumoniae strain N...Mycoplasma ovipneumoniae, a kind of mycoplasma bacteria, commonly infects the respiratory tract causing respiratory disease in sheep and goats worldwide. Here, the complete genome sequence of M. ovipneumoniae strain NM2010 isolated from a sheep in China was reported for the ifrst time.展开更多
Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. ~To better understand the role of the surface-associated l...Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. ~To better understand the role of the surface-associated lifestyle of C. marina and the phylogeny of the family Halomonadaceae, we sequenced the entire genome of C. marina JCM 21022 ~T using single molecule real-time sequencing technology(SMR^T) and performed comparative genomics and phylogenomics analyses. ~The circular chromosome was 4 176 300 bp with an average GC content of 62.44% and contained 3 611 predicted coding sequences, 72 t RNA genes, and 21 r RNA genes. ~The C. marina JCM 21022 ~T genome contained a set of crucial genes involved in surface colonization processes. ~The comparative genome analysis indicated the significant diff erences between C. marina JCM 21022 ~T and Cobetia amphilecti KMM 296(formerly named C. marina KMM 296) resulted from sequence insertions or deletions and chromosomal recombination. Despite these diff erences, pan and core genome analysis showed similar gene functions between the two strains. ~The phylogenomic study of the family Halomonadaceae is reported here for the first time. We found that the relationships were well resolved among every genera tested, including Chromohalobacter, Halomonas, Cobetia, Kushneria, Zymobacter, and Halotalea.展开更多
It was rarely reported about strawberry vein banding virus(SVBV)genome sequence in China and most countries worldwide.In this work,we determined the complete genome sequences of two SVBV isolates in China,designated S...It was rarely reported about strawberry vein banding virus(SVBV)genome sequence in China and most countries worldwide.In this work,we determined the complete genome sequences of two SVBV isolates in China,designated SVBV-AH and SVBV-BJ,that were obtained from naturally infected strawberry samples from Anhui province and Beijing city of China,respectively.The complete genomes of SVBV-AH and SVBV-BJ were 7,862 nucleotides(nts)and 7,863 nts long,respectively,and both constituted with seven genes typical of the caulimoviruses.Alignment of complete nucleotide sequences showed that SVBV-AH and SVBV-BJ shared a significant nucleotide sequence identity of 97.7%of each other and had 85.7%and 86.0%sequence identity related to SVBV from the United States(SVBV-US),respectively.Phylogenetic trees,based on the alignment of complete nucleotide sequences and amino acid sequences of Coat Protein(CP),both showed that SVBV-AH and SVBV-BJ clustered into one branch with all the other SVBV isolates,and other species of caulimoviruses clustered into another tree branch.It illustrated that all the SVBV isolates had an extremely high relationship but had a distant relationship with other species of caulimoviruses.We further confirmed that SVBV-AH infectious clone could cause similar symptoms to SVBVinfected in strawberry under natural conditions.Taken together,our study provided valuable information to elucidate the origin and dissemination of SVBV Chinese isolates,meanwhile providing the necessary vector for studying the gene functions of strawberry.展开更多
Aerobic anoxygenic phototrophic bacteria(AAPB)are photoheterotrophic prokaryotes able to use both light and dissolved organic matter as energy sources.Roseicitreum antarcticum ZS2-28^(T)was isolated from intertidal se...Aerobic anoxygenic phototrophic bacteria(AAPB)are photoheterotrophic prokaryotes able to use both light and dissolved organic matter as energy sources.Roseicitreum antarcticum ZS2-28^(T)was isolated from intertidal sediment in the Larsemann Hills,Princess Elizabeth Land,Antarctica,and was able to produce bacteriochlorophyll a.It is the type strain of the sole species within the genus Roseicitreum.The complete genome sequence of the bacterium was determined using Illumina HiSeq X and PacBio RSII systems.The genome of R.antarcticum ZS2-28^(T)was 4253095 bp and consisted of one chromosome and four plasmids.A number of genes related to the bacteriochlorophyll a production,photosynthetic reaction,cold adaptation,salt adaptation,ultra-violet resistance and DNA damage repairing were found in the genome.In addition to genomic islands and typeⅣsecretion systems,genes related to gene transfer agents were detected in the genome of R.antarcticum ZS2-28^(T),suggesting that this bacterium can adapt to its environment by acquiring exogenous nucleic acids.The annotated complete genome sequence provides genetic insights into the environmental adaptation and ecological function of R.antarcticum ZS2-28^(T)in Antarctic coastal area.展开更多
[ Objective] This study aimed to investigate the molecular characteristics of porcine circovirus type 2 (PCV2) strains isolated from Hainan Province. [ Method] The complete genome of PCV2 was amplified from PMWS-sus...[ Objective] This study aimed to investigate the molecular characteristics of porcine circovirus type 2 (PCV2) strains isolated from Hainan Province. [ Method] The complete genome of PCV2 was amplified from PMWS-suspected samples by PCR for sequence analysis. [ Result] A total of eight PCV2 strains were isolated and identified. All the eight isolates belonged to genotype PCV2b, among which seven isolates belonged to subgenotype PCV2b-1 C, and one isolate be- longed to subgenotype PCV2b-IA/1B. ORF2 gene of PCV2 isolates from Hainan Province was 705 bp in length, encoding 234 amino acids. Antigenic epitopes of Cap protein exhibited certain changes. Nucleotide sequences and deduced amino acid sequences of ORF2 gene shared 95.3% -99.7% and 93.6% - 100% simi- larities among eight PCV2 isolates from Hainan Province, respectively. Moreover, nucleotide sequences and deduced amino acid sequences of ORF2 gene of PCV2 isolates from Hainan Province shared 91.0% -99.9% and 91.0% -99.6% similarities with other PCV2 strain isolated from China (AY682994, AF381175, JX945577, JX682407, AY180397 ), respectively; nucleotide sequences and deduced amino acid sequences of ORF2 gene of PCV2 isolates from Hainan Province shared 90.0% - 97.0% and 88.0% -97.9% similarities with PCV2 isolates from other countries ( NC_005148, JQ994268, KJ187306, AF201307, AF454546, AY"/Sd020), respectively; nucleotide sequences and deduced amino acid sequences of ORF2 gene of PCV2 isolates from Hainan Province shared 98.2% -100% and 94.9% -100% similarities with vaccine strain SH, respectively; nucleotide sequences and deduced amino acid sequences of ORF2 gene of PCV2 isolates from Hainan Province shared 90.6% -91.7% and 89.7% - 91.0% similarities with vaccine strain LG, respectively. [ Conclusion] This study provided theoretical hasis for the prevention and control of PCV2 and selection of vaccine strains in Hainan Province.展开更多
Dear Editor,Pathogenic Escherichia coli cause chicken colibacillosis,which is economically devastating to the poultry industry worldwide(Bagheri et al.,2014).Owing to increasing antibiotic resistance,phage therapy rea...Dear Editor,Pathogenic Escherichia coli cause chicken colibacillosis,which is economically devastating to the poultry industry worldwide(Bagheri et al.,2014).Owing to increasing antibiotic resistance,phage therapy reagents have been developed to treat bacterial infections(Xu et al.,2015).Coliphage N4 is the first reported phage in the'N4-like viruses'genus and the only member recognized展开更多
Enterococci bacteria are important in environmental, food and clinical microbiology. Enterococcus faecium is a nosocomial pathogen that causes bacteremia, endocarditis and other infections. It is among the most preval...Enterococci bacteria are important in environmental, food and clinical microbiology. Enterococcus faecium is a nosocomial pathogen that causes bacteremia, endocarditis and other infections. It is among the most prevalent organisms encountered in hospital-associated infections accounting for approximately 12% of nosocomial infections in the USA (Linden and Miller, 1999). However, certain strains of E. faecium are not only non-pathogenic but also have beneficial effects on human health with probiotic potential. For example, E. faecium T-110 is a consortium member in several probiotic products including BIO-THREE~ which is widely prescribed for human, animal and aqua-cultural use. This strain was originally developed by TOA Pharmaceuticals in Japan, and later used in the probiotic products of several other companies.展开更多
DEAR EDITOR,In this study, we sequenced the complete mitochondrial genome (mitogenome) of the Thai Red Junglefowl (RJF; Gallus gallus) using the next-generation sequencing (NGS) platform of the Ion Torrent PGM. ...DEAR EDITOR,In this study, we sequenced the complete mitochondrial genome (mitogenome) of the Thai Red Junglefowl (RJF; Gallus gallus) using the next-generation sequencing (NGS) platform of the Ion Torrent PGM. Samples were taken from Mae Wang District, Chiang Mai Province, northern Thailand Our data showed the complete mitogenome to be 16 785 bp in length, composed by 13 protein-coding genes, 22 tRNA genes, two rRNA genes, and one control region. The genome nucleotide composition was 30.3% A, 23.7% T, 32.5% C, and 13.5% G, resulting in a high percentage of A+T (50.4%). Phylogenetic analysis revealed that the mitogenome belonged to haplogroup X, whereas those of all domestic chickens belong to haplogroups A to G. This newly released mitogenome sequence will advance further evolutionary and population genetics study of the RJF and domestic chicken The availability of the G. gallus mitogenome will also contribute to further conservation genetics research of a unique species, listed as 'data deficient' in Thailand.展开更多
Hog deer(Axis porcinus)is a small mammal and listed in the International Union for Conservation of Nature.However,phylogenetic position of hog deer within Axis genus has remained controversial.In the present study,we ...Hog deer(Axis porcinus)is a small mammal and listed in the International Union for Conservation of Nature.However,phylogenetic position of hog deer within Axis genus has remained controversial.In the present study,we first assembled complete mitochondrial genome of Chinese hog deer reared in Chengdu Zoo,Sichuan,by the second-generation sequencing technology.This newly assembled mitochondrial genome of hog deer is 16376 bp in length and consists of 13 protein-encoding genes,23 transfer RNA genes and 2 ribosomal RNA genes.Phylogenetic analyses based on complete mitochondrial genome and cytochrome b gene sequences revealed that hog deer is closely clustered together and placed with sister taxon of spotted deer(A.Axis),which therefore supported monophyletic statue of Axis genus.Furthermore,considerable genetic differentiation,up to 139 mutations of complete mitochondrial genome was revealed between geographical populations of hog deer in France and Southeast Asia.However,only six variable sites(nucleotide diversity of 0.00007)and four haplotypes(haplotype diversity of 0.533)were totally detected among ten newly sequenced Chinese hog deer.The results provide a better understanding on the phylogeny of hog deer.展开更多
Avian infectious bronchitis(IB)is a highly contagious infectious disease caused by infectious bronchitis virus(IBV),which is prevalent in many countries worldwide and causes serious harm to the poultry industry.At pre...Avian infectious bronchitis(IB)is a highly contagious infectious disease caused by infectious bronchitis virus(IBV),which is prevalent in many countries worldwide and causes serious harm to the poultry industry.At present,many commercial IBV vaccines have been used for the prevention and control of IB;however,IB outbreaks occur frequently.In this study,two new strains of IBV,SX/2106 and SX/2204,were isolated from two flocks which were immunized with IBV H120 vaccine in central China.Phylogenetic and recombination analysis indicated that SX/2106,which was clustered into the GI-19 lineage,may be derived from recombination events of the GI-19 and GI-7 strains and the LDT3-A vaccine.Genetic analysis showed that SX/2204 belongs to the GVI-1 lineage,which may have originated from the recombination of the GI-13 and GVI-1 strains and the H120 vaccine.The virus cross-neutralization test showed that the antigenicity of SX/2106 and SX/2204 was different from H120.Animal experiments found that both SX/2106 and SX/2204 could replicate effectively in the lungs and kidneys of chickens and cause disease and death,and H120 immunization could not provide effective protection against the two IBV isolates.It is noteworthy that the pathogenicity of SX/2204 has significantly increased compared to the GVI-1 strains isolated previously,with a mortality rate up to 60%.Considering the continuous mutation and recombination of the IBV genome to produce new variant strains,it is important to continuously monitor epidemic strains and develop new vaccines for the prevention and control of IBV epidemics.展开更多
We reported here the complete genome of the Bombyx mori infectious flacherie virus (BmlFV), BmlFV-CHN01, isolated in China and compared it with the Japanese strain IFV (BmIFV-JAN) sequence. BmIFV-JAN and BmIFV-CHN...We reported here the complete genome of the Bombyx mori infectious flacherie virus (BmlFV), BmlFV-CHN01, isolated in China and compared it with the Japanese strain IFV (BmIFV-JAN) sequence. BmIFV-JAN and BmIFV-CHN01 were 99% identical in nucleotide and amino acid sequences. The amino acid sequences of the three structural proteins (VP1, VP3 and VP4) and L protein were 100% identical between the two strains. Phylogenetic analyses based on conserved domains in RNA-dependent RNA polymerases (RdRps) by the neighbor-joining method showed that these two strains were from the same virus. The gain of the whole genome of the BmIFV isolated in China and its weak mutation character established a great foundation to the study of functional genome and the molecular epidemiology of BmIFV.展开更多
Fabaceae is a large family of angiosperms with high biodiversity that contains a variety of economically important crops and model plants for the study of biological nitrogen fixation.Polyploidization events have been...Fabaceae is a large family of angiosperms with high biodiversity that contains a variety of economically important crops and model plants for the study of biological nitrogen fixation.Polyploidization events have been extensively studied in some Fabaceae plants,but the occurrence of new genes is still concealed,owing to a lack of genomic information on certain species of the basal clade of Fabaceae.Cercis chinensis(Cercidoideae)is one such species;it diverged earliest from Fabaceae and is essential for phylogenomic studies and new gene predictions in Fabaceae.To facilitate genomic studies on Fabaceae,we performed genome sequencing of C.chinensis and obtained a 352.84 Mb genome,which was further assembled into seven pseudochromosomes with 30612 predicted protein-coding genes.Compared with other legume genomes,that of C.chinensis exhibits no lineage-specific polyploidization event.Further phylogenomic analyses of 22 legumes and 11 other angiosperms revealed that many gene families are lineage specific before and after the diversification of Fabaceae.Among them,dozens of genes are candidates for new genes that have evolved from intergenic regions and are thus regarded as de novo-originated genes.They differ significantly from established genes in coding sequence length,exon number,guanine–cytosine content,and expression patterns among tissues.Functional analysis revealed that many new genes are related to asparagine metabolism.This study represents an important advance in understanding the evolutionary pattern of new genes in legumes and provides a valuable resource for plant phylogenomic studies.展开更多
Strawberry mottle virus(SMoV)is one of the most common viruses infecting strawberries,causing losses to fruit yield and quality.In this study,165 strawberry leaf samples were collected from six provinces of China,46 o...Strawberry mottle virus(SMoV)is one of the most common viruses infecting strawberries,causing losses to fruit yield and quality.In this study,165 strawberry leaf samples were collected from six provinces of China,46 of which tested positive for SMoV.The complete genome sequences of 11 SMoV isolates were obtained from Liaoning(DGHY3,DGHY16-2,DGHY17,DGHY20-2,DGHY21,DGHY26-2),Shandong(SDHY1,SDHY5,SDHY31-2,SDHY33-2),and Beijing(BJMX7).The RNA1 and RNA2 nucleotide identities between the 11 Chinese isolates were 95.4-99.3%and 96.3-99.6%,respectively,and they shared 78.4-96.6%and 84.8-93.5%identities with the available SMoV isolates in GenBank.Recombination analysis revealed that Chinese isolate SDHY33-2 and Canadian isolates Ontario and Simcoe were recombinants,and recombination events frequently occurred in the 3’UTR of SMoV.Phylogenetic analysis showed that in an RNA1 tree,most Chinese isolates clustered into the same group while isolate DGHY17 clustered into another group together with Czech isolate C and three Canadian isolates.In an RNA2 tree,all Chinese isolates clustered into a single group.The phylogenetic analysis based on nucleotide sequences was consistent with the results based on coat protein(CP)and RNA-dependent RNA polymerase(RdRp).Further evolutionary analysis indicated that negative selection drives SMoV evolution,and gene flow plays a major role in genetic differentiation.Additionally,reassortment and recombination also influence the evolution of SMoV.To our knowledge,this is the first report of the complete genome of SMoV isolates from China and a detailed analysis of the SMoV population structure.展开更多
To determine the genomic sequence of a duck hepatitis virus type 1 (DHV-1) strain, real-time quantitative polymerase chain reaction (RTQ-PCR) assay based on SYBR Green I technology was developed to target 3D gene ...To determine the genomic sequence of a duck hepatitis virus type 1 (DHV-1) strain, real-time quantitative polymerase chain reaction (RTQ-PCR) assay based on SYBR Green I technology was developed to target 3D gene of DHV-1, Comparative sequence analysis showed that the genome has a typical picornarivus genetic organization, and strain DHV-1 R genetic organaiztion is 5' untranslated region (UTR)-VP0-VP3-VPI-2A1-2A2-2B-2C-3A-3B-3C-3D-3' UTR, DHV-1 R has close relationship with Parechovirus, and has 95.1-99.1% nucleotide sequence identity with other DHV-1 strains. Based on the DHV-1 sequences in GenBank, three pairs of specific primers were designed to amplify DHV-1 using real-time PCR. The results showed that real-time PCR Tm value is 85.6℃ and the real-time PCR provides a broad dynamic range, detecting from 102 to 109 copies of DHV-1 cDNA per reaction. No cross-reactions were found in specimens containing DPV, AIV and NDV. It is concluded that DHV-1 belongs to a new group of the family Picornaviridae that may form a separate genus most closely related to the genus Parechovirus. All results showed that the real-time PCR has high sensitivity and specificity to detect DHV-1 using SYBR Green I dissociation curve analysis, isolates can be distinguished by their melting temperature. These methods are rapid, sensitive, and reliable, and can be readily adapted for detection of DHV-1 from other clinical samples.展开更多
The codon usage bias and the base compositions in the available 49 complete fish rhabdovirus genome sequences were investigated. The high correlation between GC12% and GC3% suggested that mutational pressure rather th...The codon usage bias and the base compositions in the available 49 complete fish rhabdovirus genome sequences were investigated. The high correlation between GC12% and GC3% suggested that mutational pressure rather than natural selection was the main factor determining the codon usage and base component in fish rhabdovirns, most of the spots lying below the expected curve ( ENC vs GC3% ) also suggested that codon usage bias in these 49 genomes was influenced by mutational pressure in codon usage pattern. Through PCA analysis based on RSCU, all ORFs in the 49 fish rhabdoviurs genomes were clustered into two parts, and this result showed that hosts played a role in codon usage pattern of fish rhabdovirns genomes. Further, comparison between SVCV, INHV, VHSV and their hosts suggested the effect of natural host on the codon usage pattern took part in the evolution process of fish rhabdovirns. This study represents the most comprehensive analysis to date for fish rhabdovirus eodon usage patterns.展开更多
Pyrrosia petiolosa,Pyrrosia lingua and Pyrrosia sheareri are recorded as original plants of Pyrrosiae Folium(PF)and commonly used as Chinese herbal medicines.Due to the similar morphological features of PF and its adu...Pyrrosia petiolosa,Pyrrosia lingua and Pyrrosia sheareri are recorded as original plants of Pyrrosiae Folium(PF)and commonly used as Chinese herbal medicines.Due to the similar morphological features of PF and its adulterants,common DNA barcodes cannot accurately distinguish PF species.Knowledge of the chloroplast(cp)genome is widely used in species identification,molecular marker and phylogenetic analyses.Herein,we determined the complete cp genomes of three original species of PF via highthroughput sequencing technologies.The three cp genomes exhibited a typical quadripartite structure with sizes ranging from 158165 to 163026 bp.The cp genomes of P.petiolosa and P.lingua encoded 130 genes,whilst that of P.sheareri encoded 131 genes.The complete cp genomes were compared,and five highly divergent regions of pet A-psb J,mat K-rps16,ndh C-trn M,psb M-pet N and psa Cndh E were screened as potential DNA barcodes for identification of Pyrrosia genus species.The phylogenetic tree we obtained indicated that P.petiolosa and P.lingua are clustered in a single clade and,thus,share a close relationship.This study provides invaluable information for further studies on the species identification,taxonomy and phylogeny of Pyrrosia genus species.展开更多
Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2),the pathogen responsible for coronavirus disease 2019(COVID-19),continues to evolve,giving rise to more variants and global reinfections.Previous research ha...Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2),the pathogen responsible for coronavirus disease 2019(COVID-19),continues to evolve,giving rise to more variants and global reinfections.Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations.In this study,we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples,including human coronaviruses(HCoVs)and SARSr-CoV-2 lineages.Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets,encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages.Through genetic-level species testing,we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2.Subsequently,75 main and subordinate species-specific barcode segments for SARS-CoV-2,located in ORF1ab,S,E,ORF7a,and N coding sequences,were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores.Post-testing,these segments exhibited high recall rates(nearly 100%),specificity(almost 30%at the nucleotide level),and precision(100%)performance on identification.They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database(http://virusbarcodedatabase.top/).The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis.Moreover,this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.展开更多
of complete genome sequences submitted directly from sequencing projects are diverse in terms of annotation strategies and update frequencies. These inconsistencies make comparative studies difficult. To allow rapid d...of complete genome sequences submitted directly from sequencing projects are diverse in terms of annotation strategies and update frequencies. These inconsistencies make comparative studies difficult. To allow rapid data preparation of a large number of complete genomes, automation and speed are important for genome re-annotation. Here we introduce an open-source rapid genome re-annotation software system, Restauro-G, specialized for bacterial genomes. Restauro-G re-annotates a genome by similarity searches utilizing the BLASTLike Alignment Tool, referring to protein databases such as UniProt KB, NCBI nr, NCBI COGs, Pfam, and PSORTb. Re-annotation by Restauro-G achieved over 98% accuracy for most bacterial chromosomes in comparison with the original manually curated annotation of EMBL releases. Restauro-G was developed in the generic bioinformatics workbench G-language Genome Analysis Environment and is distributed at http://restauro-g.iab.keio.ac.jp/ under the GNU General Public License.展开更多
The complete genome of methanol-utilizing Amycolatopsis methanolica strain 239T was generated,revealing a single 7,237,391 nucleotide circular chromosome with 7074 annotated protein-coding sequences(CDSs).Comparative ...The complete genome of methanol-utilizing Amycolatopsis methanolica strain 239T was generated,revealing a single 7,237,391 nucleotide circular chromosome with 7074 annotated protein-coding sequences(CDSs).Comparative analyses against the complete genome sequences of Amycolatopsis japonica strain MG417-CF17T,Amycolatopsis mediterranei strain U32 and Amycolatopsis orientalis strain HCCB10007 revealed a broad spectrum of genomic structures,including various genome sizes,core/quasi-core/non-core configurations and different kinds of episomes.Although polyketide synthase gene clusters were absent from the A.methanolica genome,12 gene clusters related to the biosynthesis of other specialized(secondary)metabolites were identified.Complete pathways attributable to the facultative methylotrophic physiology of A.methanolica strain 239T,including both the mdo/mscR encoded methanol oxidation and the hps/hpi encoded formaldehyde assimilation via the ribulose monophosphate cycle,were identified together with evidence that the latter might be the result of horizontal gene transfer.Phylogenetic analyses based on 16S rDNA or orthologues of AMETH_3452,a novel actinobacterial class-specific conserved gene against 62 or 18 Amycolatopsis type strains,respectively,revealed three major phyletic lineages,namely the mesophilic or moderately thermophilic A.orientalis subclade(AOS),the mesophilic Amycolatopsis taiwanensis subclade(ATS)and the thermophilic A.methanolica subclade(AMS).The distinct growth temperatures of members of the subclades correlated with corresponding genetic variations in their encoded compatible solutes.This study shows the value of integrating conventional taxonomic with whole genome sequence data.展开更多
基金supported by the Nationai Key Technology R&D Program of China (2011BAD18B01)
文摘Mycoplasma ovipneumoniae, a kind of mycoplasma bacteria, commonly infects the respiratory tract causing respiratory disease in sheep and goats worldwide. Here, the complete genome sequence of M. ovipneumoniae strain NM2010 isolated from a sheep in China was reported for the ifrst time.
基金Supported by the National Natural Science Foundation of China(Nos.41006082,31372517)
文摘Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. ~To better understand the role of the surface-associated lifestyle of C. marina and the phylogeny of the family Halomonadaceae, we sequenced the entire genome of C. marina JCM 21022 ~T using single molecule real-time sequencing technology(SMR^T) and performed comparative genomics and phylogenomics analyses. ~The circular chromosome was 4 176 300 bp with an average GC content of 62.44% and contained 3 611 predicted coding sequences, 72 t RNA genes, and 21 r RNA genes. ~The C. marina JCM 21022 ~T genome contained a set of crucial genes involved in surface colonization processes. ~The comparative genome analysis indicated the significant diff erences between C. marina JCM 21022 ~T and Cobetia amphilecti KMM 296(formerly named C. marina KMM 296) resulted from sequence insertions or deletions and chromosomal recombination. Despite these diff erences, pan and core genome analysis showed similar gene functions between the two strains. ~The phylogenomic study of the family Halomonadaceae is reported here for the first time. We found that the relationships were well resolved among every genera tested, including Chromohalobacter, Halomonas, Cobetia, Kushneria, Zymobacter, and Halotalea.
基金the National Natural Science Foundation of China(Nos.32072386 and 31801700)the Key Research and Development Project of Anhui Province(202004a06020013)the Anhui Postdoctoral Fund(2019B360).
文摘It was rarely reported about strawberry vein banding virus(SVBV)genome sequence in China and most countries worldwide.In this work,we determined the complete genome sequences of two SVBV isolates in China,designated SVBV-AH and SVBV-BJ,that were obtained from naturally infected strawberry samples from Anhui province and Beijing city of China,respectively.The complete genomes of SVBV-AH and SVBV-BJ were 7,862 nucleotides(nts)and 7,863 nts long,respectively,and both constituted with seven genes typical of the caulimoviruses.Alignment of complete nucleotide sequences showed that SVBV-AH and SVBV-BJ shared a significant nucleotide sequence identity of 97.7%of each other and had 85.7%and 86.0%sequence identity related to SVBV from the United States(SVBV-US),respectively.Phylogenetic trees,based on the alignment of complete nucleotide sequences and amino acid sequences of Coat Protein(CP),both showed that SVBV-AH and SVBV-BJ clustered into one branch with all the other SVBV isolates,and other species of caulimoviruses clustered into another tree branch.It illustrated that all the SVBV isolates had an extremely high relationship but had a distant relationship with other species of caulimoviruses.We further confirmed that SVBV-AH infectious clone could cause similar symptoms to SVBVinfected in strawberry under natural conditions.Taken together,our study provided valuable information to elucidate the origin and dissemination of SVBV Chinese isolates,meanwhile providing the necessary vector for studying the gene functions of strawberry.
基金the National Key R&D Program of China(Grant no.2018YFC1406903)the National Natural Science Foundation of China(Grant nos.91851201 and 41476171).
文摘Aerobic anoxygenic phototrophic bacteria(AAPB)are photoheterotrophic prokaryotes able to use both light and dissolved organic matter as energy sources.Roseicitreum antarcticum ZS2-28^(T)was isolated from intertidal sediment in the Larsemann Hills,Princess Elizabeth Land,Antarctica,and was able to produce bacteriochlorophyll a.It is the type strain of the sole species within the genus Roseicitreum.The complete genome sequence of the bacterium was determined using Illumina HiSeq X and PacBio RSII systems.The genome of R.antarcticum ZS2-28^(T)was 4253095 bp and consisted of one chromosome and four plasmids.A number of genes related to the bacteriochlorophyll a production,photosynthetic reaction,cold adaptation,salt adaptation,ultra-violet resistance and DNA damage repairing were found in the genome.In addition to genomic islands and typeⅣsecretion systems,genes related to gene transfer agents were detected in the genome of R.antarcticum ZS2-28^(T),suggesting that this bacterium can adapt to its environment by acquiring exogenous nucleic acids.The annotated complete genome sequence provides genetic insights into the environmental adaptation and ecological function of R.antarcticum ZS2-28^(T)in Antarctic coastal area.
基金Supported by Natural Science Foundation of Hainan Province(Grant No.313100)
文摘[ Objective] This study aimed to investigate the molecular characteristics of porcine circovirus type 2 (PCV2) strains isolated from Hainan Province. [ Method] The complete genome of PCV2 was amplified from PMWS-suspected samples by PCR for sequence analysis. [ Result] A total of eight PCV2 strains were isolated and identified. All the eight isolates belonged to genotype PCV2b, among which seven isolates belonged to subgenotype PCV2b-1 C, and one isolate be- longed to subgenotype PCV2b-IA/1B. ORF2 gene of PCV2 isolates from Hainan Province was 705 bp in length, encoding 234 amino acids. Antigenic epitopes of Cap protein exhibited certain changes. Nucleotide sequences and deduced amino acid sequences of ORF2 gene shared 95.3% -99.7% and 93.6% - 100% simi- larities among eight PCV2 isolates from Hainan Province, respectively. Moreover, nucleotide sequences and deduced amino acid sequences of ORF2 gene of PCV2 isolates from Hainan Province shared 91.0% -99.9% and 91.0% -99.6% similarities with other PCV2 strain isolated from China (AY682994, AF381175, JX945577, JX682407, AY180397 ), respectively; nucleotide sequences and deduced amino acid sequences of ORF2 gene of PCV2 isolates from Hainan Province shared 90.0% - 97.0% and 88.0% -97.9% similarities with PCV2 isolates from other countries ( NC_005148, JQ994268, KJ187306, AF201307, AF454546, AY"/Sd020), respectively; nucleotide sequences and deduced amino acid sequences of ORF2 gene of PCV2 isolates from Hainan Province shared 98.2% -100% and 94.9% -100% similarities with vaccine strain SH, respectively; nucleotide sequences and deduced amino acid sequences of ORF2 gene of PCV2 isolates from Hainan Province shared 90.6% -91.7% and 89.7% - 91.0% similarities with vaccine strain LG, respectively. [ Conclusion] This study provided theoretical hasis for the prevention and control of PCV2 and selection of vaccine strains in Hainan Province.
基金supported by grants from the Nature Science Foundation of Shandong Province of China (grant nos.ZR2013CQ024 and ZR2015CM020)
文摘Dear Editor,Pathogenic Escherichia coli cause chicken colibacillosis,which is economically devastating to the poultry industry worldwide(Bagheri et al.,2014).Owing to increasing antibiotic resistance,phage therapy reagents have been developed to treat bacterial infections(Xu et al.,2015).Coliphage N4 is the first reported phage in the'N4-like viruses'genus and the only member recognized
文摘Enterococci bacteria are important in environmental, food and clinical microbiology. Enterococcus faecium is a nosocomial pathogen that causes bacteremia, endocarditis and other infections. It is among the most prevalent organisms encountered in hospital-associated infections accounting for approximately 12% of nosocomial infections in the USA (Linden and Miller, 1999). However, certain strains of E. faecium are not only non-pathogenic but also have beneficial effects on human health with probiotic potential. For example, E. faecium T-110 is a consortium member in several probiotic products including BIO-THREE~ which is widely prescribed for human, animal and aqua-cultural use. This strain was originally developed by TOA Pharmaceuticals in Japan, and later used in the probiotic products of several other companies.
基金supported by the National Key Research and Development Program(2016YFC1200705)Southeast Asia Biodiversity Research Institute,Chinese Academy of Sciences(Y4ZK111B01:2015CASEABR1002)+2 种基金Ministry of Science and Technology of China(2012FY110800)Chinese Academy of Sciences(KFZD-SW-208)Yunnan Provincial Science and Technology Department
文摘DEAR EDITOR,In this study, we sequenced the complete mitochondrial genome (mitogenome) of the Thai Red Junglefowl (RJF; Gallus gallus) using the next-generation sequencing (NGS) platform of the Ion Torrent PGM. Samples were taken from Mae Wang District, Chiang Mai Province, northern Thailand Our data showed the complete mitogenome to be 16 785 bp in length, composed by 13 protein-coding genes, 22 tRNA genes, two rRNA genes, and one control region. The genome nucleotide composition was 30.3% A, 23.7% T, 32.5% C, and 13.5% G, resulting in a high percentage of A+T (50.4%). Phylogenetic analysis revealed that the mitogenome belonged to haplogroup X, whereas those of all domestic chickens belong to haplogroups A to G. This newly released mitogenome sequence will advance further evolutionary and population genetics study of the RJF and domestic chicken The availability of the G. gallus mitogenome will also contribute to further conservation genetics research of a unique species, listed as 'data deficient' in Thailand.
基金The Chengdu Giant Panda Breeding Research(CPF2017-07)。
文摘Hog deer(Axis porcinus)is a small mammal and listed in the International Union for Conservation of Nature.However,phylogenetic position of hog deer within Axis genus has remained controversial.In the present study,we first assembled complete mitochondrial genome of Chinese hog deer reared in Chengdu Zoo,Sichuan,by the second-generation sequencing technology.This newly assembled mitochondrial genome of hog deer is 16376 bp in length and consists of 13 protein-encoding genes,23 transfer RNA genes and 2 ribosomal RNA genes.Phylogenetic analyses based on complete mitochondrial genome and cytochrome b gene sequences revealed that hog deer is closely clustered together and placed with sister taxon of spotted deer(A.Axis),which therefore supported monophyletic statue of Axis genus.Furthermore,considerable genetic differentiation,up to 139 mutations of complete mitochondrial genome was revealed between geographical populations of hog deer in France and Southeast Asia.However,only six variable sites(nucleotide diversity of 0.00007)and four haplotypes(haplotype diversity of 0.533)were totally detected among ten newly sequenced Chinese hog deer.The results provide a better understanding on the phylogeny of hog deer.
基金supported by the National Natural Science Foundation of China(32202788)the Special Research Fund of Shanxi Agricultural University for High-level Talents,China(2021XG004)+3 种基金the Fund for Shanxi“1331 Project”,China(20211331-13)the Shanxi Province Excellent Doctoral Work Award-Scientific Research Project,China(SXBYKY2021063,SXBYKY2021005,and SXBYKY 2022014)the earmarked fund for Modern Agro-industry Technology Research System of Shanxi Province,China(2023CYJSTX15-13)the Fundamental Research Program of Shanxi Province,China(202103021224156)。
文摘Avian infectious bronchitis(IB)is a highly contagious infectious disease caused by infectious bronchitis virus(IBV),which is prevalent in many countries worldwide and causes serious harm to the poultry industry.At present,many commercial IBV vaccines have been used for the prevention and control of IB;however,IB outbreaks occur frequently.In this study,two new strains of IBV,SX/2106 and SX/2204,were isolated from two flocks which were immunized with IBV H120 vaccine in central China.Phylogenetic and recombination analysis indicated that SX/2106,which was clustered into the GI-19 lineage,may be derived from recombination events of the GI-19 and GI-7 strains and the LDT3-A vaccine.Genetic analysis showed that SX/2204 belongs to the GVI-1 lineage,which may have originated from the recombination of the GI-13 and GVI-1 strains and the H120 vaccine.The virus cross-neutralization test showed that the antigenicity of SX/2106 and SX/2204 was different from H120.Animal experiments found that both SX/2106 and SX/2204 could replicate effectively in the lungs and kidneys of chickens and cause disease and death,and H120 immunization could not provide effective protection against the two IBV isolates.It is noteworthy that the pathogenicity of SX/2204 has significantly increased compared to the GVI-1 strains isolated previously,with a mortality rate up to 60%.Considering the continuous mutation and recombination of the IBV genome to produce new variant strains,it is important to continuously monitor epidemic strains and develop new vaccines for the prevention and control of IBV epidemics.
基金supported by the National 863 Program of China (2006AA10A119)the Key Project of Science and Technology Commission of Zhejiang Province,China (2003C22013)
文摘We reported here the complete genome of the Bombyx mori infectious flacherie virus (BmlFV), BmlFV-CHN01, isolated in China and compared it with the Japanese strain IFV (BmIFV-JAN) sequence. BmIFV-JAN and BmIFV-CHN01 were 99% identical in nucleotide and amino acid sequences. The amino acid sequences of the three structural proteins (VP1, VP3 and VP4) and L protein were 100% identical between the two strains. Phylogenetic analyses based on conserved domains in RNA-dependent RNA polymerases (RdRps) by the neighbor-joining method showed that these two strains were from the same virus. The gain of the whole genome of the BmIFV isolated in China and its weak mutation character established a great foundation to the study of functional genome and the molecular epidemiology of BmIFV.
基金supported by grants from the National Natural Science Foundation of China(grant number 32070247 to J.Q.)funds from the State Key Laboratory of Genetic Engineering at Fudan University,China.
文摘Fabaceae is a large family of angiosperms with high biodiversity that contains a variety of economically important crops and model plants for the study of biological nitrogen fixation.Polyploidization events have been extensively studied in some Fabaceae plants,but the occurrence of new genes is still concealed,owing to a lack of genomic information on certain species of the basal clade of Fabaceae.Cercis chinensis(Cercidoideae)is one such species;it diverged earliest from Fabaceae and is essential for phylogenomic studies and new gene predictions in Fabaceae.To facilitate genomic studies on Fabaceae,we performed genome sequencing of C.chinensis and obtained a 352.84 Mb genome,which was further assembled into seven pseudochromosomes with 30612 predicted protein-coding genes.Compared with other legume genomes,that of C.chinensis exhibits no lineage-specific polyploidization event.Further phylogenomic analyses of 22 legumes and 11 other angiosperms revealed that many gene families are lineage specific before and after the diversification of Fabaceae.Among them,dozens of genes are candidates for new genes that have evolved from intergenic regions and are thus regarded as de novo-originated genes.They differ significantly from established genes in coding sequence length,exon number,guanine–cytosine content,and expression patterns among tissues.Functional analysis revealed that many new genes are related to asparagine metabolism.This study represents an important advance in understanding the evolutionary pattern of new genes in legumes and provides a valuable resource for plant phylogenomic studies.
基金funded by the National Key R&D Program of China(2019YFD1001800).
文摘Strawberry mottle virus(SMoV)is one of the most common viruses infecting strawberries,causing losses to fruit yield and quality.In this study,165 strawberry leaf samples were collected from six provinces of China,46 of which tested positive for SMoV.The complete genome sequences of 11 SMoV isolates were obtained from Liaoning(DGHY3,DGHY16-2,DGHY17,DGHY20-2,DGHY21,DGHY26-2),Shandong(SDHY1,SDHY5,SDHY31-2,SDHY33-2),and Beijing(BJMX7).The RNA1 and RNA2 nucleotide identities between the 11 Chinese isolates were 95.4-99.3%and 96.3-99.6%,respectively,and they shared 78.4-96.6%and 84.8-93.5%identities with the available SMoV isolates in GenBank.Recombination analysis revealed that Chinese isolate SDHY33-2 and Canadian isolates Ontario and Simcoe were recombinants,and recombination events frequently occurred in the 3’UTR of SMoV.Phylogenetic analysis showed that in an RNA1 tree,most Chinese isolates clustered into the same group while isolate DGHY17 clustered into another group together with Czech isolate C and three Canadian isolates.In an RNA2 tree,all Chinese isolates clustered into a single group.The phylogenetic analysis based on nucleotide sequences was consistent with the results based on coat protein(CP)and RNA-dependent RNA polymerase(RdRp).Further evolutionary analysis indicated that negative selection drives SMoV evolution,and gene flow plays a major role in genetic differentiation.Additionally,reassortment and recombination also influence the evolution of SMoV.To our knowledge,this is the first report of the complete genome of SMoV isolates from China and a detailed analysis of the SMoV population structure.
基金the Natural Science Foundation of Guangdong Province,China (5006678)the National Natural Science Foundation of China
文摘To determine the genomic sequence of a duck hepatitis virus type 1 (DHV-1) strain, real-time quantitative polymerase chain reaction (RTQ-PCR) assay based on SYBR Green I technology was developed to target 3D gene of DHV-1, Comparative sequence analysis showed that the genome has a typical picornarivus genetic organization, and strain DHV-1 R genetic organaiztion is 5' untranslated region (UTR)-VP0-VP3-VPI-2A1-2A2-2B-2C-3A-3B-3C-3D-3' UTR, DHV-1 R has close relationship with Parechovirus, and has 95.1-99.1% nucleotide sequence identity with other DHV-1 strains. Based on the DHV-1 sequences in GenBank, three pairs of specific primers were designed to amplify DHV-1 using real-time PCR. The results showed that real-time PCR Tm value is 85.6℃ and the real-time PCR provides a broad dynamic range, detecting from 102 to 109 copies of DHV-1 cDNA per reaction. No cross-reactions were found in specimens containing DPV, AIV and NDV. It is concluded that DHV-1 belongs to a new group of the family Picornaviridae that may form a separate genus most closely related to the genus Parechovirus. All results showed that the real-time PCR has high sensitivity and specificity to detect DHV-1 using SYBR Green I dissociation curve analysis, isolates can be distinguished by their melting temperature. These methods are rapid, sensitive, and reliable, and can be readily adapted for detection of DHV-1 from other clinical samples.
基金Supported by National Natural Science Foundation of China(No.31402347)Guangdong Special Marine Fishery Science and Technology Project(A201701C04)+3 种基金"Molecular epidemiological study of Cy HV-3 in Guangdong Province",2016 Guangdong Special Funds for Fish DiseasesGuangzhou Science and Technology Plan Projects(201707010216)Guangdong Science and Technology Plan Projects(2017A020224019)Guangdong Science and Technology Plan Projects(2016A020210029)
文摘The codon usage bias and the base compositions in the available 49 complete fish rhabdovirus genome sequences were investigated. The high correlation between GC12% and GC3% suggested that mutational pressure rather than natural selection was the main factor determining the codon usage and base component in fish rhabdovirns, most of the spots lying below the expected curve ( ENC vs GC3% ) also suggested that codon usage bias in these 49 genomes was influenced by mutational pressure in codon usage pattern. Through PCA analysis based on RSCU, all ORFs in the 49 fish rhabdoviurs genomes were clustered into two parts, and this result showed that hosts played a role in codon usage pattern of fish rhabdovirns genomes. Further, comparison between SVCV, INHV, VHSV and their hosts suggested the effect of natural host on the codon usage pattern took part in the evolution process of fish rhabdovirns. This study represents the most comprehensive analysis to date for fish rhabdovirus eodon usage patterns.
基金supported by the Major Scientific and Technological Special Project for“Significant New Drugs Creation”(No.2018ZX09711001-008-007)the Chinese Academy of Medical Sciences(CAMS)Innovation Fund for Medical Sciences(CIFMS,No.2016-I2M-3-016)。
文摘Pyrrosia petiolosa,Pyrrosia lingua and Pyrrosia sheareri are recorded as original plants of Pyrrosiae Folium(PF)and commonly used as Chinese herbal medicines.Due to the similar morphological features of PF and its adulterants,common DNA barcodes cannot accurately distinguish PF species.Knowledge of the chloroplast(cp)genome is widely used in species identification,molecular marker and phylogenetic analyses.Herein,we determined the complete cp genomes of three original species of PF via highthroughput sequencing technologies.The three cp genomes exhibited a typical quadripartite structure with sizes ranging from 158165 to 163026 bp.The cp genomes of P.petiolosa and P.lingua encoded 130 genes,whilst that of P.sheareri encoded 131 genes.The complete cp genomes were compared,and five highly divergent regions of pet A-psb J,mat K-rps16,ndh C-trn M,psb M-pet N and psa Cndh E were screened as potential DNA barcodes for identification of Pyrrosia genus species.The phylogenetic tree we obtained indicated that P.petiolosa and P.lingua are clustered in a single clade and,thus,share a close relationship.This study provides invaluable information for further studies on the species identification,taxonomy and phylogeny of Pyrrosia genus species.
基金supported by grants from Key Research&Development Project of Nanhua Biomedical Co.,Ltd.(No.H202191490139)National Natural Science Foundation of China(No.31872866)+1 种基金China Postdoctoral Science Foundation(Nos.2021M701160 and 2022M721101)Funds of Hunan university(521119400156).
文摘Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2),the pathogen responsible for coronavirus disease 2019(COVID-19),continues to evolve,giving rise to more variants and global reinfections.Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations.In this study,we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples,including human coronaviruses(HCoVs)and SARSr-CoV-2 lineages.Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets,encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages.Through genetic-level species testing,we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2.Subsequently,75 main and subordinate species-specific barcode segments for SARS-CoV-2,located in ORF1ab,S,E,ORF7a,and N coding sequences,were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores.Post-testing,these segments exhibited high recall rates(nearly 100%),specificity(almost 30%at the nucleotide level),and precision(100%)performance on identification.They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database(http://virusbarcodedatabase.top/).The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis.Moreover,this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.
文摘of complete genome sequences submitted directly from sequencing projects are diverse in terms of annotation strategies and update frequencies. These inconsistencies make comparative studies difficult. To allow rapid data preparation of a large number of complete genomes, automation and speed are important for genome re-annotation. Here we introduce an open-source rapid genome re-annotation software system, Restauro-G, specialized for bacterial genomes. Restauro-G re-annotates a genome by similarity searches utilizing the BLASTLike Alignment Tool, referring to protein databases such as UniProt KB, NCBI nr, NCBI COGs, Pfam, and PSORTb. Re-annotation by Restauro-G achieved over 98% accuracy for most bacterial chromosomes in comparison with the original manually curated annotation of EMBL releases. Restauro-G was developed in the generic bioinformatics workbench G-language Genome Analysis Environment and is distributed at http://restauro-g.iab.keio.ac.jp/ under the GNU General Public License.
基金This work was supported in part by grants from the National Basic Research Program of China(2012CB721102,2013CB734000)the Natural Science Foundation for Youth(31300034)+1 种基金the National Natural Science Foundation of China(31270056,31430004 and 31421061)LX.Z.is an Awardee of the National Distinguished Young Scholar Program in China(31125002).
文摘The complete genome of methanol-utilizing Amycolatopsis methanolica strain 239T was generated,revealing a single 7,237,391 nucleotide circular chromosome with 7074 annotated protein-coding sequences(CDSs).Comparative analyses against the complete genome sequences of Amycolatopsis japonica strain MG417-CF17T,Amycolatopsis mediterranei strain U32 and Amycolatopsis orientalis strain HCCB10007 revealed a broad spectrum of genomic structures,including various genome sizes,core/quasi-core/non-core configurations and different kinds of episomes.Although polyketide synthase gene clusters were absent from the A.methanolica genome,12 gene clusters related to the biosynthesis of other specialized(secondary)metabolites were identified.Complete pathways attributable to the facultative methylotrophic physiology of A.methanolica strain 239T,including both the mdo/mscR encoded methanol oxidation and the hps/hpi encoded formaldehyde assimilation via the ribulose monophosphate cycle,were identified together with evidence that the latter might be the result of horizontal gene transfer.Phylogenetic analyses based on 16S rDNA or orthologues of AMETH_3452,a novel actinobacterial class-specific conserved gene against 62 or 18 Amycolatopsis type strains,respectively,revealed three major phyletic lineages,namely the mesophilic or moderately thermophilic A.orientalis subclade(AOS),the mesophilic Amycolatopsis taiwanensis subclade(ATS)and the thermophilic A.methanolica subclade(AMS).The distinct growth temperatures of members of the subclades correlated with corresponding genetic variations in their encoded compatible solutes.This study shows the value of integrating conventional taxonomic with whole genome sequence data.