Long-PCR amplification, clone and primer-walking sequencing methods were employed in determine the complete sequence of mitochondrial genome of tokay (Gekko gecko). The genome is 16 435 bp in size, contains 13 protein...Long-PCR amplification, clone and primer-walking sequencing methods were employed in determine the complete sequence of mitochondrial genome of tokay (Gekko gecko). The genome is 16 435 bp in size, contains 13 protein-coding, 2 ribosomal and 22 transfer RNA genes. The mt genome of Gekko is similar to most of the vertebrates in gene components, order, orientation, tRNA structures, low percentage of guanine and high percentage of thymine, and skews of base GC and AT. Base A was preferred at third codon positions for protein genes is similar to amphibians and fishes rather than amnion vertebrates. The standard stop codes (TAA) present only in three protein genes, less than those of most vertebrates. Transfer RNA genes range in length from 63 to 76 nt, their planar structure present characteristic clover leaf, except for tRNA-Cys and tRNA-Ser (AGY) because of lacking the D arm.展开更多
Mycoplasma ovipneumoniae, a kind of mycoplasma bacteria, commonly infects the respiratory tract causing respiratory disease in sheep and goats worldwide. Here, the complete genome sequence of M. ovipneumoniae strain N...Mycoplasma ovipneumoniae, a kind of mycoplasma bacteria, commonly infects the respiratory tract causing respiratory disease in sheep and goats worldwide. Here, the complete genome sequence of M. ovipneumoniae strain NM2010 isolated from a sheep in China was reported for the ifrst time.展开更多
The complete nucleotide sequence of the mumps virus SP, which was isolated in China, was determined. As with other mumps viruses, its genome was 15 384 nucleotides (nts) in length and encoded seven proteins. The full-...The complete nucleotide sequence of the mumps virus SP, which was isolated in China, was determined. As with other mumps viruses, its genome was 15 384 nucleotides (nts) in length and encoded seven proteins. The full-length nucleotide sequence of the SP isolate differed from other strains by 4% –6.8% at the nucleotide sequence level. Due to variations of amino acids over the full genome (including the HN and N genes), this isolate exhibited significant variations in the antigenic sites. This report is the first to describe the full-length genome of a genotype F strain and provide an overview of the diversity of genetic characteristics of a circulating mumps virus.展开更多
In this study, we determined the complete nucleotide and deduced amino acid sequence of a primary isolate of rabies virus (SH06) obtained from the brain of a rabid dog. The overall length of the genome was 11 924 nucl...In this study, we determined the complete nucleotide and deduced amino acid sequence of a primary isolate of rabies virus (SH06) obtained from the brain of a rabid dog. The overall length of the genome was 11 924 nucleotides. Comparison of the genomic sequence showed the homology of SH06 at nucleotide level with full-length genomes of reference vaccine strains ranged from 82.2% with the PV strain to 86.9% with the CTN strain. A full-length genome-based phylogenetic analysis was performed with sequences available from GenBank. Phylogenetic analysis of the complete genome sequences indicated that the SH06 exhibited the highest homology with rabies street virus BD06 and CTN vaccine strain originated from China.展开更多
The complete nucleotide sequence of the measles virus strain IMB-1,which was isolated in China,was determined.As in other measles viruses,its genome is 15,894 nucleotides in length and encodes six proteins.The full-le...The complete nucleotide sequence of the measles virus strain IMB-1,which was isolated in China,was determined.As in other measles viruses,its genome is 15,894 nucleotides in length and encodes six proteins.The full-length nucleotide sequence of the IMB-1 isolate differed from vaccine strains (including wild-type Edmonston strain) by 4%-5% at the nucleotide sequence level.This isolate has amino acid variations over the full genome,including in the hemagglutinin and fusion genes.This report is the first to describe the full-length genome of a genotype H1 strain and provide an overview of the diversity of genetic characteristics of a circulating measles virus.展开更多
Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. To better understand the role of the surface-associated li...Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. To better understand the role of the surface-associated lifestyle of C. marina and the phylogeny of the family Halomonadaceae, we sequenced the entire genome of C. marina JCM 21022T using single molecule real-time sequencing technology (SMRT) and performed comparative genomics and phylogenomics analyses. The circular chromosome was 4 176 300 bp with an average GC content of 62.44% and contained 3 611 predicted coding sequences, 72 tRNA genes, and 21 rRNA genes. The C. marina JCM 2102U genome contained a set of crucial genes involved in surface colonization processes. The comparative genome analysis indicated the significant differences between C. marina JCM 21022T and Cobetia amphilecti KMM 296 (formerly named C. marina KMM 296) resulted from sequence insertions or deletions and chromosomal recombination. Despite these differences, pan and core genome analysis showed similar gene functions between the two strains. The phylogenomic study of the family Halomonadaceae is relationships were well resolved among every genera Cobetia, Kushneria, Zymobacter, and Halotalea. reported here for the first time. We found that the tested, including Chromohalobacter, Halomonas,展开更多
For any given coprime integers p and q greater than 1, in 1959, B proved that all sufficiently large integers can be expressed as a sum of pairwise terms of the form p^aq^b. As Davenport observed, Birch's proof can b...For any given coprime integers p and q greater than 1, in 1959, B proved that all sufficiently large integers can be expressed as a sum of pairwise terms of the form p^aq^b. As Davenport observed, Birch's proof can be modified that the exponent b can be bounded in terms of p and q. In 2000, N. Hegyvari effective version of this bound. The author improves this bound.展开更多
Pathogenic Escherichia coli cause chicken colibacillosis, which is economically devastating to the poultry in- dustry worldwide (Bagheri et al., 2014). Owing to in- creasing antibiotic resistance, phage therapy reag...Pathogenic Escherichia coli cause chicken colibacillosis, which is economically devastating to the poultry in- dustry worldwide (Bagheri et al., 2014). Owing to in- creasing antibiotic resistance, phage therapy reagents have been developed to treat bacterial infections (Xu et al., 2015).展开更多
Cyanophages are ubiquitous and essential components of the aquatic environment and play an important role in the termination of algal blooms.As such,they have attracted widespread interest.PP was the first isolated cy...Cyanophages are ubiquitous and essential components of the aquatic environment and play an important role in the termination of algal blooms.As such,they have attracted widespread interest.PP was the first isolated cyanophage in China,which infects Plectonema boryanum and Phormidium foveolarum.In this study,this cyanophage was purified three times by a double-agar overlay plaque assay and characterized.Its genome was extracted,totally sequenced and analyzed.Electron microscopy revealed a particle with an icosahedral head connected to a short stubby tail.Bioassays showed that PP was quite virulent.The genome of PP is a 42,480 base pair(bp),linear,double-stranded DNA molecule with 222 bp terminal repeats.It has high similarity with the known Pf-WMP3 sequence.It contains 41 open reading frames(ORFs),17 of which were annotated.Intriguingly,the genome can be divided into two completely different parts,which differ both in orientation and function.展开更多
Enterococci bacteria are important in environmental, food and clinical microbiology. Enterococcus faecium is a nosocomial pathogen that causes bacteremia, endocarditis and other infections. It is among the most preval...Enterococci bacteria are important in environmental, food and clinical microbiology. Enterococcus faecium is a nosocomial pathogen that causes bacteremia, endocarditis and other infections. It is among the most prevalent organisms encountered in hospital-associated infections accounting for approximately 12% of nosocomial infections in the USA (Linden and Miller, 1999). However, certain strains of E. faecium are not only non-pathogenic but also have beneficial effects on human health with probiotic potential. For example, E. faecium T-110 is a consortium member in several probiotic products including BIO-THREE~ which is widely prescribed for human, animal and aqua-cultural use. This strain was originally developed by TOA Pharmaceuticals in Japan, and later used in the probiotic products of several other companies.展开更多
In the paper,we investigate the complete convergence and complete moment convergence for the maximal partial sum of martingale diference sequence.Especially,we get the Baum–Katz-type Theorem and Hsu–Robbins-type The...In the paper,we investigate the complete convergence and complete moment convergence for the maximal partial sum of martingale diference sequence.Especially,we get the Baum–Katz-type Theorem and Hsu–Robbins-type Theorem for martingale diference sequence.As an application,a strong law of large numbers for martingale diference sequence is obtained.展开更多
Complete complementary sequences are widely used in spectrum spread communications because of their ideal correlation functions. A previous method generates complete complementary sequences with lengths of N^nN (n,N ...Complete complementary sequences are widely used in spectrum spread communications because of their ideal correlation functions. A previous method generates complete complementary sequences with lengths of N^nN (n,N ∈ Z^+). This paper presents a new iterative method to construct complete complementary sequences with lengths of 2^mN (m,N ∈ Z^+). The analysis proves that this method can produce many sequence sets that do not appear in sequence sets generated by the former method, especially shorter sequence sets. The result will certainly increase the application of complete complementary sequences in communication engineering and related fields.展开更多
In this paper we first discuss the relations between some G-M-type spaces, and the previous eight kinds of G-M-type Banach spaces are merged into four different kinds. Then we build a Generalized Operator Extension Th...In this paper we first discuss the relations between some G-M-type spaces, and the previous eight kinds of G-M-type Banach spaces are merged into four different kinds. Then we build a Generalized Operator Extension Theorem, and introduce the concept of complete minimal sequences. Some sufficient and necessary conditions under which a Banach space is a hereditarily indecomposable space are given. Finally, we give some characterizations of hereditarily indecomposable Banach Spaces.展开更多
Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2),the pathogen responsible for coronavirus disease 2019(COVID-19),continues to evolve,giving rise to more variants and global reinfections.Previous research ha...Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2),the pathogen responsible for coronavirus disease 2019(COVID-19),continues to evolve,giving rise to more variants and global reinfections.Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations.In this study,we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples,including human coronaviruses(HCoVs)and SARSr-CoV-2 lineages.Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets,encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages.Through genetic-level species testing,we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2.Subsequently,75 main and subordinate species-specific barcode segments for SARS-CoV-2,located in ORF1ab,S,E,ORF7a,and N coding sequences,were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores.Post-testing,these segments exhibited high recall rates(nearly 100%),specificity(almost 30%at the nucleotide level),and precision(100%)performance on identification.They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database(http://virusbarcodedatabase.top/).The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis.Moreover,this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.展开更多
The complete genome of methanol-utilizing Amycolatopsis methanolica strain 239T was generated,revealing a single 7,237,391 nucleotide circular chromosome with 7074 annotated protein-coding sequences(CDSs).Comparative ...The complete genome of methanol-utilizing Amycolatopsis methanolica strain 239T was generated,revealing a single 7,237,391 nucleotide circular chromosome with 7074 annotated protein-coding sequences(CDSs).Comparative analyses against the complete genome sequences of Amycolatopsis japonica strain MG417-CF17T,Amycolatopsis mediterranei strain U32 and Amycolatopsis orientalis strain HCCB10007 revealed a broad spectrum of genomic structures,including various genome sizes,core/quasi-core/non-core configurations and different kinds of episomes.Although polyketide synthase gene clusters were absent from the A.methanolica genome,12 gene clusters related to the biosynthesis of other specialized(secondary)metabolites were identified.Complete pathways attributable to the facultative methylotrophic physiology of A.methanolica strain 239T,including both the mdo/mscR encoded methanol oxidation and the hps/hpi encoded formaldehyde assimilation via the ribulose monophosphate cycle,were identified together with evidence that the latter might be the result of horizontal gene transfer.Phylogenetic analyses based on 16S rDNA or orthologues of AMETH_3452,a novel actinobacterial class-specific conserved gene against 62 or 18 Amycolatopsis type strains,respectively,revealed three major phyletic lineages,namely the mesophilic or moderately thermophilic A.orientalis subclade(AOS),the mesophilic Amycolatopsis taiwanensis subclade(ATS)and the thermophilic A.methanolica subclade(AMS).The distinct growth temperatures of members of the subclades correlated with corresponding genetic variations in their encoded compatible solutes.This study shows the value of integrating conventional taxonomic with whole genome sequence data.展开更多
In October 2017,a small outbreak of echovirus 30(E30)associated with aseptic meningitis in nine cases occurred at a primary school in the Ningxia Hui Autonomous Region.That year,we observed a significant increase in E...In October 2017,a small outbreak of echovirus 30(E30)associated with aseptic meningitis in nine cases occurred at a primary school in the Ningxia Hui Autonomous Region.That year,we observed a significant increase in E30 levels in an acute flaccid paralysis(AFP)case surveillance system.To investigate their phylogenetic relationships,we determined the whole genomic sequences of 12 strains isolated from aseptic meningitis cases,AFP cases,and healthy children.We found that the E30 strains circulating in Ningxia belong to two lineages(H and J).The strains isolated in 2010,2012,and 2016 belonged to the H lineage.In 2017,a new lineage,J,emerged as the dominant lineage.Phylogenetic trees were constructed based on the whole genome andP1,P2,andP3 regions;clustering with other types of enterovirus species B was found,suggesting that recombination events had occurred.The recombination sites were mainly in regions2B,2C,and3D.This study confirmed that the E30 strains in Ningxia in 2010,2012,and 2016 had different recombination patterns and were recombined with different enteroviruses.The 2017 epidemic E30 originated from another new lineage with a complex recombination pattern and formed an independent transmission chain in Ningxia.展开更多
Coxsackievirus A24 variant(CVA24v)is a major pathogen that causes continued outbreaks and pandemics of acute hemorrhagic conjunctivitis(AHC).In China,the first confirmed outbreak of CVA24v-related AHC occurred in Beij...Coxsackievirus A24 variant(CVA24v)is a major pathogen that causes continued outbreaks and pandemics of acute hemorrhagic conjunctivitis(AHC).In China,the first confirmed outbreak of CVA24v-related AHC occurred in Beijing in 1988,followed by another two significant outbreaks respectively in 1994 and 2007,which coincides with the three-stage dynamic distribution of AHC in the world after 1970s.To illustrate the genetic characteristics of CVA24v in different periods,a total of 23 strains were isolated from those three outbreaks and the whole genome of those isolations were sequenced and analyzed.Compared with the prototype strain,the 23 strains shared four nucleotide deletions in the 5'UTR except the 0744 strain isolated in 2007.And at the 98th site,one nucleotide insertion was found in all the strains collected from 2007.From 1994 to 2007,amino acid polarity in the VP1 region at the 25th and the 32nd site were changed.Both the 3C and VP1 phylogenetic tree indicated that isolates from 1988 and 1994 belonged to Genotype III(GIII),and 2007 strains to Genotype IV(GIV).According to the Bayesian analysis based on complete genome sequence,the most recent common ancestors for the isolates in1988,1994 and 2007 were respectively estimated around October 1987,February 1993 and December 2004.The evolutionary rate of the CVA24v was estimated to be 7.45×10^(-3) substitutions/site/year.Our study indicated that the early epidemic of CVA24v in Chinese mainland was the GIII.Point mutations and amino acid changes in different genotypes of CVA24v may generate intensity differences of the AHC outbreak.CVA24v has been evolving constantly with a relatively rapid rate.展开更多
文摘Long-PCR amplification, clone and primer-walking sequencing methods were employed in determine the complete sequence of mitochondrial genome of tokay (Gekko gecko). The genome is 16 435 bp in size, contains 13 protein-coding, 2 ribosomal and 22 transfer RNA genes. The mt genome of Gekko is similar to most of the vertebrates in gene components, order, orientation, tRNA structures, low percentage of guanine and high percentage of thymine, and skews of base GC and AT. Base A was preferred at third codon positions for protein genes is similar to amphibians and fishes rather than amnion vertebrates. The standard stop codes (TAA) present only in three protein genes, less than those of most vertebrates. Transfer RNA genes range in length from 63 to 76 nt, their planar structure present characteristic clover leaf, except for tRNA-Cys and tRNA-Ser (AGY) because of lacking the D arm.
基金supported by the Nationai Key Technology R&D Program of China (2011BAD18B01)
文摘Mycoplasma ovipneumoniae, a kind of mycoplasma bacteria, commonly infects the respiratory tract causing respiratory disease in sheep and goats worldwide. Here, the complete genome sequence of M. ovipneumoniae strain NM2010 isolated from a sheep in China was reported for the ifrst time.
基金Public Benefit Grant of Ministry of Health, China (200802035)Natural Science Foundation of Yunnan province (2008CD153)
文摘The complete nucleotide sequence of the mumps virus SP, which was isolated in China, was determined. As with other mumps viruses, its genome was 15 384 nucleotides (nts) in length and encoded seven proteins. The full-length nucleotide sequence of the SP isolate differed from other strains by 4% –6.8% at the nucleotide sequence level. Due to variations of amino acids over the full genome (including the HN and N genes), this isolate exhibited significant variations in the antigenic sites. This report is the first to describe the full-length genome of a genotype F strain and provide an overview of the diversity of genetic characteristics of a circulating mumps virus.
基金National High-Tech Research and Development Program of China (2007AA022402)
文摘In this study, we determined the complete nucleotide and deduced amino acid sequence of a primary isolate of rabies virus (SH06) obtained from the brain of a rabid dog. The overall length of the genome was 11 924 nucleotides. Comparison of the genomic sequence showed the homology of SH06 at nucleotide level with full-length genomes of reference vaccine strains ranged from 82.2% with the PV strain to 86.9% with the CTN strain. A full-length genome-based phylogenetic analysis was performed with sequences available from GenBank. Phylogenetic analysis of the complete genome sequences indicated that the SH06 exhibited the highest homology with rabies street virus BD06 and CTN vaccine strain originated from China.
基金Public Benefit Grant of Ministry of Health P.R China (200802035)Basic Research Foundation(General Program) of Yunnan Province (2008CD153)
文摘The complete nucleotide sequence of the measles virus strain IMB-1,which was isolated in China,was determined.As in other measles viruses,its genome is 15,894 nucleotides in length and encodes six proteins.The full-length nucleotide sequence of the IMB-1 isolate differed from vaccine strains (including wild-type Edmonston strain) by 4%-5% at the nucleotide sequence level.This isolate has amino acid variations over the full genome,including in the hemagglutinin and fusion genes.This report is the first to describe the full-length genome of a genotype H1 strain and provide an overview of the diversity of genetic characteristics of a circulating measles virus.
基金Supported by the National Natural Science Foundation of China(Nos.41006082,31372517)
文摘Cobetia marina is a model proteobacteria in researches on marine biofouling. Its taxonomic nomenclature has been revised many times over the past few decades. To better understand the role of the surface-associated lifestyle of C. marina and the phylogeny of the family Halomonadaceae, we sequenced the entire genome of C. marina JCM 21022T using single molecule real-time sequencing technology (SMRT) and performed comparative genomics and phylogenomics analyses. The circular chromosome was 4 176 300 bp with an average GC content of 62.44% and contained 3 611 predicted coding sequences, 72 tRNA genes, and 21 rRNA genes. The C. marina JCM 2102U genome contained a set of crucial genes involved in surface colonization processes. The comparative genome analysis indicated the significant differences between C. marina JCM 21022T and Cobetia amphilecti KMM 296 (formerly named C. marina KMM 296) resulted from sequence insertions or deletions and chromosomal recombination. Despite these differences, pan and core genome analysis showed similar gene functions between the two strains. The phylogenomic study of the family Halomonadaceae is relationships were well resolved among every genera Cobetia, Kushneria, Zymobacter, and Halotalea. reported here for the first time. We found that the tested, including Chromohalobacter, Halomonas,
基金Project supported by the National Natural Science Foundation of China (Nos. 10771103, 11071121)
文摘For any given coprime integers p and q greater than 1, in 1959, B proved that all sufficiently large integers can be expressed as a sum of pairwise terms of the form p^aq^b. As Davenport observed, Birch's proof can be modified that the exponent b can be bounded in terms of p and q. In 2000, N. Hegyvari effective version of this bound. The author improves this bound.
基金supported by grants from the Nature Science Foundation of Shandong Province of China (grant nos.ZR2013CQ024 and ZR2015CM020)
文摘Pathogenic Escherichia coli cause chicken colibacillosis, which is economically devastating to the poultry in- dustry worldwide (Bagheri et al., 2014). Owing to in- creasing antibiotic resistance, phage therapy reagents have been developed to treat bacterial infections (Xu et al., 2015).
文摘Cyanophages are ubiquitous and essential components of the aquatic environment and play an important role in the termination of algal blooms.As such,they have attracted widespread interest.PP was the first isolated cyanophage in China,which infects Plectonema boryanum and Phormidium foveolarum.In this study,this cyanophage was purified three times by a double-agar overlay plaque assay and characterized.Its genome was extracted,totally sequenced and analyzed.Electron microscopy revealed a particle with an icosahedral head connected to a short stubby tail.Bioassays showed that PP was quite virulent.The genome of PP is a 42,480 base pair(bp),linear,double-stranded DNA molecule with 222 bp terminal repeats.It has high similarity with the known Pf-WMP3 sequence.It contains 41 open reading frames(ORFs),17 of which were annotated.Intriguingly,the genome can be divided into two completely different parts,which differ both in orientation and function.
文摘Enterococci bacteria are important in environmental, food and clinical microbiology. Enterococcus faecium is a nosocomial pathogen that causes bacteremia, endocarditis and other infections. It is among the most prevalent organisms encountered in hospital-associated infections accounting for approximately 12% of nosocomial infections in the USA (Linden and Miller, 1999). However, certain strains of E. faecium are not only non-pathogenic but also have beneficial effects on human health with probiotic potential. For example, E. faecium T-110 is a consortium member in several probiotic products including BIO-THREE~ which is widely prescribed for human, animal and aqua-cultural use. This strain was originally developed by TOA Pharmaceuticals in Japan, and later used in the probiotic products of several other companies.
基金Supported by National Natural Science Foundation of China(Grant Nos.11201001,11171001,11126176 and 11226207)Natural Science Foundation of Anhui Province(Grant Nos.1208085QA03 and 1308085QA03)+2 种基金Applied Teaching Model Curriculum of Anhui University(Grant No.XJYYXKC04)Students Innovative Training Project of Anhui University(Grant No.201310357004)Doctoral Research Start-up Funds Projects of Anhui University and the Students Science Research Training Program of Anhui University(Grant No.KYXL2012007)
文摘In the paper,we investigate the complete convergence and complete moment convergence for the maximal partial sum of martingale diference sequence.Especially,we get the Baum–Katz-type Theorem and Hsu–Robbins-type Theorem for martingale diference sequence.As an application,a strong law of large numbers for martingale diference sequence is obtained.
基金Supported by the Joint Research Program of Tsinghua University of China and the National Institute of Informatics, Japan
文摘Complete complementary sequences are widely used in spectrum spread communications because of their ideal correlation functions. A previous method generates complete complementary sequences with lengths of N^nN (n,N ∈ Z^+). This paper presents a new iterative method to construct complete complementary sequences with lengths of 2^mN (m,N ∈ Z^+). The analysis proves that this method can produce many sequence sets that do not appear in sequence sets generated by the former method, especially shorter sequence sets. The result will certainly increase the application of complete complementary sequences in communication engineering and related fields.
基金The NNSF (10471025) of China the Foundation (JA04170) of the Education Department of Fujian Province, China.
文摘In this paper we first discuss the relations between some G-M-type spaces, and the previous eight kinds of G-M-type Banach spaces are merged into four different kinds. Then we build a Generalized Operator Extension Theorem, and introduce the concept of complete minimal sequences. Some sufficient and necessary conditions under which a Banach space is a hereditarily indecomposable space are given. Finally, we give some characterizations of hereditarily indecomposable Banach Spaces.
基金supported by grants from Key Research&Development Project of Nanhua Biomedical Co.,Ltd.(No.H202191490139)National Natural Science Foundation of China(No.31872866)+1 种基金China Postdoctoral Science Foundation(Nos.2021M701160 and 2022M721101)Funds of Hunan university(521119400156).
文摘Severe acute respiratory syndrome coronavirus 2(SARS-CoV-2),the pathogen responsible for coronavirus disease 2019(COVID-19),continues to evolve,giving rise to more variants and global reinfections.Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations.In this study,we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples,including human coronaviruses(HCoVs)and SARSr-CoV-2 lineages.Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets,encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages.Through genetic-level species testing,we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2.Subsequently,75 main and subordinate species-specific barcode segments for SARS-CoV-2,located in ORF1ab,S,E,ORF7a,and N coding sequences,were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores.Post-testing,these segments exhibited high recall rates(nearly 100%),specificity(almost 30%at the nucleotide level),and precision(100%)performance on identification.They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database(http://virusbarcodedatabase.top/).The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis.Moreover,this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.
基金This work was supported in part by grants from the National Basic Research Program of China(2012CB721102,2013CB734000)the Natural Science Foundation for Youth(31300034)+1 种基金the National Natural Science Foundation of China(31270056,31430004 and 31421061)LX.Z.is an Awardee of the National Distinguished Young Scholar Program in China(31125002).
文摘The complete genome of methanol-utilizing Amycolatopsis methanolica strain 239T was generated,revealing a single 7,237,391 nucleotide circular chromosome with 7074 annotated protein-coding sequences(CDSs).Comparative analyses against the complete genome sequences of Amycolatopsis japonica strain MG417-CF17T,Amycolatopsis mediterranei strain U32 and Amycolatopsis orientalis strain HCCB10007 revealed a broad spectrum of genomic structures,including various genome sizes,core/quasi-core/non-core configurations and different kinds of episomes.Although polyketide synthase gene clusters were absent from the A.methanolica genome,12 gene clusters related to the biosynthesis of other specialized(secondary)metabolites were identified.Complete pathways attributable to the facultative methylotrophic physiology of A.methanolica strain 239T,including both the mdo/mscR encoded methanol oxidation and the hps/hpi encoded formaldehyde assimilation via the ribulose monophosphate cycle,were identified together with evidence that the latter might be the result of horizontal gene transfer.Phylogenetic analyses based on 16S rDNA or orthologues of AMETH_3452,a novel actinobacterial class-specific conserved gene against 62 or 18 Amycolatopsis type strains,respectively,revealed three major phyletic lineages,namely the mesophilic or moderately thermophilic A.orientalis subclade(AOS),the mesophilic Amycolatopsis taiwanensis subclade(ATS)and the thermophilic A.methanolica subclade(AMS).The distinct growth temperatures of members of the subclades correlated with corresponding genetic variations in their encoded compatible solutes.This study shows the value of integrating conventional taxonomic with whole genome sequence data.
基金supported by the Science and Natural Science of Ningxia (Grant No.2021AAC03415)the Science and Natural Science of China (Grant No.81960607)+1 种基金Science and Natural Science of Ningxia (Grant No.2022AAC03708)the National Key Research and Development Program of China (Project No.2021YFC2302003).
文摘In October 2017,a small outbreak of echovirus 30(E30)associated with aseptic meningitis in nine cases occurred at a primary school in the Ningxia Hui Autonomous Region.That year,we observed a significant increase in E30 levels in an acute flaccid paralysis(AFP)case surveillance system.To investigate their phylogenetic relationships,we determined the whole genomic sequences of 12 strains isolated from aseptic meningitis cases,AFP cases,and healthy children.We found that the E30 strains circulating in Ningxia belong to two lineages(H and J).The strains isolated in 2010,2012,and 2016 belonged to the H lineage.In 2017,a new lineage,J,emerged as the dominant lineage.Phylogenetic trees were constructed based on the whole genome andP1,P2,andP3 regions;clustering with other types of enterovirus species B was found,suggesting that recombination events had occurred.The recombination sites were mainly in regions2B,2C,and3D.This study confirmed that the E30 strains in Ningxia in 2010,2012,and 2016 had different recombination patterns and were recombined with different enteroviruses.The 2017 epidemic E30 originated from another new lineage with a complex recombination pattern and formed an independent transmission chain in Ningxia.
基金funded by the National Science and Technology Major project of China(Project No.2017ZX10104001)the National key research and development project(Project No.2021YFC0863000)the Beijing Natural Science Foundation(Project No.L192014)。
文摘Coxsackievirus A24 variant(CVA24v)is a major pathogen that causes continued outbreaks and pandemics of acute hemorrhagic conjunctivitis(AHC).In China,the first confirmed outbreak of CVA24v-related AHC occurred in Beijing in 1988,followed by another two significant outbreaks respectively in 1994 and 2007,which coincides with the three-stage dynamic distribution of AHC in the world after 1970s.To illustrate the genetic characteristics of CVA24v in different periods,a total of 23 strains were isolated from those three outbreaks and the whole genome of those isolations were sequenced and analyzed.Compared with the prototype strain,the 23 strains shared four nucleotide deletions in the 5'UTR except the 0744 strain isolated in 2007.And at the 98th site,one nucleotide insertion was found in all the strains collected from 2007.From 1994 to 2007,amino acid polarity in the VP1 region at the 25th and the 32nd site were changed.Both the 3C and VP1 phylogenetic tree indicated that isolates from 1988 and 1994 belonged to Genotype III(GIII),and 2007 strains to Genotype IV(GIV).According to the Bayesian analysis based on complete genome sequence,the most recent common ancestors for the isolates in1988,1994 and 2007 were respectively estimated around October 1987,February 1993 and December 2004.The evolutionary rate of the CVA24v was estimated to be 7.45×10^(-3) substitutions/site/year.Our study indicated that the early epidemic of CVA24v in Chinese mainland was the GIII.Point mutations and amino acid changes in different genotypes of CVA24v may generate intensity differences of the AHC outbreak.CVA24v has been evolving constantly with a relatively rapid rate.