摘要
Squaliobarbus curriculus is a eurythermic ?sh found mainly in East Asia. The natural germplasm resources of this species, an organism of potential economic aquacultural importance, have recently been threatened by over?shing and habitat degradation. The objective of this study was to obtain functional genomic resources and candidate microsatellite markers for future genetic observation and germplasm protection of S. curriculus. Illumina paired-end sequencing generated over 22.20 million clean reads, which were assembled into 204 027 transcripts with an average length of 1 457 bp. A total of 112 570 protein-coding sequences(55.17%) were predicted, 111 934 of which had signi?cant matches in the NCBI nonredundant protein database. Overall, 60 450, 34 830, and 77 755 transcripts were assigned to Gene Ontology, Clusters of Orthologous Groups, and Eukaryotic Orthologous Groups categories, respectively.In addition, 44 781 transcripts(21.95%) were mapped to 230 KEGG pathways. A total of 51 426 candidate simple sequence repeat(SSR) markers were identi?ed and used to design 36 149 PCR primer pairs. Fifty of these primer pairs were randomly selected for PCR validation in 32 S. curriculus specimens, which resulted in 25 pairs yielding the expected products. Further analysis revealed that 40%(10) of the 25 ampli?ed loci were polymorphic. The transcriptomic pro?le and SSR repertoire obtained in this study should facilitate population genetic studies and germplasm resource protection of S. curriculus.
Squaliobarbus curriculus is a eurythermic ?sh found mainly in East Asia. The natural germplasm resources of this species, an organism of potential economic aquacultural importance, have recently been threatened by over?shing and habitat degradation. The objective of this study was to obtain functional genomic resources and candidate microsatellite markers for future genetic observation and germplasm protection of S. curriculus. Illumina paired-end sequencing generated over 22.20 million clean reads, which were assembled into 204 027 transcripts with an average length of 1 457 bp. A total of 112 570 protein-coding sequences(55.17%) were predicted, 111 934 of which had signi?cant matches in the NCBI nonredundant protein database. Overall, 60 450, 34 830, and 77 755 transcripts were assigned to Gene Ontology, Clusters of Orthologous Groups, and Eukaryotic Orthologous Groups categories, respectively.In addition, 44 781 transcripts(21.95%) were mapped to 230 KEGG pathways. A total of 51 426 candidate simple sequence repeat(SSR) markers were identi?ed and used to design 36 149 PCR primer pairs. Fifty of these primer pairs were randomly selected for PCR validation in 32 S. curriculus specimens, which resulted in 25 pairs yielding the expected products. Further analysis revealed that 40%(10) of the 25 ampli?ed loci were polymorphic. The transcriptomic pro?le and SSR repertoire obtained in this study should facilitate population genetic studies and germplasm resource protection of S. curriculus.
基金
Supported by the Jiangsu Collaborative Innovation Center of Regional Modern Agriculture and Environmental Protection,Huaiyin Normal University(Germplasm Production and Culture of S.curriculus,No.HSXT219)
the Scientific Fund of Jiangsu Province(No.BY2015039-10)
the Priority Academic Development Program of Jiangsu Higher Education Institutions,China