摘要
为了从分子生物学的角度解析桑树的重要功能性状,以广西蚕区主栽桑品种桂桑优12的根部组织为材料提取总RNA进行转录组测序。测序共获得50844314条原始测序数据(Raw data),经过滤后得到50540436条Clean data。对序列进行拼接组装、去冗余后,获得了102254个转录本,总长度为64665080 bp,平均长度为771 bp,N50值为1473 bp,GC含量47.2%。将获得的序列与各大功能数据库比对,进行功能注释,共有86332个转录本获得了注释结果,预测出68218个编码序列(Coding DNA sequence,CDS),总长度34282488 bp,平均每个CDS长度为502 bp,N50值为756 bp,GC含量42.44%。其中有40254个转录本在KEGG获得注释,有26832个转录本在COG获得注释,有27766个转录本在GO获得注释。桂桑优12根部组织的转录本数据分析结果,可为今后研究发掘桑树重要功能性状基因提供一定的数据支持。
In order to analyze the functional genes of Mulberry,the total RNA was extracted from the root tissue of Guisangyou 12,which are widely cultivated in Guangxi,and its transcriptome was sequenced.50540436 Clean Data samples were obtained from 50844314 Raw Data.102254 unigenes were assembled,with total nucleotides of 64665080.The average size,N50 value and GC percentage are 771 bp,1473 bp and 47.2%,respectively.By comparing these Unigene sequences with those in the public databases,86332 Unigenes were annotated.68218 Coding sequences(CDS)which the total length was 3428248 bp were predicted with the 502 bp average length,756 bp N50 value,and 42.44%GC percentage.40254 Unigenes were annotated in the KEGG database,26832 Unigenes were in the COG database and 27766 Unigenes were in the GO database.The studies of transcription data of Guisangyou 12 can provide some data support for the discovery of functional genes in mulberry.
作者
张朝华
王霞
ZHANG Chao-hua;WANG Xia(Guangxi Academy of Sericulture Sciences,Nanning 530007,China)
出处
《蚕学通讯》
2021年第3期1-7,共7页
Newsletter of Sericultural Science
基金
广西自然科学基金项目(2018GXNSFBA281005)
广西重点研发计划(桂科AB17129008)。
关键词
桑树
编码序列
转录组
功能注释
Mulberry
Coding sequence
Transcriptome
Functional annotation