Abstract
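Repetitive sequences are an important component of animal genomes and have a major influence on genome structural diversity, the regulation of gene expression, and the mediation of a range of genetic diseases. This study used two strategies, the sequence-alignment-based RepeatMasker (RM) and the de novo predictor RepeatScout (RS), to identify and annotate repetitive sequences in the genome of the giant panda Ailuropoda melanoleuca, and characterized in detail the composition, types, copy numbers, subfamilies, length distribution, and divergence rates of its transposable elements (TEs). Comparing the two annotation methods, RM annotated more TE copies than RS in the vast majority of subfamilies, but fewer than RS in some subfamilies; RS annotated fewer TE subfamily types, and shorter average TE lengths, than RM. In addition, 20% of the giant panda TE consensus sequences constructed by RS did not belong to any existing repeat type and may include TE types specific to the giant panda. These results lay an important foundation for elucidating the characteristics and biological functions of giant panda repetitive sequences.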
Repeat elements, especially transposable elements (TEs), are an important component of eukaryotic genomes: they contribute to variation in genome architecture and are involved in a wide range of biological processes, such as gene mutation or activation, and in various diseases. In the present study, TE content, type, copy number, subfamily, divergence rate, and average length were investigated in the giant panda genome using two strategies: the library-based RepeatMasker (RM) and the de novo RepeatScout (RS). Comparison of the two strategies showed that RM annotated significantly more copies than RS for most TEs, whereas it identified fewer copies than RS in some TE subfamilies. Moreover, RM identified many more TE subfamilies than RS, and the average length of each TE type annotated by RM was longer than that annotated by RS. In addition, we constructed 3,400 consensus sequences of giant panda repeat elements using RS; 20% of these differed from the consensus sequences of elements in the database and thus might include panda lineage-specific repeat elements.
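The per-subfamily comparison described in the abstract (copy number and average length under each annotation strategy) can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The record layout `(subfamily, start, end)`, the function names `summarize` and `compare`, and the toy coordinates are all assumptions made for illustration; real RM or RS output would first need to be parsed into this form.

```python
from collections import defaultdict


def summarize(hits):
    """Return {subfamily: (copy_number, mean_length)}.

    `hits` is an iterable of (subfamily, start, end) tuples; coordinates
    are assumed to be 1-based and inclusive (a hypothetical layout).
    """
    lengths = defaultdict(list)
    for subfamily, start, end in hits:
        lengths[subfamily].append(end - start + 1)
    return {name: (len(v), sum(v) / len(v)) for name, v in lengths.items()}


def compare(rm_hits, rs_hits):
    """Tabulate copy number and mean length per subfamily for both methods."""
    rm, rs = summarize(rm_hits), summarize(rs_hits)
    table = {}
    for name in sorted(set(rm) | set(rs)):
        rm_n, rm_len = rm.get(name, (0, 0.0))  # subfamily absent -> zero copies
        rs_n, rs_len = rs.get(name, (0, 0.0))
        table[name] = {
            "rm_copies": rm_n,
            "rs_copies": rs_n,
            "rm_mean_len": rm_len,
            "rs_mean_len": rs_len,
        }
    return table


# Toy, made-up records used only to exercise the functions; not data from the study.
rm_hits = [("L1", 1, 100), ("L1", 201, 400), ("MIR", 10, 59)]
rs_hits = [("L1", 1, 80), ("MIR", 10, 49), ("MIR", 100, 139)]
table = compare(rm_hits, rs_hits)

for name, row in table.items():
    print(name, row)
```

Keeping the summary step separate from the comparison step means the same `summarize` function can be reused for any annotation set once its hits are reduced to subfamily and coordinates.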
Authors
Peng Changjun (彭长军)
Niu Lili (牛李丽)
Deng Jiabo (邓家波)
Yu Jianqiu (余建秋)
Li Jing (李静)
PENG Changjun, NIU Lili, DENG Jiabo, YU Jianqiu, LI Jing (Key Laboratory of Bio-resources and Eco-environment, Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China; Sichuan Wild Animal Research Institute, Chengdu Zoo, Chengdu 610081, China)
Source
Sichuan Journal of Zoology (《四川动物》)
Peking University Core Journal (北大核心)
2017, Issue 2, pp. 121-130 (10 pages)
Funding
Chengdu Giant Panda Breeding Research Foundation project (CPF2014-13)