蛋白质序列与EST序列的反翻译联配

REVERSE-TRANSLATED ALIGNMENT OF EST SEQUENCE WITH PROTEIN SEQUENCE

下载PDF

导出

摘要随着大规模测序技术的进步 ,收录到数据库中的序列增长很快 ,其中大多是未知功能的ESTs(表达序列标签 ,ExpressedSequenceTags)。一般通过蛋白质 -EST序列联配来实现EST的功能提示。由于EST含有5 %左右的测序误差 ,特别严重的是其中的移框误差 ,用通常的方法将EST按6个阅框翻译为蛋白质序列再进行联配难以处理移框误差问题。通过考虑EST序列各种可能的测序误差 ,将氨基酸序列反翻译为核苷酸序列 ,在核酸水平直接进行序列联配 ,用以实现蛋白质与EST序列的精确匹配 ,并对EST序列的移框误差进行识别与校正。 The sequences in database increase quickly along with the development of the high-throughput sequencing techniques, while most of the sequences are ESTs (Expressed Sequencing Tags) with unknown function. The homology alignment was often employed to identify the biological function of EST sequences, comparing all the six reading frames of EST against the selected protein databases at protein level. However, EST sequences contain nearly 5% sequencing errors, in which the frameshift errors made it difficult to treat precisely with traditional alignment. Addressing most of the possible sequencing errors, our alignment model is reverse-translateing the protein sequence into putative nucleotide sequence, which allowed direct comparison at nucleotide level. Such alignment between protein and EST sequences could be more accurate. And the knotty frameshifts in EST sequences could be identified with high quality.

作者张文张少洁汤海旭丁达夫

机构地区中国科学院上海生物化学研究所

出处《生物物理学报》 CAS CSCD 北大核心 2000年第2期322-333,共12页 Acta Biophysica Sinica

基金国家自然科学基金重大项目课题资助项目!(39990600 -03) 国家人类基因组南方研究中心项目

关键词测序误差移框误差反翻译蛋白质-EST联配 Sequencing error Frameshift error Reverse-translate Protein-EST alignment

分类号 Q75 [生物学—分子生物学]

引文网络
相关文献

参考文献4

1杨灿珠，博士学位论文，1999年
2Huang X Q，Genomics，1997年，46期，37页
3Guan X J，CABIOS，1996年，12期，31页
4Huang X Q，CABIOS，1996年，12卷，497页

1乔瑞洁,杨晓明.肽核酸[J].微生物学免疫学进展,1999,27(1):86-90. 被引量：2
2郭忠建,朱颖敏,陈克平.Sequence and Molecular Evolution Analysis of Ubiquitin Proteins Encoded by Baculoviruses[J].Agricultural Science & Technology,2010,11(9):53-57.
3解涛,盛泉虎,丁达夫.蛋白质组表达图谱用于基因组功能提示的可行性研究[J].生物化学与生物物理学报,1999,31(4):451-455. 被引量：3
4张保红,丁达夫.蛋白质结构成对比较的新方法[J].生物物理学报,1993,9(3):353-361. 被引量：5
5吴开国,吴彤,磨传真,郭松超,袁秀玲,唐正心.应用沼气池液培养钝顶螺旋藻的初步探讨[J].广西医学院学报,1990,7(2):11-17. 被引量：4
6陈国忠,李文均,徐丽华,姜成林.16S rRNA二级结构的研究进展及其在系统分类中的应用[J].微生物学杂志,2005,25(5):54-57. 被引量：8
7陈艳君,朱伟文,隋硕,张晓伟,胡松年.羊轮状病毒NT株VP1基因的测序和分子进化分析[J].微生物学报,2009,49(8):1055-1062.
8王非,杨欣,June Y.Liberamy.生物序列比对算法的实现与集成[J].计算机与应用化学,2004,21(4):583-586. 被引量：2
9邓艳春,药立波,苏成芝.PH结构域研究进展[J].生命科学,2000,12(3):117-121. 被引量：9
10季海涛,张万年,周有骏,吕加国,朱驹,朱杰.细胞色素P450超家族蛋白质基于结构知识的序列联配[J].生物物理学报,1999,15(2):360-368. 被引量：6

生物物理学报

2000年第2期

浏览历史

内容加载中请稍等...

蛋白质序列与EST序列的反翻译联配

参考文献4

相关作者

相关机构

相关主题

浏览历史