摘要
目的利用生物信息学对新克隆的人ligatin样基因HCA 5 6的基因和蛋白序列进行分析,探讨生物信息学在新基因研究中的作用。方法以人类基因组数据库为基础,利用电子PCR和SAGE数据库对HCA 5 6进行染色体定位和组织表达分布分析;BLAST程序进行HCA 5 6的基因结构分析和相似序列搜索;ORFfinder和GeneRunner3 0 5程序对HCA 5 6编码蛋白进行序列预测和功能分析。结果得出HCA 5 6的全长cDNA为2 0 2 1bp ,定位于染色体1q31 q32 ,其最长的开放读码框架为175 5bp ,编码5 84个氨基酸,编码氨基酸含有一个亮氨酸拉链和一假定的RNA结合保守序列。相似性搜索发现HCA 5 6基因片段与人ligatin基因片段具有很高的相似性(99% )。SAGE结果显示HCA 5 6在多种组织中都有表达。结论生物信息学是进行新基因研究的有效方法;HCA 5 6很可能是ligatin的全长编码基因。
Objective To analyze the gene and protein sequence of a novel gene HCA56 with bioinformatics and to explore the value of bioinformatics in the novel gene research. Methods Based on the human genome resource, e-PCR and SAGE were used for the analysis of chromosome location and tissue expression of HCA56; BLAST procedure was used to analyze the gene sequence and the similarity gene; ORF finder and Gene Runner 3.05 were used to predict the sequence and the function of the HCA56 encoding protein. Results The full length of the HCA56 is 2 021 bp and locate in 1q31-q32. The longest ORF is 1 755 bp and encodes 584 amino acid with a leucine-zipper and a Putative RNA-binding Domain. Similarity research has shown that HCA56 protein shares highest homology with human ligatin partial cDNA (99%). HCA56 gene is expressed in different tissues. Conclusion Bioinformatics is an effective method for new gene analysis and HCA56 maybe the full length of the ligatin encoded gene.
出处
《基础医学与临床》
CSCD
北大核心
2005年第2期169-172,共4页
Basic and Clinical Medicine
基金
国家重点基础性研究项目 (973项目 ) (G19990 5 390 4 )