Abstract
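Using a theoretical method for predicting gene expression levels, 73 very highly expressed genes and 100 very lowly expressed genes were selected from 1373 Escherichia coli genes. The relationship between gene expression level and base composition at sites -1 to -21 upstream of the start codon ATG of the coding region (a stretch that includes the SD sequence) was studied. The results show that, in the purine-rich region of the SD sequence (roughly sites -7 to -12), the distance from the center of the G and T probability distribution curves to ATG (denoted LH) is clearly related to gene expression level. LH is about 10 for very highly expressed genes and about 8 for very lowly expressed genes. In addition, at site -1…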
Using the self-consistent information cluster method, we select 73 very highly expressed genes and 100 very lowly expressed genes from 1373 E. coli genes and analyze their base distribution at sites -1 to -21 upstream of the ATG start codon. We find that the base distribution at these sites (which include the SD region) is correlated with gene expression level. The distance from ATG to the center of the SD (Shine-Dalgarno) region, which corresponds to the position of maximum P(G) and minimum P(T), increases from 8 bases for very lowly expressed genes to 10 bases for very highly expressed genes. In addition, the probabilities of base T and base C at site -1 in very highly expressed genes differ from those in very lowly expressed genes.
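The position-wise base probabilities described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' procedure: the function names `base_probabilities` and `extreme_distance`, the search window, and the use of a simple maximum or minimum as a stand-in for the "center" of the distribution curve are all assumptions made for illustration, and the sequences in any usage are expected to be supplied by the reader.

```python
from collections import Counter

BASES = "ACGT"

def base_probabilities(upstream_seqs, length=21):
    """Return {position: {base: probability}} for positions -length..-1.

    Each sequence must hold the `length` bases immediately before ATG,
    written 5'->3', so index 0 is position -length and the last index is -1.
    """
    if not upstream_seqs:
        raise ValueError("need at least one sequence")
    if any(len(seq) != length for seq in upstream_seqs):
        raise ValueError(f"every sequence must have exactly {length} bases")
    probs = {}
    for i in range(length):
        counts = Counter(seq[i].upper() for seq in upstream_seqs)
        total = sum(counts[b] for b in BASES)  # ignore ambiguous symbols such as N
        pos = i - length  # maps index 0..length-1 to -length..-1
        probs[pos] = {b: (counts[b] / total if total else 0.0) for b in BASES}
    return probs

def extreme_distance(probs, base="G", window=(-15, -5), pick=max):
    """Distance from ATG to the site where P(base) is extreme inside the window.

    Assumption: the extreme (pick=max or pick=min) is used as a crude proxy
    for the center of the probability curve; the window is hypothetical.
    """
    lo, hi = window
    candidates = [p for p in sorted(probs) if lo <= p <= hi]
    best = pick(candidates, key=lambda p: probs[p][base])
    return -best
```

Calling `extreme_distance(probs, "G", pick=max)` and `extreme_distance(probs, "T", pick=min)` on each gene group would give rough counterparts of the two criteria named in the abstract; a smoothed or fitted curve center might be closer to the quantity LH, but that choice is left open here.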
Source
《内蒙古大学学报(自然科学版)》
CAS
CSCD
1998, No. 2, pp. 172-176 (5 pages)
Journal of Inner Mongolia University: Natural Science Edition
Funding
National Natural Science Foundation of China
Inner Mongolia Department of Education Fund