摘要
从天然碱基的36种性质参数出发,通过主成分分析(PCA)技术处理得到了1个显著的主成分得分,并将该得分作为单个碱基的信息描述子———VBPV。进而使用VBPV对38个大肠杆菌(E.coli)启动子序列一级结构进行表征,并结合多元统计方法将表征参数与转录启动强度(PS)成功地建立了定量序列活性模型(QSAM),该模型拟合复相关系数Rcum与交叉检验复相关系数Qcum分别为0.97和0.95。
Via a principle component analysis (PCA) disposal of 36 kinks of property parameters for natural bases, one principle component score is obtained and serves as information descriptor for single base, termed as principle component scores vector of base propertied variables (VBPV). Primary structure representation for 38 E. coil promoter sequences by VBPV descriptors is combined with multiple statistic methods and a quantitative sequence-activity model (QSAM) is given out between the resulting representation parameter and transcription promoter strength. This model is successful with its Rcum and Qcum respectively of 0.97 and 0.95.
出处
《化学通报》
CAS
CSCD
北大核心
2006年第6期465-468,共4页
Chemistry
基金
霍英东基金
国家"春晖计划"教育部启动基金
重庆应用基础研究基金及重庆大学自主创新基金资助项目
关键词
VBPV
定量序列活性模型
大肠杆菌启动子
碱基
Principle component scores vector of base propertied variables ( VBPV), Quantitative sequence activity model (QSAM), E. coli promoter, Base