摘要
因为研究分泌蛋白质有助于找到直接与特定生理或病理状态相关的生物分子,判断一条未知蛋白是否为分泌蛋白是非常重要的。基于同一类型蛋白质的哈斯矩阵图具有相似图像纹理假设,提取图像的几何矩作为伪氨基酸成分对未知蛋白质序列是否属于分泌蛋白进行预测,采用Jackknife算法进行测试,预测成功率与现有算法相比有很大的提高。
It is important to identify whether an uncharacterized protein sequence is a secretory proteins or not because secretory proteins are composed with signal peptides which are crucial tool in finding new drugs or reprogramming cells for gene therapy.Based on the assumption that proteins belonging to a same class must bear some sort of similar texture on the protein Hasse matrix images, geometric invariant moment factors derived from the image are used as the pseudo amino acid components to formulate the protein samples for statistical prediction.The success rates obtained on a previously constructed benchmark dataset are quite promising.
出处
《计算机工程与应用》
CSCD
北大核心
2011年第32期170-172,220,共4页
Computer Engineering and Applications
基金
国家自然科学基金No.60961003
教育部科学技术研究重点项目(No.210116)
江西省自然基金项目(No.2010GQS0127
No.2010GZS0122)
江西省教育厅科研项目(No.GJJ11557)~~
关键词
分泌蛋白
哈斯矩阵
模糊K近邻算法
Jackknife测试
secretory proteins
Hasse matrix
fuzzy K nearest neighbor algorithm
Jackknife cross-validation test