Prediction of Protein-Protein Interactions by a Novel Model Based on Domain Information

Prediction of Protein-Protein Interactions by a Novel Model Based on Domain Information

下载PDF

导出

摘要 Domain-based protein-protein interactions( PPIs) is a problem that has drawn the attentions of many researchers in recent years and it has been studied using lots of computational approaches from many different perspectives. Existing domain-based methods to predict PPIs typically infer domain interactions from known interacting sets of proteins. However,these methods are costly and complex to implement. In this paper, a simple and effective prediction model is proposed. In this model,an improved multiinstance learning( MIL) algorithm( MilCaA) is designed that doesn't need to take the domain interactions into consideration to construct MIL bags. Then, the pseudo-amino acid composition( PseAAC) transformation method is used to encode the instances in a multi-instance bag and the principal components analysis( PCA) is also used to reduce the feature dimension. Finally, several traditional machine learning and MIL methods are used to verify the proposed model. Experimental results demonstrate that MilCaA performs better than state-of-the-art techniques including the traditional machine learning methods which are widely used in PPIs prediction. Domain-based protein-protein interactions（ PPIs） is a problem that has drawn the attentions of many researchers in recent years and it has been studied using lots of computational approaches from many different perspectives. Existing domain-based methods to predict PPIs typically infer domain interactions from known interacting sets of proteins. However,these methods are costly and complex to implement. In this paper, a simple and effective prediction model is proposed. In this model,an improved multiinstance learning（ MIL） algorithm（ MilCaA） is designed that doesn＇t need to take the domain interactions into consideration to construct MIL bags. Then, the pseudo-amino acid composition（ PseAAC） transformation method is used to encode the instances in a multi-instance bag and the principal components analysis（ PCA） is also used to reduce the feature dimension. Finally, several traditional machine learning and MIL methods are used to verify the proposed model. Experimental results demonstrate that MilCaA performs better than state-of-the-art techniques including the traditional machine learning methods which are widely used in PPIs prediction.

作者 DONG Lulu XIE Fei ZHANG Cheng LI Bin 董露露;谢飞;章程;李斌(Center of Anhui Continuing Education Online,Anhui Radio and TV University;School of Computer Science and Technology,Hefei Normal University;College of Computer Science and Technology,Anhui University)

机构地区 Center of Anhui Continuing Education Online School of Computer Science and Technology College of Computer Science and Technology

出处《Journal of Donghua University(English Edition)》 EI CAS 2018年第2期163-169,共7页 东华大学学报（英文版）

基金 National Natural Science Foundations of China(Nos.61503116,61402007) Foundation for Young Talents in the Colleges of Anhui Province Committee,China(No.2013SQRL097ZD) Natural Science Foundation of Anhui Educational Committee,China(No.KJ2014A198) Natural Science Foundation of Anhui Province,China(No.1408085QF108)

关键词 domain-based PROTEIN-PROTEIN interactions (PPIs) multi-instance learning AMINO acid composition ( AAC) pseudo-amino acidcomposition (PseAAC) domain-based protein-protein interactions （PPIs） multi-instance learning amino acid composition (AAC） pseudo-amino acidcomposition （PseAAC）

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献2

1张铃,张钹.M-P神经元模型的几何意义及其应用[J].软件学报,1998,9(5):334-338. 被引量：135
2Yanping Zhang,Heng Zhang,Huazhen Wei,Jie Tang,Shu Zhao.Multiple-Instance Learning with Instance Selection via Constructive Covering Algorithm[J].Tsinghua Science and Technology,2014,19(3):285-292. 被引量：2

二级参考文献29

1张铃,张钹.多层反馈神经网络的FP学习和综合算法[J].软件学报,1997,8(4):252-258. 被引量：24
2T. G. Dietterich, R. H. Lathrop, and T. Lozano-Perez, Solving the multiple instance problem with axis-parallel rectangles, Artificial Intelligence, vol. 89, pp. 31-71, 1997.
3A. Zafra, M. Pechenizkiy, and S. Ventura, ReliefF-MI: An extension of ReliefF to multiple instance learning, Neurocomputing, vol, 75, pp Y. X. Chert, J. B. Bi, and J. 210-218, 2012.
4Z. Wang, MILES: Multiple- instance learning via embedded instance selection, IEEE Transaction Pattern Analysis and Machine Intelligence, vol. 28, pp. 1931-1947, 2006.
5X. E Song, L. C. Jiao, S. Y. Yang, X. R. Zhang, and E H. Shang, Sparse coding and classifier ensemble based multi-instance learning for image categorization, Signal Processing, vol. 93, pp. 1-11, 2013.
6Y. X. Chen and J. Z. Wang, Image categorization by learning and reasoning with regions, Journal of Machine Learnhtg Research, vol. 5, pp. 913-939, 2004.
7S. Andrews, I. Tsochantaridis, and T. Hofmann, Support vector machines for multiple-instance learning, in Advance in Neutral Information Processing System 15, 2003, pp. 561-568.
8P. Viola, J. Platt, and C. Zhang, Multiple instance boosting for object detection, in Advance hz Neutral Infotvtation Processing System 18, 2006, pp.1419-1426.
9O. Maron and T. Lozano-Perez, A framework for multiple- instance learning, in Advance in Neutral Information Processing System 10, 1998, pp. 570-576.
10Q. Zhang and S. A. Goldman, EM-DD: An improved multi-instance learning technique, in Advance #1 Neutral Information Processing System 14, 2002, pp. 1073-1080.

共引文献135

1段震,姚芳兵,张铃.基于构造性学习方法的车牌定位[J].微机发展,2004,14(8):41-43. 被引量：2
2张燕平,张铃,吴涛,徐锋,张,王伦文.基于覆盖的构造性学习算法SLA及在股票预测中的应用[J].计算机研究与发展,2004,41(6):979-984. 被引量：18
3段震,鲁杰,张铃.基于交叉覆盖神经网络的车牌识别研究[J].安徽大学学报（自然科学版）,2004,28(5):11-14. 被引量：7
4赵姝,张燕平,张媛,陈传明.基于交叉覆盖算法的改进算法——核平移覆盖算法[J].微机发展,2004,14(11):1-3. 被引量：6
5黄国宏,邵惠鹤.一种新的基于神经网络覆盖分类算法[J].中国图象图形学报（A辑）,2004,9(10):1165-1168. 被引量：6
6张燕平,张铃,段震.构造性核覆盖算法在图像识别中的应用[J].中国图象图形学报（A辑）,2004,9(11):1304-1308. 被引量：17
7阚涛,娄天玲.基于交叉覆盖算法的模糊神经网络在车用发电机故障诊断系统中的应用研究[J].安徽电子信息职业技术学院学报,2005,4(1):76-77.
8钱峰,张蕾,赵姝.基于粗糙集的交叉覆盖算法[J].铜陵学院学报,2004,3(4):70-71.
9毛军军,吴涛,郑婷婷,张铃.基于商空间的构造性分层竞争网络算法[J].微机发展,2005,15(4):37-39. 被引量：2
10唐理兵,倪志伟,李学俊,马猛.基于交叉覆盖设计算法的空间分类挖掘[J].微机发展,2005,15(4):43-45.

1Liang Liu,Peng Chen,Min Wang,Xueyan Li,Jiuyu Wang,Maolu Yin,王艳丽.C2c1-sgRNA复合物结构揭示RNA介导的C2c1切割DNA的机制[J].科学新闻,2018,0(4):83-83.
2帅丹.“宇舶爱艺术”展览闪亮成都[J].优雅,2018,0(6):108-108.
3刘欢,顾小萍.条件性恐惧记忆相关基因的生物信息学分析[J].中华行为医学与脑科学杂志,2017,26(12):1076-1080. 被引量：2

Journal of Donghua University(English Edition)

2018年第2期

浏览历史

内容加载中请稍等...

Prediction of Protein-Protein Interactions by a Novel Model Based on Domain Information

参考文献2

二级参考文献29

共引文献135

相关作者

相关机构

相关主题

浏览历史