期刊文献+

基于机器学习方法的中药化学成分作用靶点的识别研究 被引量:2

Targets identification for the TCM chemical composition by machine learning methods
原文传递
导出
摘要 与有效成分相作用靶点的识别是从分子水平阐明中药作用机理的关键步骤。本文提出1种新的思路来解决中药化学成分的靶点识别问题。该方法基于小分子化合物的结构,计算表征分子组成、电荷分布、拓扑、几何结构及物理化学性质的分子描述符,经BestFirst搜索策略和CfsSubsetEval评估策略相结合的方法筛选出与靶点作用活性相关的分子描述符。采用径向基神经网络、朴素贝叶斯和随机森林3种机器学习方法构建一系列靶点的识别模型,后期将所建模型整合成靶点识别系统,进而预测中药有效成分的作用靶点。采用10折交叉验证,3种方法得到总的预测正确率分别为83.33%~95.71%、84.62%~96.43%、82.14%~95.59%,识别过程在(0.02~0.19)秒完成。实验结果证明该方法不但简单有效,更主要的是满足面向中药化学成分的靶点识别任务对辨识效率的要求。 Revealing the interactions between target proteins and chemical compositions of Traditional Chinese Medicine (TCM) is beneficial to understand the substances and mechanisms of TCM at a molecular level. The problem of identifying the targets of TCM chemical constituents is a cornerstone challenge in constructing the TCM chemical constituent-target interaction networks. It is therefore necessary to develop improved methods that are effective, fast and scalable. Here a new method is proposed for the identification of targets for TCM chemical constituents. Main steps of this method are as follows: Firstly calculate molecular descriptors including constitutional, charge distribution, topological, geometrical, and physicochemical descriptors to characterize the molecular structure of small molecular drugs, and then select the molecular descriptors relevant to the activity of the drug-target interactions, using the approach combing BestFirst and CfsSubsetEval, further establish targets identification models by three machine learning methods. The models were validated through 10:fold cross-validation (CV) and the total prediction accuracies were 83.33%-95.71%, 84.62%-96.43%, 82.14%-95.59%, accomplishing in 0.02~0.19 second, The final results reveal that the proposed method is reliable, and overcomes the efficiency problem of targets identification. This paper therefore suggests that the proposed method used in the research of targets identification may be a good strategy in exploring the molecular mechanism of TCM.
出处 《计算机与应用化学》 CAS CSCD 北大核心 2010年第9期1201-1204,共4页 Computers and Applied Chemistry
基金 国家自然科学基金(30973946) 国家973计划(2006CB504703)资助项目
关键词 中药化学成分 机器学习方法 靶点识别 TCM chemical compositions, machine learning methods, targets identification
  • 相关文献

参考文献1

二级参考文献9

  • 1Bartus, R. T.; Dean, R. L.; Beer, B.; Lippa, A. S. Science 1982, 217, 408.
  • 2Avery, E. E.; Baker, L. D.; Asthenia, S. Drugs Aging 1997,11,450.
  • 3Martin, Y.; Burse, M.; Dahaner, E.; DeLazzer, J.; Lico, I.;Pavlik, P. J. Comput.-Aided Mol. Des. 1993, 7, 83.
  • 4SYBYL, Computer Program, Version 6.4, Tripos Associates Inc., 1699 South Hanley Rd., St. Louis, MO 63144-2913,USA, 1997.
  • 5Ward, J. S.; Merritt, L.; Calligaro, D. O.; Bymaster, F. P.;Shannon, H. E.; Mitch, C. H.; Whitesitt, C.; Brunsting, D.;Sheardown, M. J.; Olesen, P. H.; Swedberg, M. D. B.;Jeppesen, L.; Sauerberg, P. J. Med. Chem. 1998, 41,379.
  • 6Ward, J. S.; Merritt, L.; Klimkowski, V. J.; Lamb, M. L.;Mitch, C. H.; Bymaster, F. P.; Sawyer, B. D.; Shannon, H.E.; Olesen, P. H.; Honoré, T.; Sheardown, M. J.; Sauerberg,P. J. Med. Chem. 1992, 35, 4011.
  • 7Ward, J. S.; Merritt, L.; Calligaro, D. O.; Bymaster, F. P.;Shannon, H. E.; Sawyer, B. D.; Mitch, C. H.; Deeter, J. B.;Peters, S. C.; Sheardown, M. J.; Olesen, P. H.; Swedberg,M. D. B.; Sauerberg, P. J. Med. Chem. 1995, 38, 3469.
  • 8Cha, J. H.; Cho, Y. S., Pae, A. N.; Koh, H. Y.; Jeong, D.;Kong, J. Y.; Lee, E.; Choi, K. I. Bioorg. Med. Chem. Lett.2001, 11, 2855.
  • 9Sauerberg, P.; Olesen, P. H.; Nielsen, S.; Treppendahl, S.;Sheardown, M. J.; Honor6, T.; Mitch, C. H.; Ward, J. S.;Pike, A. J.; Bymaster, F. P.; Sawyer, B. D.; Shannon, H. E.J. Med. Chem. 1992, 35, 2274.

共引文献2

同被引文献9

引证文献2

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部