期刊文献+

一种创新的抗乳腺癌药物分子特征筛选模型

An I nnovative S creening M odel for M olecular C haracteristics of A nti-breast C ancer D rugs
下载PDF
导出
摘要 据不完全统计显示,2020年全球乳腺癌新增人数达到226万,女性乳腺癌是最常见的癌症类型,死亡率高居第五,因此对乳腺癌的治疗研究变得愈发重要。对雌激素受体α亚型(ERα)的研究显示其在乳腺发育过程中扮演重要角色。本文收集作用于ERα的化合物及其生物活性数据,并以一系列分子结构描述符作为自变量和以化合物的生物活性值作为因变量,通过随机森林与梯度提升树并融合专家知识来构建分子筛选模型,筛选出前20个对生物活性最具有显著影响的分子描述符。这些分子描述符对指导已有活性化合物的结构优化和药物研究具有重要意义。 According to incomplete statistics,the number of new breast cancers worldwide in 2020 will reach 2.26 million.Female breast cancer is the most common type of cancer,with the fifth highest mortality rate.Therefore,research on breast cancer treatment has become increasingly important.Research on the estrogen receptor alpha subtype(ERα)has shown that it plays an important role in the development of the breast.This paper collects data on the compounds and their biological activities that act on ERα,and uses a series of molecular structure descriptors as independent variables and the biological activity values of the compounds as dependent variables.Through random forest and GBDT and fusion of expert knowledge to construct the quantitative structure-activity relationship of the compound,the top 20 molecular descriptors with the most significant impact on biological activity are screened out.These molecular descriptors are of great significance for guiding the structural optimization and drug research of existing active compounds.
作者 郑剑国 ZHENG Jianguo(College of Electronics and Information Engineering,Tongji University,Shanghai,China,201804)
出处 《福建电脑》 2022年第3期1-4,共4页 Journal of Fujian Computer
关键词 乳腺癌 特征筛选 随机森林 梯度提升树 Breast Cancer Feature Selection Random Forest Gradient Boosting Decision Tree
  • 相关文献

参考文献2

二级参考文献50

  • 1刘微,罗林开,王华珍.基于随机森林的基金重仓股预测[J].福州大学学报(自然科学版),2008,36(S1):134-139. 被引量:8
  • 2林成德,彭国兰.随机森林在企业信用评估指标体系确定中的应用[J].厦门大学学报(自然科学版),2007,46(2):199-203. 被引量:36
  • 3陈友,程学旗,李洋,戴磊.基于特征选择的轻量级入侵检测系统[J].软件学报,2007,18(7):1639-1651. 被引量:78
  • 4Breiman L. Bagging Preditors [J].Machine Learning, 1996,24(2).
  • 5Dietterich T. An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting and Randomization [J].Machine Learning, 2000,40(2).
  • 6Ho T K. The Random Subspace Method for Constructing Decision Forests [J].Trans. on Pattern Analysis and Machine Intelligence, 1998,20 (8).
  • 7Amit Y, Gernan D. Shape Quantization and Recognition with Randomized Trees[J]. Neural Computation, 1997,9(7). Breiman L Random Forest[J]. Machine Learning, 2001,45(1).
  • 8Breiman L. Random Forests[J]. Machine Learning, 2001,45(1).
  • 9Tibshirani tL Bias, Variance, and Prediction Error for Classification Rules[C]. Technical Report, Statistics Department, University of Toronto, 1996.
  • 10Wolpert D H, Macready W G. An Efficient Method to Estimate Bagging's Generalization Error[J]. Machine Learning, 1999,35(1).

共引文献667

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部