期刊文献+

基于Lasso特征选择乳腺癌二分类算法研究

A Study on the Lasso Feature-based Selection Algorithm for Breast Cancer Binary Classification
下载PDF
导出
摘要 近年来,随着大数据挖掘技术在医疗行业的迅速发展,临床精准治疗成为医疗大数据领域的研究热点。基于UCI数据库中乳腺癌数据集,通过构建乳腺癌二分类算法来预测乳腺肿瘤类型。其中针对不平衡数据集的处理、特征选择算法的优化以及分类准确率的评估,使用了机器学习技术包括随机过采样算法、Least absolute shrinkage and selection operator(Lasso)回归进行特征选择、序列前向选择(SFS)的特征选择算法。结果表明包含其中的6个特征的随机森林算法分类准确率最高(97.07%),相对于未进行特征选择算法的准确率有所提高,有可能在乳腺癌检测方面提供新的思路。 In recent years,with the rapid development of big data mining technology in the medical industry,clinical precision therapy has become a research hotspot in the field of medical big data.In this study,based on the breast cancer dataset in the UCI database,a breast cancer dichotomous classification algorithm was constructed to predict breast tumour types.Among them,machine learning techniques including random oversampling algorithm,Least absolute shrinkage and selection operator(Lasso)regression for feature selection,and sequential forward selection(SFS)for feature selection algorithm were used for the processing of imbalanced dataset,optimisation of feature selection algorithm and evaluation of classification accuracy.The results showed that the random forest algorithm containing six of these features had the highest classification accuracy(97.07%),which improved the accuracy relative to the algorithm without feature selection and could potentially provide new ideas in breast cancer detection.
作者 冯欣 张航 辛瑞昊 FENG Xin;ZHANG Hang;XIN Ruihao(School of Mathematics and Science,Jilin Institute of Chemical Technology,Jilin City 132022,China;School of Information and Control Engineering,Jilin Institute of Chemical Technology,Jilin City 132022,China)
出处 《吉林化工学院学报》 CAS 2023年第1期23-28,共6页 Journal of Jilin Institute of Chemical Technology
基金 吉林省教育厅科学研究重点项目(1KH20220245K1) 吉林省教育厅人文社科研究项目(JIKH 20220226SK) 吉林省高教科研课题(JGJX2021D226) 吉林省自然科学基金(YDZJ202301ZYTS288) 吉林省自然科学基金(YDZJ202301ZYTS401)。
关键词 乳腺癌 Lasso SFS breast cancer lasso SFS
  • 相关文献

参考文献13

二级参考文献63

共引文献78

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部