期刊文献+

拉曼光谱结合机器学习算法对乳腺癌及其分子亚型的识别

Identification of breast cancer and its molecular sub-types via Raman spectroscopy combined with machine learning algorithms
原文传递
导出
摘要 目的开发一种简单、快速且便捷的检测方法用于乳腺癌及其分子亚型的识别。方法采用激光共聚焦拉曼仪采集乳腺正常细胞与乳腺癌不同分子亚型细胞的拉曼光谱图, 对其拉曼光谱峰进行物质归属。先选用Savitzky-Golay平滑(窗口大小为9)对拉曼光谱图进行平滑和去噪处理, 后采用迭代自适应加权惩罚最小二乘法对其进行基线矫正, 并通过主成分分析剔除异常值。借助偏最小二乘判别分析(PLS-DA)、K最邻近(KNN)和支持向量机(SVM)3种原理不同的算法建立乳腺正常细胞与乳腺癌细胞的识别模型及乳腺癌不同分子亚型细胞的识别模型。结果乳腺正常细胞与乳腺癌细胞的谱图形状以及拉曼光谱峰位移相似, 但强度却存在较大差异。机器学习的模型结果显示, PLS-DA和SVM算法对乳腺正常细胞与乳腺癌细胞区分的识别准确度分别在92.03%、90.67%以上。PLS-DA和SVM算法对乳腺癌不同分子亚型细胞的识别准确度分别为(83.66±2.77)%、(90.55±0.06)%。结论拉曼光谱结合机器学习算法可实现乳腺正常细胞与乳腺癌细胞、不同分子亚型的乳腺癌细胞的准确识别。 ObjectiveTo develop a simple,rapid,and convenient analysis method for the identification of breast cancer and its molecular sub-types.MethodsA laser confocal Raman spectrometer was used to collect Raman spectrograms of normal breast cells and different molecular sub-types of breast cancer cells,and assign the material origin of the Raman spectral peaks.First,Savitzky-Golay smoothing(with a window size of 9)was selected to perform smoothing and denoising on the Raman spectrogram.Subsequently,an iterative adaptive weighted penalty least squares method was employed for baseline correction,and principal component analysis was used to eliminate outliers.The recognition model of normal breast cells and breast cancer cells and the recognition model of different molecular sub-types of breast cancer cells were established by using three algorithms with different principles,including partial least squares discriminant analysis(PLS-DA),K-nearest neighbor(KNN),and support vector machine(SVM).ResultsThe Raman spectrogram and Raman peak shifts of normal breast cells and breast cancer cells were similar,but there were significant differences in intensity.The results of the machine learning models showed that the recognition accuracy of PLS-DA and SVM algorithms for distinguishing between normal breast cells and breast cancer cells was above 92.03%and 90.67%,respectively.The recognition accuracy of PLS-DA and SVM algorithms for different molecular sub-types of breast cancer cells was(83.66±2.77)%and(90.55±0.06)%,respectively.ConclusionsThe combination of Raman spectroscopy and machine learning algorithms can achieve accurate identification of normal breast cells,breast cancer cells,and different molecular sub-types of breast cancer cells.
作者 李娟 杨超 唐佳怡 夏静静 刘皓君 祖丽胡玛·艾合买提 蔡昕恩 阿依提拉·麦麦提江 Li Juan;Yang Chao;Tang Jiayi;Xia Jingjing;Liu Haojun;Zulhumar·Ahmat;Cai Xin’en;Ayitila·Maimaitijiang(School of Pharmaceutical Sciences and Institute of Materia Medica,Xinjiang University,Urumqi 830017,China;Department of Emergency,General Hospital of Xinjiang Military Region of the Chinese People’s Liberation Army,Urumqi 830000,China)
出处 《国际生物医学工程杂志》 CAS 2024年第3期219-226,共8页 International Journal of Biomedical Engineering
基金 新疆维吾尔自治区青年科学基金(2023D01C201) 新疆维吾尔自治区重点研发计划(2023B03002-1) 新疆维吾尔自治区级大学生创新训练计划(S202310755084)。
关键词 拉曼光谱 乳腺癌 分子亚型 机器学习 Raman spectroscopy Breast cancer Molecular sub-type Machine learning
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部