A model for diagnosing TCM cold and heat patterns based on random forest algorithm
Abstract
Objective To construct diagnostic models for the cold and heat patterns of traditional Chinese medicine (TCM) from the perspective of symptoms and signs, providing a basis for standardizing cold-heat pattern identification.
Methods Symptoms related to "cold" and "heat" were screened separately from the pattern element-symptom data table constructed in "Study on Pattern Standardization and Pattern Identification System" (《证候规范与辨证方法体系的研究》). The top 15 symptoms were selected by random forest feature screening. The data were randomly divided into 10 parts and split into a training set and a test set at a ratio of 7∶3. After resampling, random forest models for the cold pattern and the heat pattern were built separately with the best parameters. The area under the receiver operating characteristic (ROC) curve (AUC), sensitivity, and specificity served as the evaluation metrics.
Results The key characteristic variables of the cold pattern were tight floating pulse, aversion to cold, absence of sweating, white tongue coating, pain relieved with warmth, cold pain, pale tongue, aversion to cold with fever, absence of thirst, body pain, headache, greasy coating, poor appetite, loose stool, and cold limbs. The cold-pattern model had an AUC of 0.912, a specificity of 0.89, and a sensitivity of 0.80. The key characteristic variables of the heat pattern were yellow coating, thirst, slippery rapid pulse, fever, high fever, rapid pulse, dark urine, red tongue, wiry rapid pulse, bitter taste in the mouth, greasy coating, crimson tongue, brown urine, vexation, and headache. The heat-pattern model had an AUC of 0.891, a specificity of 0.85, and a sensitivity of 0.86.
Conclusion With variable screening and the random forest algorithm, models for identifying cold and heat patterns were established and showed good classification performance. They can serve as a methodological reference for standardized pattern identification.
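The three evaluation metrics named in the Methods (AUC, sensitivity, specificity) can be computed directly from a model's predicted labels and scores. The following is a minimal Python sketch, assuming binary labels (1 = pattern present, 0 = absent); the function names and the toy data are illustrative assumptions and do not come from the paper.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int],
                            y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels, 1 = pattern present."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    return tp / (tp + fn), tn / (tn + fp)


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: four cases with the pattern, four without.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

sens, spec = sensitivity_specificity(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(sens, spec, auc)  # 0.75 0.75 0.8125
```

Sensitivity and specificity depend on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is one reason studies like this one report all three.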
Authors Shu Chenjie (舒琛洁); Liang Hao (梁浩); Wang Yun (王耘) (School of Chinese Materia Medica, Beijing University of Chinese Medicine, Beijing 102488, China)
Source Journal of Beijing University of Traditional Chinese Medicine (《北京中医药大学学报》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2021, No. 6, pp. 538-543 (6 pages)
Funding National Natural Science Foundation of China, General Program (No. 81973495).
Keywords random forest algorithm; model for diagnosis; pattern elements