期刊文献+

基于多特征属性相似的糖尿病早期预测方法 被引量:1

Diabetes Early Prediction Method Based on Multi-feature Attribute Similarity
下载PDF
导出
摘要 考虑样本数据集的差异性和相关性对疾病预测结果有着直接影响,提出一种基于多特征属性患者相似的糖尿病早期预测方法,根据患者之间特征具有相似性这一特点,对患者特征进行混合属性相似预分组,再把分组结果导入随机森林分类器进行疾病预测。首先以临床概念作为患者的特征项,通过聚类定量化分析不同特征属性类型间的距离来度量患者之间的混合相似度,根据患者混合相似度将患者集预分组为多个患者相似组。最后以随机森林分类器对相似组进行细分类,得到最终的疾病预测结果,该结果与基于全样本数据的随机森林分类结果相比,分类准确率提高了8.3%;与基于单一属性相似组的随机森林分类结果相比,分类准确率提高了5.1%。结果表明:所提方法具有较高的预测准确率,可为糖尿病诊断预测提供支持。 Considering that the difference and correlation of sample data sets has a direct impact on disease prediction results,a method for early prediction of diabetes based on the similarity of patients with multi-feature attributes was proposed.According to the characteristics of the similarity between patients,the characteristics of patients were mixed.The attributes were similar to pre-grouping,and then the grouping results were imported into the random forest classifier for disease prediction.Firstly,the clinical concept was used as the patient's feature item,and the distance between different feature attribute types was measured by clustering and quantitative analysis to measure the mixed similarity between patients,and the patient set was pre-grouped into multiple patient similar groups according to the mixed similarity of patients.Finally,a random forest classifier was used to subdivide the similar groups to obtain the final disease prediction result.Compared with the random forest classification result based on the full sample data,the classification accuracy is increased by 8.3%.Compared with the random forest classification results based on a single attribute similarity group,the classification accuracy rate is increased by 5.1%.The results show that the proposed method has a high prediction accuracy rate and can provide support for the diagnosis and prediction of diabetes.
作者 乔瀚 容芷君 许莹 但斌斌 赵慧 QIAO Han;RONG Zhi-jun;XU Ying;DAN Bin-bin;ZHAO Hui(Department of Industrial Engineering, Wuhan University of Science and Technology, Wuhan 430081, China;Wuhan Fifth Hospital, Wuhan 430050, China)
出处 《科学技术与工程》 北大核心 2021年第36期15497-15502,共6页 Science Technology and Engineering
基金 武汉市科技局企业技术创新项目(201901070211288)。
关键词 患者相似性 特征属性 聚类 分类 糖尿病预测 patient similarity characteristic attribute clustering classification diabetes prediction
  • 相关文献

参考文献9

二级参考文献51

  • 1中国成人血脂异常防治指南制订联合委员会[J].中国成人血脂异常防治指南,2007,35(5):391-392.
  • 2孙吉贵,刘杰,赵连宇.聚类算法研究[J].软件学报,2008(1):48-61. 被引量:1072
  • 3陈黎飞,姜青山,王声瑞.基于层次划分的最佳聚类数确定方法[J].软件学报,2008,19(1):62-72. 被引量:82
  • 4Anderson KM, Wilson PW, Odell PM, et al. An updated coronary risk profile. A statement for health professionals . Circulation, 1991,83 ( l ) :356-362.
  • 5Wilson PW, D'Agostino RB, Levy D, et al. Prediction of coronary heart disease using risk factor categories. Circulation, 1998,97 (18) : 1837-1847.
  • 6Executive Summary of The Third Report of The National Cholesterol Education Program (NCEP) Expert Panel on Detection,Evaluation, And Treatment of High Blood Cholesterol In Adults ( Adult Treat- ment Panel III). JAMA,2001,285(19) :2486-2497. W.
  • 7u Y, Liu X, Li X, et al. Estimation of 10-year risk of fatal and non- fatal ischemic cardiovascular diseases in Chinese adults. Circulation, 2006,114(21 ) :2217-2225.
  • 8D'Agostino RS, Vasan RS, Pencina MJ, et al. General cardiovascular risk profile for use in primary care: the Framingham Heart Study . Circulation, 2008,117 ( 6 ) :743-753.
  • 9Report of the National Cholesterol Education Program Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults. The Expert Panel. Arch Intern Med, 1988,148 ( 1 ) : 36 -69.
  • 10The 1984 Report of the Joint National Committee on Detection, Eval- uation, and Treatment of High Blood Pressure . Arch Intern Med, 1984,144 (5) : 1045-1057.

共引文献1570

同被引文献9

引证文献1

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部