Argumentative Comparative Analysis of Machine Learning on Coronary Artery Disease (cited by: 1)

Abstract: Cardiovascular disease (CVD) is a leading cause of death across the globe. Approximately 17.9 million people die each year from CVD, which accounts for 31% of all deaths worldwide. Coronary Artery Disease (CAD) is a common type of CVD and is considered fatal. Predictive models that use machine learning algorithms may assist health workers in the timely detection of CAD, which ultimately reduces mortality. The main purpose of this study is to build a predictive model that provides doctors and health care providers with personalized information so that they can implement better and more personalized treatments for their patients. In this study, we use the publicly available Z-Alizadeh Sani dataset, which contains random samples of 216 cases with CAD and 87 normal controls, with 56 different features. The binary variable "Cath", which represents case-control status, is used as the target variable. We study its relationship with the other predictors and develop classification models using five supervised machine learning algorithms: Logistic Regression (LR), Classification Tree with Bagging (Bagging CART), Random Forest (RF), Support Vector Machine (SVM), and K-Nearest Neighbors (KNN). These five classification models are used to investigate the detection of CAD. Finally, the performance of the machine learning algorithms is compared, and the best model is selected. Our results indicate that the SVM model predicts the presence of CAD more effectively and accurately than the other models, with an accuracy of 0.8947, sensitivity of 0.9434, specificity of 0.7826, and AUC of 0.8868.
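The accuracy, sensitivity, and specificity quoted in the abstract follow the usual confusion-matrix definitions. The Python sketch below illustrates those definitions only. The abstract does not report the underlying counts, so the counts used here (50 true positives, 3 false negatives, 18 true negatives, 5 false positives) are an assumption, chosen because they reproduce the reported figures to four decimal places. The function name `classification_metrics` is hypothetical and is not taken from the paper.

```python
def classification_metrics(tp, fn, tn, fp):
    """Compute accuracy, sensitivity, and specificity from confusion-matrix counts.

    tp: cases correctly predicted as CAD
    fn: cases incorrectly predicted as normal
    tn: controls correctly predicted as normal
    fp: controls incorrectly predicted as CAD
    """
    total = tp + fn + tn + fp
    return {
        "accuracy": (tp + tn) / total,      # share of all predictions that are correct
        "sensitivity": tp / (tp + fn),      # share of true CAD cases that are detected
        "specificity": tn / (tn + fp),      # share of true controls that are cleared
    }


# ASSUMED counts: not given in the abstract, picked only because they are
# consistent with the reported accuracy, sensitivity, and specificity.
metrics = classification_metrics(tp=50, fn=3, tn=18, fp=5)

for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

AUC is not covered by this sketch, because it depends on the ranking of predicted scores rather than on a single confusion matrix.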
Authors: Keshab R. Dahal; Yadu Gautam (Department of Statistics, Truman State University, Kirksville, MO, USA; Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA)
Source: Open Journal of Statistics, 2020, Issue 4, pp. 694-705 (12 pages)
Keywords: Machine Learning; Classification; Model Comparison; Coronary Artery Disease; Data Mining