
Impact of preprocessing on medical data classification (Cited by: 1)

Abstract: The significance of the preprocessing stage in any data mining task is well known. Before attempting medical data classification, characteristics of medical datasets, including noise, incompleteness, and the existence of multiple and possibly irrelevant features, need to be addressed. In this paper, we show that selecting the right combination of preprocessing methods has a considerable impact on the classification potential of a dataset. The preprocessing operations considered include the discretization of numeric attributes, the selection of attribute subset(s), and the handling of missing values. The classification is performed by an ant colony optimization algorithm as a case study. Experimental results on 25 real-world medical datasets show that a significant relative improvement in predictive accuracy, exceeding 60% in some cases, is obtained.
Source: Frontiers of Computer Science (SCIE, EI, CSCD), 2016, Issue 6, pp. 1082-1102 (21 pages). Chinese title: 中国计算机科学前沿(英文版)
Keywords: classification, ant colony optimization, medical data classification, preprocessing, feature subset selection, discretization
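Illustrative sketch (not from the paper): the abstract names missing-value handling and discretization of numeric attributes among the preprocessing operations, and reports a relative improvement in predictive accuracy. The minimal Python below shows one simple form of each. Mean imputation, equal-width binning, the formula (after - before) / before, and all function names and sample values are assumptions chosen for illustration; the abstract does not say which specific methods or which definition of relative improvement the authors used. Feature subset selection is omitted.

```python
from statistics import mean


def impute_mean(values):
    """Replace None entries with the mean of the observed values (assumed strategy)."""
    observed = [v for v in values if v is not None]
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def equal_width_bins(values, n_bins):
    """Map each numeric value to a bin index in 0..n_bins-1 using equal-width bins."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: everything falls into the first bin.
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value would land on index n_bins, so clamp it into the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def relative_improvement(acc_before, acc_after):
    """Relative change in accuracy, assumed here to be (after - before) / before."""
    return (acc_after - acc_before) / acc_before


# Hypothetical numeric attribute with one missing value.
ages = [30, None, 50, 70]
filled = impute_mean(ages)
bins = equal_width_bins(filled, 2)

# Hypothetical accuracies before and after preprocessing.
gain = relative_improvement(0.5, 0.8)
```

Under this assumed definition, a classifier that moves from 0.5 to 0.8 accuracy shows a relative improvement of about 60%, the order of magnitude the abstract reports for its best cases.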