Comparison of Attribute Reduction Methods for Coronary Heart Disease Data by Decision Tree Classification

Abstract: Attribute reduction is necessary in decision-making systems, and selecting the right attribute reduction method is even more important. This paper studies the reduction effects of principal components analysis (PCA) and system reconstruction analysis (SRA) on coronary heart disease data. The data set contains 1723 records with 71 attributes each. PCA and SRA are used to reduce the number of attributes in the data set to fewer than 71. Three decision tree algorithms, C4.5, classification and regression tree (CART), and chi-square automatic interaction detector (CHAID), are then applied to both the raw data and the attribute-reduced data. The resulting trees are compared by number of internal nodes, maximum tree depth, number of leaves, and correct classification rate. The results indicate that both PCA and SRA can accomplish attribute reduction, and that decision making on the reduced data is faster than on the raw data; the reduction effect of PCA is better than that of SRA, while the attribute assertion of SRA is better than that of PCA. Both methods perform well in selecting and reducing attributes.
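The four tree parameters named in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Node` class, the helper functions, the toy tree, and the four sample rows are hypothetical and are not taken from the paper, and "correction rate" is read here as the fraction of records classified correctly.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Node:
    """Binary decision-tree node (hypothetical structure); a leaf carries a label."""
    feature: Optional[int] = None      # index of the attribute tested at this node
    threshold: Optional[float] = None  # go left when row[feature] <= threshold
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    label: Optional[int] = None        # class label, set on leaves only

    def is_leaf(self) -> bool:
        return self.label is not None


def internal_nodes(node: Node) -> int:
    """Count the nodes that test an attribute."""
    if node.is_leaf():
        return 0
    return 1 + internal_nodes(node.left) + internal_nodes(node.right)


def leaves(node: Node) -> int:
    """Count the terminal nodes."""
    if node.is_leaf():
        return 1
    return leaves(node.left) + leaves(node.right)


def max_depth(node: Node) -> int:
    """Longest root-to-leaf path, counted in edges."""
    if node.is_leaf():
        return 0
    return 1 + max(max_depth(node.left), max_depth(node.right))


def predict(node: Node, row: Sequence[float]) -> int:
    """Follow the attribute tests down to a leaf and return its label."""
    while not node.is_leaf():
        node = node.left if row[node.feature] <= node.threshold else node.right
    return node.label


def correct_rate(tree: Node, rows, labels) -> float:
    """Fraction of rows whose predicted label matches the true label."""
    hits = sum(predict(tree, r) == y for r, y in zip(rows, labels))
    return hits / len(labels)


# Toy tree over two made-up attributes (not the coronary heart disease data).
toy_tree = Node(
    feature=0, threshold=0.5,
    left=Node(label=0),
    right=Node(feature=1, threshold=2.0, left=Node(label=1), right=Node(label=0)),
)
toy_rows = [[0.2, 1.0], [0.9, 1.5], [0.8, 3.0], [0.1, 5.0]]
toy_labels = [0, 1, 0, 1]

print(internal_nodes(toy_tree), leaves(toy_tree), max_depth(toy_tree))
print(correct_rate(toy_tree, toy_rows, toy_labels))
```

Computing the same four numbers for a tree grown on the raw attributes and for one grown on the PCA- or SRA-reduced attributes would give the kind of side-by-side comparison the abstract describes.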
Source: Transactions of Tianjin University (天津大学学报, English edition), 2005, No. 6, pp. 463-468 (6 pages); indexed in EI and CAS.
Funding: Supported by the Ministry of Education of China (No. 02038), the Asian Research Center of Nankai University (No. AS0405), and the Tianjin Higher Education Science Development Fund (No. 20030621).
Keywords: coronary heart disease; principal components analysis (PCA); system reconstruction analysis (SRA); attribute reduction; decision tree

References (4)

  • 1 Quinlan J R. Induction of decision trees[J]. Machine Learning, 1986(1).
  • 2 Kass G V. An exploratory technique for investigating large quantities of categorical data[J]. Applied Statistics, 1980.
  • 3 Martínez-Muñoz G, Suárez A. Using all data to generate decision tree ensemble[J]. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 2004.
  • 4 Klir G J. Identification of generative structures in empirical data[J]. International Journal of General Systems, 1976.
