期刊文献+

统计独立性的离散化新方法 被引量:1

A NOVEL DISCRETIZATION METHOD FOR STATISTICAL INDEPENDENCE
下载PDF
导出
摘要 连续属性离散化方法能够提高数据挖掘和归纳学习等算法的分类预测能力。提出一种统计独立性的离散化新方法,该方法改进了基于卡方统计的区间合并函数,不仅考虑了各对合并区间中卡方自由度对离散化结果的影响,而且考虑了数据类分布的影响,很好地衡量了类-属性之间的相互独立性。实验结果表明,新方法显著地提高了Nave-bayes和SVM分类器的学习精度。 Continuous attributes discretization method can improve the classification and prediction capability of such algorithms as data mining or inductive learning.A novel discretization method for statistical independence is proposed,which improves the chi-square-based interval merging function.It considers not only the influence of chi-square freedom degree in every merged interval pair on discretization results,but also that of data class distribution so that the mutual independence between the class and the attributes are excellently measured.Experimental results show that the new method significantly enhances the classification accuracy of Nave-bayes and SVM classifiers.
作者 吴育锋
出处 《计算机应用与软件》 CSCD 北大核心 2012年第4期249-252,共4页 Computer Applications and Software
关键词 离散化 数据挖掘 卡方 统计独立性 Discretization Data mining Chi-square Statistical independence
  • 相关文献

同被引文献2

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部