期刊文献+

PIE:实值属性离散化方法及应用 被引量:1

PIE:discretization method for real attributes and its application
下载PDF
导出
摘要 提出一种基于概率与信息熵理论的实值属性离散化方法,综合考虑了各对合并区间之间的差异性;该方法利用信息熵衡量相邻区间的相似性,同时考虑离散区间大小和区间类别数对学习精度的影响,并通过概率的方法得到了这两个因素的衡量标准。仿真结果表明,新方法对See5/C5.0分类器有较好的分类学习能力,并在肿瘤诊断中得到了很好的应用。 This paper presents a diseretization method for real attributes based on probability and information entropy,namch PIE, which synthetically considers the variance among the merged intervals. This method mcasures the similarity of each imerval intervals by using information entropy and takes into account the effect of the discrete interval size and class nnmber of each interval on learning accuracy, and the measurement of two fae.tors is achieved with probabilistic means. Simulation results show that PIE eam yield more classification and learning accuracy by running See5/C5.0 classifier and has better application on tumot diagnosis.
作者 李杰 王欢
出处 《微型机与应用》 2011年第15期68-70,77,共4页 Microcomputer & Its Applications
关键词 离散化 数据挖掘 概率 信息熵 diseretization data mining probabilily information entropy
  • 相关文献

参考文献7

  • 1DOUGHERTY J, KOHAVI R, SAHAMI M. Supervised and unsupervised discretization of continuous feature [C]. Proceedings of the 12th International Conference of Machinelearning. San Francisco: Morgan Kaufmann, 1995.
  • 2FAYYAD U, IRANI K, Multi-interval discretization of continuous-valued attributes for classification learning [C]. Proceedings of the 13th International Joint Conference onArtificial Intelligence. San Mateo, CA: Morgan Kaufmann, 1993.
  • 3KURGAN L A, CIOS K J. CAIM discretization algorithm[J]. IEEE Transactions on Knowledge and Data Engineering, 2004, 16(2): 145-153.
  • 4LIU H, SETIONO R. Feature selection via discretization[J]. IEEE Transactions on Knowledge and Data Engineering, 1997, 9(4): 642-645.
  • 5CHAO T S, JYH H H. An extended chi2 algorithm for discretization of real value attributes [J]. IEEE Transactions Knowledge and Data Engineering, 2005,17(3) :437-441.
  • 6PAWLAK Z. Rough sets[J]. International Journal of Computer and Information Sciences, 1982,11(5):341-356.
  • 7HETTICH S, BAY S D. The UCI KDD Archive [DB/OL]. http : //kdd.ics.uci.edu/ , 1999.

同被引文献4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部