期刊文献+

基于IN算法的剪枝优化算法 被引量:1

Pruning Optimization Algorithm Based on IN Algorithm
下载PDF
导出
摘要 提出一种基于IN算法构造分类器的剪枝优化算法C IN.针对IN算法利用对数似然比统计量进行假设检验存在的统计意义不明确的问题,本文算法在给定层每一节点引入了样本数阈值和属性值阈值的计算,从而保证检验的有效性.给出了算法的理论依据,并且推导出了对数似然比统计量计算公式成立条件.实验表明,该算法能够消减数据维数并且可以从大规模数据集中提取简明的规则. This paper proposed a novel algorithm termed as CIN for classification based on IN ( information-theoretic network) algorithm. Aim at ignorance of statistical significance in statistical hypotesis testing by means of the log likelihood ratio in IN algorithm, the CIN algorithm in troduces the threshold of the number of records in each node of given layer so as to guarantee reliability of testing. At the same time, the theoretic basis of the algorithm is given and precondition for the validity of the log likelihood ratio is derived. Empirical results show that the data dimensionality can be reduced and compact rules can be extracted with the CIN algorithm.
出处 《信阳师范学院学报(自然科学版)》 CAS 北大核心 2007年第2期237-240,共4页 Journal of Xinyang Normal University(Natural Science Edition)
关键词 互信息 对数似然比统计量 entropy mutual information the log likelihood ratio statistic
  • 相关文献

参考文献4

  • 1LAST M,MAIMON O.A Compact and Accurate Model for Classification[J].IEEE Transactions on Knowledge and Data Engineering(S1041-4347),2004,16(2):203-215.
  • 2MAIMON O,KANDEL A,LAST M.Information-Theoretic Fuzzy Approach to Knowledge Discovery in Databases.Advances in Soft Computing-Engineering Design and Manufacturing[M].London:Eds Springer-Verlag,1999:315-326.
  • 3VANMALI M.,LAST M,KANDEL A.Using a Neural Network in the Software Testing Process International[J].Journal of Intelligent Systems(S0334-1860),2002,17(1):45-62.
  • 4BLAKE C L,MERZ C J.UCI Repository of Machine Learning Databases[EB/OL].(2002-07-19)[2006-10-20].http://www.ics.uci.edu/-mlearn/MLRepository.html.

同被引文献8

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部