期刊文献+

维规约对朴素贝叶斯分类性能的影响研究 被引量:1

ON EFFECT OF DIMENSION REDUCTION ON CLASSIFICATION PERFORMANCE OF NAVE BAYES
下载PDF
导出
摘要 朴素贝叶斯算法是一种简单而高效的分类算法,但属性的条件独立性假设并不符合客观实际,特别是高维度数据的属性之间往往存在相关关系,如何能在实现对数据降维的同时又提高朴素贝叶斯的分类性能是一个重要的研究问题。对基于条件信息熵的选择朴素贝叶斯、基于主成分分析的朴素贝叶斯和基于独立成分分析的朴素贝叶斯算法进行研究,通过在UCI数据集上的仿真实验,详细比较了几种维规约算法对朴素贝叶斯分类性能的影响。 Nave Bayes(NB) algorithm is an effective and simple classification algorithm,but its conditional independence assumption is not in compliance with the objective reality,especially the attributes in high-dimension data set are always dependent on each other.It is an important research subject to reduce the dimensions of the data while improve the classification performance.In the paper,conditional information entropy-based selected nave Bayes(CIESNB),principal components analysis-based na?ve Bayes(PCANB) and independent components analysis-based nave Bayes(PCANB) are studied.The effects of these dimension reduction algorithms on NB classification performance are compared in detail through the simulation experiment carried out on UCI data sets.
作者 邓维斌
出处 《计算机应用与软件》 CSCD 2010年第6期89-91,共3页 Computer Applications and Software
基金 重庆市自然科学基金项目(2008BA2017)
关键词 朴素贝叶斯 维规约 条件信息熵 主成分分析 独立成分分析 Nave Bayes Dimension reduction Conditional information entropy Principal components analysis Independent components analysis
  • 相关文献

参考文献8

  • 1Chickering D M.Learning Bayesian networks is NP-complete[M].New York:Springer.1996:121-130.
  • 2李国英.关于高维、相依和不完全数据的统计分析[J].数学进展,2002,31(3):193-199. 被引量:4
  • 3Ye J P,Li Q,Xiong H,et al.IDR/QR:An Incremental Dimension Reduction Algorithm via QR Decomposition[J].IEEE Transactions on Knowledge and Data Engineering,2005,17(9):1208-1222.
  • 4邓维斌,黄蜀江,周玉敏.基于条件信息熵的自主式朴素贝叶斯分类算法[J].计算机应用,2007,27(4):888-891. 被引量:16
  • 5Bura E.Dimension Reduction Techniques:A Rewies[EB/OL].2006.http://srccs.snu.ac.kr/Work-shop/04Statistics/7.pdf.
  • 6Sergios Theodoridis,Konstantinos Koutroumbas.模式识别[M].3版.北京:电子工业出版社,2006:744.
  • 7Jutten C,Herault J.Independent component analysis versus PCA[C] //Proc of European Symposium on Signal Processing.1988:287-314.
  • 8Hyvarinen A.Fast and robust fixed-point algorithm for independent component analysis[J].IEEE Transactions on Neural Networks,1999,10(3):626-634.

二级参考文献18

  • 1Efron B R A. Fisher in the 21st Century. Statistical Science, 1998, 13: 95-122.
  • 2Casella G, etc. Vignettes for the year 2000: Theory and Metheods. J. Amer Statist. Assoc. 98: 1269-1368.
  • 3PAWLAK Z.Rough set[J].International Journal of Computer and Information Sciences,1982,11 (5):341:356.
  • 4YIN DS,WANG GY,WU Y.A Self-learning Algorithm for Decision Tree Pre-pruning[A].Proceedings of the Third International Conference on Machine Learning and Cybernetics[C].Shanghai,2004.2140 -2145.
  • 5CHICKERING DM.Learning Bayesian networks is NP-complete[A].Learning from Data:AI and Statistics[C].New York,1996.121 -130.
  • 6LANGLEY P,SAGE S.Induction of selective Bayesian classifiers[A].Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence[C].Morgan Kaufmann,1994.399 -406.
  • 7HUANG J,LU JJ,LING CX.Comparing Naive Bayes,Decision Trees and SVM with AUC and Accuracy[A].Proceedings of the Third IEEE International Conference on Data Mining(ICDM'2003)[C].Melbourne,Florida,USA.2003.
  • 8FRIEDMAN N,GEIGER D,GOLDSZMIDT M.Bayesian Network Classifiers[J].Machine Learning,1997,29:131-163.
  • 9HARRY Z,SHENG SL.Learning Weighted Naive Bayes with Accurate Ranking[A].Fourth IEEE International Conference on Data Mining (ICDM'04)[C].Brighton,UK.2004.567-570.
  • 10WANG GY,ZHENG Z,WU Y.RIDAS-A Rough Set Based Intelligent Data Analysis System[A].First IEEE International Conference On Machine Learning and Cybernitics(ICMLC2002)[C].Beijing.2002.646-649.

共引文献20

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部