
A Naive Bayesian Network Classification Model Based on K-means Clustering (cited by: 3)
Abstract To address the low efficiency and limited accuracy of the naive Bayesian network classification model on large, high-dimensional data sets, this paper combines principal component analysis (PCA) with the K-means clustering algorithm to construct an improved naive Bayesian network classification model. The model drops the premise that the non-class attribute variables are independent relative to the class attribute variable. The algorithm first applies PCA to reduce the dimensionality of the data set while preserving as much of its information as possible, so that the algorithm can concentrate on the classification problem. It also introduces the concept of a "relative fusion point", which effectively improves the algorithm's performance. Finally, the performance of the algorithm is analyzed, and the improved algorithm is applied to a real data set, where its classification results are used to repair missing data; the results show that the algorithm is effective.
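The abstract names three stages (PCA, K-means, naive Bayes) but gives no implementation detail, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes NumPy, assumes K-means is used to discretize each principal component into bins, assumes "missing data" means missing class labels (marked -1 here), and leaves out the "relative fusion point", which the abstract does not define. The function names (`pca_fit`, `kmeans_1d`, `discretize`, `nb_fit`, `nb_predict`) and the synthetic data are hypothetical.

```python
import numpy as np

def pca_fit(X, n_components):
    """Return the mean and top principal axes of X (rows = samples), via SVD."""
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]

def kmeans_1d(values, k, n_iter=100, seed=0):
    """Lloyd's algorithm on a 1-D array; returns the k centroids."""
    rng = np.random.default_rng(seed)
    centroids = rng.choice(values, size=k, replace=False)
    for _ in range(n_iter):
        assign = np.abs(values[:, None] - centroids[None, :]).argmin(axis=1)
        updated = np.array([values[assign == j].mean() if np.any(assign == j)
                            else centroids[j] for j in range(k)])
        if np.allclose(updated, centroids):
            break
        centroids = updated
    return centroids

def discretize(Z, centroids_per_axis):
    """Replace each projected coordinate by the index of its nearest centroid."""
    cols = [np.abs(Z[:, [j]] - c[None, :]).argmin(axis=1)
            for j, c in enumerate(centroids_per_axis)]
    return np.stack(cols, axis=1)

def nb_fit(B, y, k, n_classes):
    """Categorical naive Bayes with Laplace smoothing (log-space tables)."""
    log_prior = np.log(np.bincount(y, minlength=n_classes) / len(y))
    # log_like[c, j, b] = log P(feature j falls in bin b | class c)
    log_like = np.zeros((n_classes, B.shape[1], k))
    for c in range(n_classes):
        Bc = B[y == c]
        for j in range(B.shape[1]):
            counts = np.bincount(Bc[:, j], minlength=k) + 1.0
            log_like[c, j] = np.log(counts / counts.sum())
    return log_prior, log_like

def nb_predict(B, log_prior, log_like):
    """Pick the class with the highest posterior score for each row of B."""
    scores = np.tile(log_prior, (len(B), 1))
    for j in range(B.shape[1]):
        scores += log_like[:, j, :][:, B[:, j]].T
    return scores.argmax(axis=1)

# Synthetic data (assumption): two well-separated classes in three dimensions.
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(0.0, 0.3, size=(20, 3)),
               rng.normal(5.0, 0.3, size=(20, 3))])
y = np.array([0] * 20 + [1] * 20)
y_observed = y.copy()
y_observed[[3, 27]] = -1          # -1 marks a missing class label

k = 2                             # bins per principal component
mean, axes = pca_fit(X, n_components=2)
Z = (X - mean) @ axes.T           # reduced-dimension representation
centroids = [kmeans_1d(Z[:, j], k) for j in range(Z.shape[1])]
B = discretize(Z, centroids)      # discrete features for naive Bayes

known = y_observed >= 0
log_prior, log_like = nb_fit(B[known], y_observed[known], k, n_classes=2)
y_repaired = y_observed.copy()
y_repaired[~known] = nb_predict(B[~known], log_prior, log_like)
```

Here the classifier is trained only on rows whose label is known and then fills in the two masked labels. Discretizing each component separately is one design choice among several; clustering the projected points jointly would be an equally plausible reading of the abstract.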
Source Journal of Chongqing Technology and Business University (Natural Science Edition), 2012, No. 8, pp. 36-41 (6 pages)
Funding Chongqing Science and Technology Key Research Fund project (CSTC 2009AC2068)
Keywords Bayesian network classification; naive Bayesian network; K-means clustering; data mining
