摘要
朴素贝叶斯分类模型是一种简单而有效的分类方法,但它的属性独立性假设使其无法表达属性变量间存在的依赖关系,影响了它的分类性能.通过分析贝叶斯分类模型的分类原则以及贝叶斯定理的变异形式,提出了一种基于贝叶斯定理的新的分类模型DLBAN(double-level Bayesian network augmented naive Bayes).该模型通过选择关键属性建立属性之间的依赖关系.将该分类方法与朴素贝叶斯分类器和TAN(tree augmented naive Bayes)分类器进行实验比较.实验结果表明,在大多数数据集上,DLBAN分类方法具有较高的分类正确率.
Naive Bayes classifier is a simple and effective classification method, but its attribute independence assumption makes it unable to express the dependence among attributes, and affects its classification performance. On the basis of analyzing the classification principle of Bayesian classification model and a variant of Bayes theorem, a new classification model based on Bayes theorem, DLBAN (double-level Bayesian network augmented naive Bayes), which adds the dependence among attributes by selecting the key attributes, is proposed. DLBAN classifier is compared with Naive Bayes classifier and TAN (tree augmented naive Bayes) classifier by an experiment. Experimental results show this model has higher classification accuracy in most data sets.
出处
《软件学报》
EI
CSCD
北大核心
2004年第2期193-199,共7页
Journal of Software
基金
国家"十五"重点科技攻关项目No.2002BA407B~~