期刊文献+

多阶段聚类—朴素贝叶斯的异常检测 被引量:1

Anomaly detection based on the multi-phase clustering and naive bayes
下载PDF
导出
摘要 针对异常检测手段用来标定数据集中明显的不同于其他数据的对象,提出多阶段聚类旨在解决噪声数据的引入和缺失属性样本的处理,并改变传统的贝叶斯分类的被动学习为主动学习的方式来构造性能优越的分类器。在数据预处理阶段,利用密度聚类滤去噪声点,密度聚类的输出作为下一阶段的K-means聚类的输入,提高了K-means的分类准确率。K-means负责对缺失属性的样本进行处理。在分类阶段,利用adaboost学习算法优化朴素贝叶斯分类器,使其获得较好的分类效果。 Anomaly detection method was used for calibration data concentration significantly different from other data objects. In this paper, the multi-phase clustering aimed at resolving the import of noise data and the lack of the attributive sample, and changing the traditional passive learning of bayes for active learning ways to structure the superior performance classifier. In the pre-processing stage, a clustering algorithm based on density is introduced to handle noise data. And the output of the density-based clustering algorithm can be used as the input of K-means, which responsible for handling the training samples with absent values. At classification time, we introduce adaboost algorithm into naive bayes to generate a more effective classifier.
出处 《重庆大学学报(自然科学版)》 EI CAS CSCD 北大核心 2009年第8期983-986,共4页 Journal of Chongqing University
基金 山东省自然科学基金资助项目(Y2007G19) 哈尔滨工业大学(威海)研究基金资助项目(HIT(WH)ZB200813)
关键词 聚类 朴素贝叶斯 主动学习 K—means算法 clustering algorithms naive bayes active learning k-means algorithms
  • 相关文献

参考文献4

二级参考文献11

  • 1Dougherty J,Kohavi R,Sahami M.Supervised and unsupervised discretization of continuous features[C].In:Proceedings of the 12th International Conference on Machine Learning,Morgan Kaufmann Publishers,San Francisco,CA,1995:194~202
  • 2Schapire R E.The strength of learnability[J].Machine learning,1990;5(2):197~227
  • 3Freund Y,Schapire R E.A decision theoretic genelization of online learning and an application to boosting[J].Joumal of Computer and System Science,1997; 55 (1):119~139
  • 4Schapire R E,Singer Y.Improved boosting algorithms using confidence related predictions[C].In:Proceeding of the 11th Annual Conference on Computational Learning Theory,1998:80~91
  • 5E Bauer,R Kohavi.An empirical comparison of voting classification algorithm:Bagging,boosting and variants.Machine Leaming,1999:105~142
  • 6K M Ting,Z Zheng.Improving the performance of boosting for naive Bayesian classification[C].In:Proceedings of the 3th Pacific-Asia Conference on Knowledge Discovery and Data Mining,Berlin Germany:Springer-Verlag,1999:296~312
  • 7Yonglong W et al.MIQR Active Learning on a Continuous Function and a Discontinuous Function[J].Neural Computing and Applications,2001; 10(3):253~270
  • 8Hong L Shang-teng H.A Genetic Semi-supervised Fuzzy Clustering Approach to Text Classification[C].In:Proceedings of the 4th International Conference on Web-Age Information Management,Chengdu,China,2003:173~180
  • 9Han Jiawei,Kamber M.数据挖掘概念与技术[M].第2版.范明,孟小峰,等译.北京:机械工业出版社,2001.
  • 10Semi K L.Naive Bayesian Classifiers[C]//In:Proceedings of European Conference on Artificial Intelligence.Porto,Portugal:Springer Verlag,1991:206-219.

共引文献46

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部