期刊文献+

用AUC评估分类器的预测性能 被引量:2

Using AUC to Evaluate Predictive Performance of Classifiers
下载PDF
导出
摘要 准确率一直被作为分类器预测性能的主要评估标准,但是它存在着诸多的缺点和不足。本文将准确率与AUC(the area under the Receiver Operating Characteristic curve)进行了理论上的对比分析,并分别使用AUC和准确率对3种分类学习算法在15个两类数据集上进行了评估。综合理论和实验两个方面的结果,显示了AUC不但优于而且应该替代准确率,成为更好的分类器性能的评估度量。同时,用AUC对3种分类学习算法的重新评估,进一步证实了基于贝叶斯定理的Naive Bayes和TAN-CMI分类算法优于决策树分类算法C4.5。 Accuracy has been used as a main evaluation criterion for predictive performance of classifiers. However, it has many shortcomings and disadvantages. In this paper, we compared accuracy to AUC( the area under the Receiver Operating Characteristic curve) measure in theory and used respectively AUC and accuracy to evaluate three classification learning algorithms on fifteen binary datasets. Theoretical and experimental results show that AUC is not only a better measure than accuracy but also should replace it in comparing classifiers. Furthermore, using AUC to re-evaluate three classification algorithms shows classification algorithm NaiveBayes and TAN-CMI based on Bayes theorem are better than decision tree classification algorithm CA.5 in performance.
出处 《情报学报》 CSSCI 北大核心 2007年第2期275-279,共5页 Journal of the China Society for Scientific and Technical Information
关键词 ROC AUC 准确率 交叉验证 ROC, AUC, accuracy, cross-validation
  • 相关文献

参考文献11

  • 1Provost F,Fawcett T,Kohavi R.The case against accuracy estimation for comparing induction algorithms[C]∥Proceedings of 15th International Conference on Machine Learning.Morgan Kaufmann,1998:445-453.
  • 2Provost F,Fawcett T.Analysis and visualization of classifier performance:Comparison under imprecise class and cost distribution[C]∥Proceedings of 3rd International Conference on Knowledge Discovery and Data Mining (KDD-97).Menlo Park,CA:AAAI Press,1997:43-48.
  • 3Ling C X,Huang J,Zhang H.AUC:a statistically consistent and more discriminating measure than accuracy[C]∥Proceedings of 18th International Joint Conference on Artificial Intelligence (IJCAI-03).Acapulco,Mexico,2003.
  • 4Hanley J A,McNeil B J.The meaning and use of the area under a receiver operating characteristic (ROC) curve.Radiology[J].1982,143:29-36.
  • 5Bradley A P.The use of the area under the ROC curve in the evaluation of machine learning algorithms[J].Pattern Recognition,1997,30:1145-1159.
  • 6Hand D J,Till R J.A simple generalisation of the area under the ROC curve for multiple class classification problems[J].Machine Learning,2001,45:171-186.
  • 7Wu S M,Flach P.Scored and weighted AUC metrics for classifier evaluation and selection[C]∥Proceedings of 2nd Workshop on ROC Analysis in Machine Learning (ROCML-05).Bonn,Germany,2005.
  • 8Friedman N,Geiger D,Goldszmidt M.Bayesian network classifiers[J].Machine Learning,1997,29:131-163.
  • 9Quinlan J R.C4.5:Programs for Machine Learning.San Francisco:Morgan Kaufmann Publishers,1993.
  • 10Blake C,Merz C.UCI Repository of Machine Learning Databases[OL].[1998].http://www.ics.uci.edu/-mlearn/ MLRepository.html.

同被引文献16

引证文献2

二级引证文献34

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部