期刊文献+

从局部分类精度到分类置信度的变换 被引量:6

Transformation from Local Accuracy to Classification Confidence
下载PDF
导出
摘要 基于局部分类精度设计多分类器系统能够有效地提高分类正确率.目前流行的动态分类器选择方法不能充分利用各个基本分类器的信息.在动态分类器选择方法中,局部分类精度最高的基本分类器决定最终的分类结果,其他基本分类器的信息被忽略.提出了一种将局部分类精度变换为分类置信度的方法,从而可以利用度量层分类器融合方法对得到的置信度进行融合.与动态分类器选择方法相比,度量层分类器融合方法能够利用更多的信息,从而能够取得更高的分类正确率.ELENA数据库、UCI数据库和DELVE数据库上的大量实验表明,新方法在分类正确率方面超过动态分类器选择方法大约0.2%~13.6%. Local accuracy, which represents the accuracy of a base classifier for an input pattern, is one kind of valuable information used in the classifier combination methods. However, the only existing classifier combination method which uses local accuracy, namely dynamic classifier selection, can not take full advantage of the information from the base classifiers. In dynamic classifier selection, the final classification result is determined by the base classifier with the highest local accuracy, and the local accuracies of the other base classifiers are neglected. In this paper, a method of transforming local accuracy into classification confidence is proposed, in which the confidence value corresponding to the class outputted by a base classifier is proportional to the local accuracy of the base classifier, and the confidence values corresponding to the other classes are assumed to be equal. After the transformation, multiple classifier systems can be designed with measurement level classifier fusion methods such as linear combination. Compared with dynamic classifier combination, the classifier fusion methods, which make decisions by integrating the classification confidences of all the base classifiers, can use more information and achieve a higher classification rate. To evaluate this approach, a lot of experiments are conducted on six large data sets selected from the ELENA, UCI, and DELVE databases. The experimental results show that the approach outperforms dynamic classifier selection by 0.2% to 13.6% in classification rate.
出处 《计算机研究与发展》 EI CSCD 北大核心 2008年第9期1612-1619,共8页 Journal of Computer Research and Development
基金 国家"九七三"重点基础研究发展规划基金项目(2004CB318110) 北京市自然科学基金项目(4082025) 高等学校博士点新教师基金项目(20070004037) 河北省科学技术研究与发展计划基金项目(072135188)~~
关键词 信息融合 模式识别 集成学习 分类器组合 局部分类精度 分类置信度 information fusion pattern recognition ensemble learning classifier combination localaccuracy classification confidence
  • 相关文献

参考文献31

  • 1姜远,周志华.基于词频分类器集成的文本分类方法[J].计算机研究与发展,2006,43(10):1681-1687. 被引量:22
  • 2Dietterich T G. Machine learning research: Four current directions [J]. AI Magazine, 1997, 18(4): 97-136
  • 3唐伟,周志华.基于Bagging的选择性聚类集成[J].软件学报,2005,16(4):496-502. 被引量:94
  • 4Breiman T. Bagging predictors [J]. Machine Learning, 1996, 24(2): 123-140
  • 5Freund Y, Schapire R E. A short introduction to boosting[J]. Journal of Japanese Society for Artificial Intelligence, 1999, 14(5): 771-780
  • 6Zhou Z H, Yang Y. Ensembling local learners trough multimodal perturbation [J]. IEEE Trans on Systems, Man, and Cybernetics, Part B, 2005, 35(4): 725-735
  • 7Kittler J, Hatef H, Duin R P W, et al. On combining classifiers [J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 1998, 20(3): 226-239
  • 8Kuncheva L I . Combining Pattern Classifiers: Methods and Algorithms [M]. New York: Wiley, 2004.
  • 9Huang Y S, Suen C Y. A method of combining multiple experts for the recognition of unconstrained handwritten numerals [J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 1995, 17 (1): 90-94
  • 10Kegelmeyer W P, Bowyer K. Combination of multiple classifiers using local accuracy estimates [J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 1997,19(4) : 405 -410

二级参考文献51

  • 1马亮,陈群秀,蔡莲红.一种改进的自适应文本信息过滤模型[J].计算机研究与发展,2005,42(1):79-84. 被引量:18
  • 2Estivill-Castro V. Why so many clustering algorithms-A position paper. SIGKDD Explorations, 2002,4(1):65-75.
  • 3Dietterich TG. Machine learning research: Four current directions. AI Magazine, 1997,18(4):97-136.
  • 4Breiman L. Bagging predicators. Machine Learning, 1996,24(2):123-140.
  • 5Zhou ZH, Wu J, Tang W. Ensembling neural networks: Many could be better than all. Artificial Intelligence, 2002,137(1-2):239-263.
  • 6Strehl A, Ghosh J. Cluster ensembles-A knowledge reuse framework for combining partitionings. In: Dechter R, Kearns M,Sutton R, eds. Proc. of the 18th National Conf. on Artificial Intelligence. Menlo Park: AAAI Press, 2002. 93-98.
  • 7MacQueen JB. Some methods for classification and analysis of multivariate observations. In: LeCam LM, Neyman J, eds. Proc. of the 5th Berkeley Symp. on Mathematical Statistics and Probability. Berkeley: University of California Press, 1967,1:281-297.
  • 8Blake C, Keogh E, Merz CJ. UCI Repository of machine learning databases. Irvine: Department of Information and Computer Science, University of California, 1998. http://www.ics.uci.edu/~mlearn/MLRepository.html
  • 9Modha DS, Spangler WS. Feature weighting in k-means clustering. Machine Learning, 2003,52(3):217-237.
  • 10Zhou ZH, Tang W. Clusterer ensemble. Technical Report, Nanjing: AI Lab., Department of Computer Science & Technology,Nanjing University, 2002.

共引文献185

同被引文献61

  • 1马骏声.GBR地基成像雷达及其难点[J].航天电子对抗,1994(4):32-35. 被引量:1
  • 2林志贵,徐立中,周金陵.基于修改模型的冲突证据组合方法[J].上海交通大学学报,2006,40(11):1964-1970. 被引量:19
  • 3王涛,李舟军,胡小华,颜跃进,陈火旺.一种高效的数据流挖掘增量模糊决策树分类算法[J].计算机学报,2007,30(8):1244-1250. 被引量:18
  • 4SCHLIMMER J C,GRANGER R H.Incremental learning from noisy data[J].Machine Learning,1986,1(3):317-354.
  • 5WIDMER G,KUBAT M.Learning in the presence of concept drift and hidden contexts[J].Machine Learning,1996,23(1):69-101.
  • 6SALGANICOFF M.Tolerating concept and sampling shift in lazy learning using prediction error context switching[J].Artificial Intelligence Review,1997,11(1/5):133-155.
  • 7HARRIES M B,SAMMUT C,HORN K.Extracting hidden context[J].Machine Learning,1997,27(3):259-286.
  • 8DOMINGOS P,HULTEN G.Mining high-speed data streams[C]//Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.USA,Boston:2000:71-80.
  • 9WANG H,FAN W,YU S,et al.Mining concept-drifting data streams using ensemble classifiers[C]//Proceeding of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.USA,Washington:2003:226-235.
  • 10STREET W N,KIM Y S.A streaming ensemble algorithm for large-scale classification[C]//Proceeding of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.USA,San Francisco:2001:377-382.

引证文献6

二级引证文献21

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部