Improved Classification Algorithm for Stacking Integration and Its Application (cited by: 10)
Abstract: To improve the classification performance of the Stacking ensemble (integration) algorithm, this paper proposes AVWNB-Stacking (Stacking based Attribute Value Weight Naive Bayes), a Stacking ensemble classification algorithm built on an attribute-value-weighted Naive Bayes algorithm. The method makes full use of the prior information produced by the Stacking learning mechanism and of the rich probabilistic expressiveness of Bayesian networks. It goes one level deeper than attributes, to attribute values: with mutual information (MI) as the basis of the weight measure, the attribute weight vector is extended horizontally so that each attribute value receives its own weight, instead of all values of an attribute sharing the same weight. This addresses the loss of classification accuracy that the attribute independence assumption causes when Naive Bayes serves as the Stacking meta-classifier. Experimental results show that, compared with traditional algorithms and with Stacking classification algorithms that use other meta-classifiers, AVWNB-Stacking effectively improves the classification performance of the model, reaching AUC values of 0.8007 and 0.8607 on the two test sets.
Authors: Lu Wanrong (陆万荣); Xu Jiangchun (许江淳); Li Yuhui (李玉惠) (Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, Yunnan, China)
Source: Computer Applications and Software (《计算机应用与软件》), Peking University Core Journal, 2022, No. 2, pp. 281-286 (6 pages)
Funding: National Natural Science Foundation of China (61363043).
Keywords: Stacking integration; Bayesian network; Mutual information; Attribute value weight
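
The abstract describes giving each attribute value its own mutual-information-based weight inside a Naive Bayes meta-classifier. The sketch below illustrates that idea only; it is not the authors' implementation. The names `value_weights` and `ValueWeightedNB`, the exact weight formula (each value's contribution to the MI between its attribute and the class), the Laplace smoothing, and the toy meta-level data are all assumptions made for illustration. Each row of `meta_X` stands for the labels predicted by three hypothetical base classifiers.

```python
import math
from collections import Counter, defaultdict


def value_weights(X, y):
    """Return one dict per attribute, mapping each value to a weight.

    Assumed weight (not taken from the paper):
        w[i][v] = sum_c P(v, c) * log(P(v, c) / (P(v) * P(c)))
    Summing w[i][v] over v gives the mutual information MI(attribute i; class).
    """
    n = len(y)
    class_counts = Counter(y)
    weights = []
    for i in range(len(X[0])):
        value_counts = Counter(row[i] for row in X)
        joint_counts = Counter((row[i], c) for row, c in zip(X, y))
        w = defaultdict(float)
        for (v, c), n_vc in joint_counts.items():
            p_vc = n_vc / n
            p_v = value_counts[v] / n
            p_c = class_counts[c] / n
            w[v] += p_vc * math.log(p_vc / (p_v * p_c))
        weights.append(dict(w))
    return weights


class ValueWeightedNB:
    """Naive Bayes over discrete attributes with one weight per attribute value."""

    def fit(self, X, y):
        self.classes = sorted(set(y))
        self.n = len(y)
        self.n_attrs = len(X[0])
        self.class_counts = Counter(y)
        # Number of distinct values per attribute, used for Laplace smoothing.
        self.n_values = [len({row[i] for row in X}) for i in range(self.n_attrs)]
        self.cond_counts = Counter(
            (i, row[i], c) for row, c in zip(X, y) for i in range(self.n_attrs)
        )
        self.weights = value_weights(X, y)
        return self

    def log_scores(self, x):
        """Score each class as log P(c) + sum_i w[i][x_i] * log P(x_i | c)."""
        scores = {}
        for c in self.classes:
            s = math.log((self.class_counts[c] + 1) / (self.n + len(self.classes)))
            for i, v in enumerate(x):
                p = (self.cond_counts[(i, v, c)] + 1) / (
                    self.class_counts[c] + self.n_values[i]
                )
                # Values never seen in training get weight 0 and are ignored.
                s += self.weights[i].get(v, 0.0) * math.log(p)
            scores[c] = s
        return scores

    def predict(self, x):
        scores = self.log_scores(x)
        return max(scores, key=scores.get)


# Toy meta-level data: each row holds three base classifiers' predicted labels.
meta_X = [(1, 1, 0), (1, 1, 1), (0, 0, 1), (0, 0, 0), (1, 0, 1), (0, 1, 0)]
meta_y = [1, 1, 0, 0, 1, 0]

model = ValueWeightedNB().fit(meta_X, meta_y)
print(model.weights)
print(model.predict((1, 0, 0)))
```

In this toy data the first base classifier always agrees with the true label, so its values receive larger weights than those of the other two, and its vote dominates the weighted log-likelihood. The raw MI contributions are small numbers, so a practical version would probably rescale them (for example, so that they average to 1); that choice is left open here.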
