Research on Parallel Cascade SVM Based on Mixed Samples (cited by: 1)
Abstract: A cascade support vector machine (SVM) randomly partitions the original data set into multiple subsets and trains on those subsets in parallel, which can effectively improve the training efficiency of the classifier. However, random partitioning may leave the parallel nodes with an unbalanced text information structure, which in turn degrades the final classification performance. This paper proposes a training model in which the sub-classifiers are trained on mixed samples. Experiments show that a cascade SVM trained on mixed samples handles the imbalance in the information structure of the training samples well, and that the classifier obtained by cascade training has good accuracy and stability.
Authors: ZHANG Hong-sheng (张洪胜); DING Yong-hong (丁永红) (Huainan United University, Huainan 232038, China)
Source: Journal of Jinling Institute of Technology (《金陵科技学院学报》), 2019, No. 3, pp. 8-11 (4 pages)
Funding: Key Natural Science Project of the Anhui Provincial Department of Education (KJ2017A586)
Keywords: text classification; mixed sample; cascade support vector machine
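
The imbalance the abstract describes can be illustrated with a small sketch. The record does not say how the paper builds its mixed samples, so the code below is not the authors' method: it only contrasts a label-blind random split with a simple per-class round-robin split, which is an assumption chosen for illustration. The function names and the toy corpus are hypothetical.

```python
import random
from collections import Counter


def random_partition(samples, k, seed=0):
    """Shuffle the samples and deal them into k subsets, ignoring labels."""
    rng = random.Random(seed)
    shuffled = samples[:]
    rng.shuffle(shuffled)
    return [shuffled[i::k] for i in range(k)]


def stratified_partition(samples, k, seed=0):
    """Deal each class round-robin so every subset gets a similar class mix.

    Assumption for illustration only; the paper's mixed-sample scheme
    is not specified in this record.
    """
    rng = random.Random(seed)
    by_label = {}
    for text, label in samples:
        by_label.setdefault(label, []).append((text, label))
    subsets = [[] for _ in range(k)]
    offset = 0  # continue dealing where the previous class stopped
    for label in sorted(by_label):
        group = by_label[label]
        rng.shuffle(group)
        for i, item in enumerate(group):
            subsets[(offset + i) % k].append(item)
        offset += len(group)
    return subsets


def label_counts(subsets):
    """Count the labels in each subset, one Counter per parallel node."""
    return [Counter(label for _, label in subset) for subset in subsets]


# Hypothetical toy corpus: 40 "spam" documents and 8 "ham" documents.
samples = [(f"doc{i}", "spam") for i in range(40)]
samples += [(f"doc{i}", "ham") for i in range(40, 48)]

parts = stratified_partition(samples, k=4)
counts = label_counts(parts)
print(counts)
print(label_counts(random_partition(samples, k=4)))
```

With the round-robin split every node receives the same class proportions as the full corpus, while the label-blind split may give some nodes very few minority-class documents, which is the kind of imbalance the abstract attributes to random partitioning.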
