期刊文献+

基于SVM算法的文本分类技术研究 被引量:83

Research of Text Categorization Based on Support Vector Machine
下载PDF
导出
摘要 在优化分类技术的研究中,文本特征化后通常具有高维性和不平衡性的特点,导致传统的分类算法准确率不高的问题。针对文本分类器的性能容易受到核函数和参数的影响的问题,为提高文本分类器的准确性。采用支持向量机(SVM)的理论在文本分类技术同时将根据优化的粒子群算法(PSO)引入SVM分类算法中进行优化文本分类器的参数,将分类器的准确率作为PSO算法适应度函数通过粒子移动操作找出最佳参数并用SVM算法进行分类。在文本数据集上的仿真结果表明,与传统的算法相比,经PSO算法优化后的SVM文本分类器的准确性更高,PSO算法是一种有效的优化方法,能广泛应用于文本分类问题。 Text characterization usually has the characteristics of high dimensional and unbalanced, which causes the probems that traditional classification algorithm accuracy is not high, the performance of text categorization is vulnerable to the influence of kernel function and parameters. In order to improve the accuracy of the text classifier, this article used the support vector machine (SVM) theory to study the text classification technology, and the theory of particle swarm optimization (PSO) algorithm, the classification algorithm was introduced to the SVM to optimize the parameters of the text classifie, we used the accuracy of the classifier as fitness functions, used particles move operation to find the best parameters, and used the SVM algorithm to classify the texts. Compared with the traditional algorithm, the new classifier has higher accuracy.
出处 《计算机仿真》 CSCD 北大核心 2013年第2期299-302,368,共5页 Computer Simulation
关键词 支持向量机 文本分类 算法 SVM Text categorization Algorithm
  • 相关文献

参考文献4

二级参考文献20

  • 1胡佳妮,徐蔚然,郭军,邓伟洪.中文文本分类中的特征选择算法研究[J].光通信研究,2005(3):44-46. 被引量:47
  • 2陈展荣,曾毅平.Web汉语料的智能抽取与词汇切分[J].计算机工程与设计,2005,26(6):1422-1424. 被引量:4
  • 3Zhang Yizhong,Zhao Mingsheng,Wu Youshou.The automatic classification of web pages based on neural networks[C].Neural Information Processing,ICONIP2001 Proceedings,2001.570-575.
  • 4Fabrizio Sebastiani.Machine learning in automated text categorization[J].ACM Computing Surveys,2002,34 (1):11-12,32-33.
  • 5Deng Zhi-Hong,Tang Shi-Wei,Yang Dong-Qing,et al.Two odds radio-based text classification algorithms[C].Singapore:Proceedings of 1st International Workshop on Mining for Enhanced Web Search (MEWS 2002,Workshop of 3rd Conference of Web Information Systems Engineering),IEEE CS Press,2002.223-230.
  • 6Liang Jiu-zhen.Svm based Chinese web page automatic classification[C].Shanghai:Proceedings of the Third International Conference on Machine Learning and Cybernetics,IEEE,2003.2265-2268.
  • 7David Heckerman. Bayesian Networks for Data Mining[J] 1997,Data Mining and Knowledge Discovery(1):79~119
  • 8David Heckerman,Dan Geiger,David M. Chickering. Learning Bayesian Networks: The Combination of Knowledge and Statistical Data[J] 1995,Machine Learning(3):197~243
  • 9Gregory F. Cooper,Edward Herskovits. A Bayesian Method for the Induction of Probabilistic Networks from Data[J] 1992,Machine Learning(4):309~347
  • 10巩知乐,张德贤,胡明明.一种改进的支持向量机的文本分类算法[J].计算机仿真,2009,26(7):164-167. 被引量:37

共引文献78

同被引文献616

引证文献83

二级引证文献462

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部