期刊文献+

噪声消除与SMO算法收敛性

Eliminating Noisy and SMO Algorithm Convergence
下载PDF
导出
摘要 近年来,随着序列最小优化分类算法SMO等一系列快速算法的推出,支持向量机在自动文本分类研究领域取得了很大的成功。大多数文本分类问题是线性可分的,使用线性核函数的SMO算法能够取得非常好的分类效果。但是文本向量是一种非常稀疏的向量,采用线性核函数的SMO算法对噪声样本非常敏感,容易产生发散的问题。文章分析证明了噪声如何影响SMO算法收敛性。为了解决训练样本中噪声样本影响SMO算法收敛的问题,设计了一个消除噪声样本的算法,取得了非常好的效果。 In recent years,accompany with the appearance of a series of rapid training algorithm as Sequential Minimal Optimization (SMO),support vector machines achieved great success in text categorization.Most text categorization problems are linearly separable,and SMO algorithm using linear kernel-induced can perform well for text categorization. However,text vectors are a kind of extremely sparse vector,and SMO algorithm with linear kernel or polynomial kernel is very sensitive to the extremely sparse noisy example,which is easy to bring on the problem that algorithm can not converge.It is been proved that the noisy example how to influence the convergence of SMO algorithm in the paper.To solve the problem that noisy sample in training samples affect the convergence of SMO algorithm,this paper designs the algorithm that can eliminate noisy samples,and good results is achieved.
出处 《计算机工程与应用》 CSCD 北大核心 2006年第24期160-163,共4页 Computer Engineering and Applications
基金 国家自然科学基金资助项目(编号:60435010) 国家863高技术研究发展计划资助项目(编号:2003AA115220) 中澳科技合作特别基金项目 北京市自然科学基金资助项目(编号:4052025)
关键词 文本分类 支持向量机 SMO算法 噪声样本 text categorlzation,SVM, SMO algorithm, noisy sample
  • 相关文献

参考文献7

  • 1Thorsten Joachims.Text Categorization with Support Vector Machines:Learning with Many Relevant Features[C].In:Proceedings of European Conference on Machine Learning,1998
  • 2Thorsten Joachims.Making large-scale SVM learning practical.In:B Scholkopf,C J C Burges,A J Smola eds.Advances in Kernel MethodsSupport Vector Learning,MIT Press,1999:169~184
  • 3B E Boser,I M Guyon,V N Vapnik.A training algorithm for optimal margin classifiers[C].In:D Haussler eds.Proceeding of the 5th Annual ACM Workshop on Computational Learning Theory,ACM Press,1992:144~152
  • 4C Cortes,V Vapnik.Support vector networks[J].Machine Learning,1995;20:273~297
  • 5H Kuhn,A Tucher.Nonlinear programming[C].In:Proceedings of 2nd Berkley Symposium on Mathematical Statistics and Probabilistics,University of California Press,1951
  • 6J Platt.Fast training of support vector machines using sequential minimal optimization[C].In:B Scholkopf,C Burges,A Smola eds.Advances in Kernel Methods-Support Vector Learning,MIT Press,1998
  • 7K Aas,L Eikvil.Text categorization:a survey[R].Technical report,Norwegian Computing Center,1999-06

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部