i-ξα Estimator of SVM Text Classification


Abstract: The ξα estimator is an important criterion for SVM model selection. By analyzing the properties of the support vectors, it estimates the number of leave-one-out (LOO) errors on the training set after training the SVM only once, which in turn indicates whether the current model parameters are well chosen. This paper analyzes the characteristics of text vectors and of the RBF kernel, improves the ξα estimator for the text classification domain, and proposes a computationally simple variant called the i-ξα estimator. Experiments show that the i-ξα estimator retains the accuracy of the ξα estimator while being markedly faster to compute.
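For orientation, the counting rule behind the standard ξα estimator (due to Joachims) can be sketched as follows. This is not the i-ξα variant proposed in the paper, whose details are not given in the abstract. The function name `xi_alpha_loo_error` and the sample values are hypothetical. The sketch assumes the dual coefficients α_i and slack variables ξ_i of an already trained soft-margin SVM are available, and that `r_delta_sq` upper-bounds the spread of kernel values; for an RBF kernel, whose values lie in (0, 1], taking 1.0 is a valid though possibly loose choice.

```python
from typing import Sequence


def xi_alpha_loo_error(
    alphas: Sequence[float],
    xis: Sequence[float],
    r_delta_sq: float,
    rho: float = 2.0,
) -> float:
    """Estimate the leave-one-out error rate from a single trained SVM.

    A training example i is counted as a potential LOO error when
    rho * alpha_i * r_delta_sq + xi_i >= 1. The estimate is the number
    of such examples divided by the training-set size.
    """
    if len(alphas) != len(xis):
        raise ValueError("alphas and xis must have the same length")
    n = len(alphas)
    if n == 0:
        raise ValueError("at least one training example is required")
    # Count the examples that satisfy the xi-alpha condition.
    d = sum(
        1
        for alpha, xi in zip(alphas, xis)
        if rho * alpha * r_delta_sq + xi >= 1.0
    )
    return d / n


# Hypothetical values for four training examples (not from the paper).
example_alphas = [0.0, 0.2, 0.6, 1.0]
example_xis = [0.0, 0.0, 0.1, 1.3]
estimate = xi_alpha_loo_error(example_alphas, example_xis, r_delta_sq=1.0)
print(estimate)
```

Because only the α_i and ξ_i of one trained model are needed, the estimate costs a single pass over the training set, in contrast to retraining once per left-out example.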

Authors: WANG Ye (王晔), HUANG Shangteng (黄上腾)

Affiliation: Department of Computer Science and Engineering, Shanghai Jiao Tong University

Source: Pattern Recognition and Artificial Intelligence (《模式识别与人工智能》), 2005, No. 6, pp. 670-674 (5 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

Keywords: Generalization Error, Leave-One-Out Error (LOO Error), ξα Estimator, Support Vector Machine, Text Classification

Classification code: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]


