期刊文献+

Entire Solution Path for Support Vector Machine for Positive and Unlabeled Classification

Entire Solution Path for Support Vector Machine for Positive and Unlabeled Classification
原文传递
导出
摘要 Support vector machines (SVMs) aim to find an optimal separating hyper-plane that maximizes separation between two classes of training examples (more precisely, maximizes the margin between the two classes of examples). The choice of the cost parameter for training the SVM model is always a critical issue. This analysis studies how the cost parameter determines the hyper-plane; especially for classifications using only positive data and unlabeled data. An algorithm is given for the entire solution path by choosing the 'best' cost parameter while training the SVM model. The performance of the algorithm is compared with conventional implementations that use default values as the cost parameter on two synthetic data sets and two real-world data sets. The results show that the algorithm achieves better results when dealing with positive data and unlabeled classification. Support vector machines (SVMs) aim to find an optimal separating hyper-plane that maximizes separation between two classes of training examples (more precisely, maximizes the margin between the two classes of examples). The choice of the cost parameter for training the SVM model is always a critical issue. This analysis studies how the cost parameter determines the hyper-plane; especially for classifications using only positive data and unlabeled data. An algorithm is given for the entire solution path by choosing the 'best' cost parameter while training the SVM model. The performance of the algorithm is compared with conventional implementations that use default values as the cost parameter on two synthetic data sets and two real-world data sets. The results show that the algorithm achieves better results when dealing with positive data and unlabeled classification.
出处 《Tsinghua Science and Technology》 SCIE EI CAS 2009年第2期242-251,共10页 清华大学学报(自然科学版(英文版)
基金 Supported by the National Natural Science Foundation of China(Nos.90604025 and 60703059) the Chinese Young Faculty Research Fund(No.20070003093)
关键词 support vector machine cost parameter positive and unlabeled classification support vector machine cost parameter positive and unlabeled classification
  • 相关文献

参考文献10

  • 1Yu H.SVMC:Single-class classification with support vector machines[].Proceedings of IJCAI‘.2003
  • 2Cauwenberghs G,Poggio T.Incremental and decremental support vector machine learning[].Proceedings of NIPS‘.2001
  • 3http://sdmc.lit.org.sg/GEDatasets/Datasets.html#DLBCL .
  • 4http://people.csail.mit.edu/jrennie/20Newsgroups/ . 2008
  • 5Joachims T.Making large-Scale SVM Learning Practical[].Advances in Kernel Methods-Support Vector Learning.1999
  • 6Manevitz L M,Yousef M.One-class SVMs for document classification[].Journal of Machine Learning Research.2001
  • 7Alizadeh AA,Eisen MB,Davis RE,et al.Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling[].Nature.2000
  • 8Hastie, T,Tibshirani, R,and Friedman, J.The Elements of statistical Learning:Data mining, Inference, and Prediction[]..2001
  • 9Hastie T,Rosset S,Tibshirani R et al.The Entire Regularization Path for the Support Vector Machine[].Journal of Machine Learning Research.2004
  • 10Li X,,Liu B."Learning to classify text using positive and unlabeled data."[].Proceedings of Eighteenth International Joint Conference on Artificial Intelligence (IJCAI-).2003

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部