摘要
在支持向量机 (supportvectormachines ,SVM )中 ,训练样本总是具有明确的类别信息 ,而对于一些不确定性问题并不恰当 受贝叶斯决策规则的启发 ,利用样本的后验概率来表示这种不确定性 将贝叶斯决策规则与SVM相结合 ,建立后验概率支持向量机 (posterioriprobabilitysupportvectorma chine ,PPSVM )的体系框架 并详细讨论线性可分性、间隔、最优超平面以及软间隔算法 ,得到了一个新的优化问题 ,同时给出了一个支持向量的新定义 实际上 ,后验概率支持向量机是建立于统计学习理论(statisticallearningtheory)基础之上 ,是标准SVM的扩展 针对数据 ,还提出了一个确定后验概率的经验性方法 实验也证明了后验概率支持向量机的合理性。
To solve uncertain classification problem, an SVM (support vector machine) is trained to behave like a Bayesian optimal classifier based on the training data. The idea is to weigh each unbalanced training sample by a posteriori probability. A whole framework of posteriori probability support vector machine (PPSVM) is presented and SVM is reformulated into PPSVM. The linear separability, margin, optimal hyperplane and soft margin algorithms are discussed. A new optimization problem is obtained and a new definition of support vector is given. In fact, PPSVM is motivated by statistical learning theory and is an extension of regular SVM. An empirical method is also proposed for determining the posteriori probability. Two artificial examples show that PPSVM formulation is reasonable if the class-conditional probability is known, and some real experiments demonstrate that the weighted data cases by some empirical methods can produce better results than regular SVM.
出处
《计算机研究与发展》
EI
CSCD
北大核心
2005年第2期196-202,共7页
Journal of Computer Research and Development
基金
国家自然科学基金项目 (60 175 0 2 3 )
安徽省优秀青年科技基金项目 (0 40 42 0 69)
关键词
支持向量机
分类
后验概率
间隔
最大间隔算法
非确定性分类问题
support vector machines
classification
posterior probability
margin
maximal margin algorithm
uncertain classification problem