摘要
针对混合不完备数据集,提出一种基于邻域联系度距离拓展高斯核函数的支持向量机分类方法。首先,给出了基于邻域联系度距离拓展高斯核函数的定义;其次,给出基于二次函数逼近的支持向量机SMO训练算法和分类算法;最后,取多个UCI数据集进行了实验分析,通过与填充支持向量机、混合距离支持向量机和风险重构支持向量机分类方法进行比较,结果表明提出的分类方法在不对缺失值作任何处理、不改变支持向量机模型结构与约束条件的情况下,仍然获得了优异的分类效果。
In order to process incomplete data set with hybrid value,a novel classification method based on Expand Gaussian kernel-Support vector machine is proposed.Firstly,based on the connection degree distance function for Expand Gaussian kernel is defined;Secondly,the SMO training algorithm base on quadratic function approximation for support vector machine and classifi⁃cation algorithm are provided.Finally,some experiments are carried out on UCI data sets,and compare to the filling support vector machine,the risk reconstruction support vector machine and the HEOM support vector machine,the experiments show that the ex⁃cellent classification results of proposed classification method which is no need to handle missing values and no need to change SVM model structure and constraints are still obtained.
作者
黄恒秋
翁世洲
Huang Hengqiu;Weng Shizhou(School of Mathematics,Physics and Electronic Information Engineering,Guangxi Normal University for Nationalities,Chongzuo 532200;School of Economic management,Guangxi Normal University for Nationalities,Chongzuo 532200)
出处
《现代计算机》
2022年第21期18-25,共8页
Modern Computer
基金
广西高校中青年教师科研基础能力提升项目(2020KY20012):混合型不完备数据的邻域粗糙集分类方法
广西高校中青年教师科研基础能力提升项目:概率数及其在不确定决策理论中的应用研究
广西民族师范学院校级科研项目(2020YB007):模糊区间粗糙数信息系统下的多属性决策方法及应用研究。
关键词
混合不完备数据
联系度距离
联系度距离高斯核
支持向量机
incomplete data with hybrid value
connection degree distance
connection degree distance Gaussian kernel
support vector machine