基于最大散度差的保序分类算法

A Rank-preserving Classification Method Based on Maximum Scatter Difference

下载PDF

导出

摘要分类算法主要存在问题:(1)无法充分利用样本的分布特征;(2)无法保持样本的相对关系不变;(3)无法解决大规模分类问题。对此,提出了一种基于最大散度差的保序分类算法RPCM,该方法利用线性判别分析算法中的类间离散度和类内离散度来表征样本的分布特征,通过保持各类样本中心相对关系不变来实现样本相对关系不变。理论分析表明:RPCM的对偶形式与最小包含球等价。在核心向量机的基础上提出了RPCM-CVM算法,该算法可用来解决大规模分类问题,标准数据集上的比较实验验证了所提方法的有效性。 There exist several problems in the traditional classifiers ：（ 1 ） they cant fully utilize the distribution feature of training data ; （2） they cant preserve the rank relations between different classes; （3） most of them cant deal with the large-scale classification prob- lem. In order to solve the above problems, a rank-preserving classification method based on maximum scatter difference （ RPCM ） is pro- posed in this paper. The between-class scatter and the within-class scatter in linear discriminant analysis （LDA） are introduced to de- scribe the distribution feature and the rank relations between different classes can be preserved by keeping the average values of differ- ent classes invariant. It can be proved that the dual form of RPCM is equivalent to the minimal enclosing ball （MEB） by theoretical a- nalysis and the RPCM-CVM algorithm is proposed based on core vector machine （ CVM ）, which can be used to solve the large-scale classification problem. The experiments on several standard datasets verify the effectiveness of the proposed RPCM and RPCM-CVM methods.

作者郝伟刘忠宝

机构地区山西工商学院计算机信息工程学院中北大学软件学院

出处《西安石油大学学报（自然科学版）》 CAS 北大核心 2017年第4期123-126,共4页 Journal of Xi’an Shiyou University（Natural Science Edition）

基金国家自然科学基金项目(编号:61202311) 山西自然科学基金项目(编号:201601D011042)

关键词最大散度差保序分类类间离散度类内离散度 maximum scatter difference rank preserving classification between-class scatter within-class scatter

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1冯昌,廖士中.随机傅里叶特征空间中高斯核支持向量机模型选择[J].计算机研究与发展,2016,53(9):1971-1978. 被引量：10

二级参考文献25

1Vapnik V N. Statistical I.earning Theory [M]. New York: John Wiley I Sons, 1998.
2Sch61kopf B, Smola A J. Learning with Kernels: Support Vector Machines, Regularization, Optimization,and Beyond [M]. Cambridge, MA: MIT Press, 2002.
3Chapelle O, Vapnik V. Model selection for support vector machines [C] //Advances in Neural In{ormation Processing Systems 12. Cambridge, MA: MIT Press, 2000:230-236.
4Guyon I, Saffari A, Dror G, et al. Model selection: Beyond the Bayesian/frequentist divide [J]. Journal o{ Machine Learning Research, 2010, 11:61-87.
5Duan K, Keerthi S S, Poo A N. Evaluation of simple performance measures for tuning SVM hyperparameters [J]. Neurocomputing, 2003, 51:41-59.
6Chapelle O, Vapnik V N, Bousquet O, et ai. Choosing multiple parameters for support vector machines [J]. Machine Learning, 2002, 46(1/2/3): 131-159.
7Vapnik V N, Chapelle O. Bounds on error expectation for support vector machines [J]. Neural Computation, 2000, 12 (9) : 2013-2036.
8Platt J C. Fast Training of support vector machines using sequential minimal optimization [C] //Advances in Kernel Methods: Support Vector Learning. Cambridge, MA: MIT Press, 1999:185-208.
9Zhang T. Solving large scale linear prediction problems using stochastic gradient descent algorithms [C] //Proc of the 21st Int Conf on Machine I.earning. New York: ACM, 2004: 919-926.
10Joachims T. Training linear SVMs in linear time [C] //Proc of the 12th ACM SIGKDD Int Conf on Knowledge Discovery and Data Mining. New York: ACM, 2006:217-226.

共引文献9

1高世伟,赵力.一种基于支持向量机的软测量建模方法[J].自动化仪表,2017,38(7):42-45. 被引量：5
2韩志卓,廖士中.高斯核选择的线性性质检测方法[J].模式识别与人工智能,2017,30(9):815-821.
3冯昌,廖士中.大规模核方法的随机假设空间方法[J].计算机科学与探索,2018,12(5):785-793. 被引量：6
4张闯,廖士中.并行效率敏感的大规模SVM数据分块数选择[J].数据采集与处理,2018,33(6):1068-1076. 被引量：1
5张骁,廖士中.基于局部后悔的在线核选择[J].计算机学报,2019,42(1):61-72. 被引量：1
6黄华娟,韦修喜.基于自适应调节极大熵的孪生支持向量回归机[J].南京大学学报（自然科学版）,2019,55(6):1030-1039. 被引量：2
7黄华娟,韦修喜,周永权.基于模糊核聚类粒化的粒度支持向量机[J].智能系统学报,2019,14(6):1271-1277. 被引量：2
8廖芸,张骁,廖士中.统一框架下在线核选择的竞争性分析[J].计算机科学与探索,2020,14(7):1126-1132.
9武玉坤,李伟,陈沅涛.卷积自编码器融合核近似技术的异常检测模型[J].计算机测量与控制,2022,30(3):259-265.

1刘海峰,刘守生,姚泽清.一种基于类别的混合型文本特征降维[J].微电子学与计算机,2010,27(10):13-17. 被引量：1
2史荧中,汪菊琴,许敏,王士同.正则化多任务学习的快速算法[J].计算机科学与探索,2017,11(6):988-997. 被引量：4
3蒋栋年,李炜.基于数据驱动残差评价策略的故障检测方法[J].控制与决策,2017,32(7):1181-1188. 被引量：16
4李小敏.AMS SIR-S航管二次监视雷达系统分析[J].科技视界,2014(32):101-103. 被引量：1
5刘君,何南,武和雷.一种基于FCM和VBM的头部磁共振图像多组织分割算法[J].南昌大学学报（工科版）,2017,39(2):179-183.
6吴群群,王兴起.基于时间自动机的符号状态拆分优化算法[J].计算机工程与设计,2017,38(7):1866-1871. 被引量：2
7秦波,张鲁洋,孙国栋,王建国.排列熵与核极限学习机在齿轮故障诊断中的应用[J].中国测试,2017,43(7):108-111. 被引量：3
8刘莹,王宁,李保华,罗强.模糊语法方法在犯罪文本分类中的应用[J].计算机工程与设计,2017,38(7):1965-1971. 被引量：2

西安石油大学学报（自然科学版）

2017年第4期

浏览历史

内容加载中请稍等...

基于最大散度差的保序分类算法

参考文献1

二级参考文献25

共引文献9

相关作者

相关机构

相关主题

浏览历史