高维空间L1范数约束的一类数据稀疏距离测度学习算法与应用

Sparse Distance Metric Learning with L1-Norm Constraint for One-Class Samples in High-Dimensional Space and Its Application

原文传递

导出

摘要现有一类分类算法通常采用经典欧氏测度描述样本间相似关系,然而欧氏测度不能较好地反映一些数据集样本的内在分布结构,从而影响这些方法对数据的描述能力.提出一种用于改善一类分类器描述性能的高维空间一类数据距离测度学习算法,与已有距离测度学习算法相比,该算法只需提供目标类数据,通过引入样本先验分布正则化项和L1范数惩罚的距离测度稀疏性约束,能有效解决高维空间小样本情况下的一类数据距离测度学习问题,并通过采用分块协调下降算法高效的解决距离测度学习的优化问题.学习的距离测度能容易的嵌入到一类分类器中,仿真实验结果表明采用学习的距离测度能有效改善一类分类器的描述性能,特别能够改善SVDD的描述能力,从而使得一类分类器具有更强的推广能力. Most one-class classification algorithms measure similarity based on euclidean distance between samples. Unfortunately, the Euclidean distance can not well reveal the internal distribution of some datasets, and reduces the descriptive ability of these methods. A distance metric learning algorithm in high-dimensional space is proposed to improve the performance of one-class classifiers in this paper. Compared with existing distance metric learning algorithm, the algorithm only needs to provide target class data, it can effectively solve distance metric learning problem for one-class samples in case of small sample size and in high-dimensional space by imposing sample distribution prior and sparsity prior with l1-norm constraint on the distance metric, and the formulation can be efficiently optimized in a block coordination descent algorithm. The learned metric can be easily embedded into one-class classifiers, the simulation experimental results show that the learned metric can effectively improve the description performance of one-class classifiers, in particular the description of SVDD, and it makes a stronger generalization ability of one-class classifiers.

作者胡正平路亮许成谦侯明玉

机构地区燕山大学信息科学与工程学院

出处《数学的实践与认识》 CSCD 北大核心 2011年第6期116-124,共9页 Mathematics in Practice and Theory

基金河北省自然科学基金(F2008000891) 河北省自然科学基金(F2010001297) 中国博士后自然科学基金(20080440124) 第二批中国博士后基金特别资助(200902356) 国家自然科学基金(61071199)

关键词高维空间稀疏距离测度学习 L1范数一类分类器 high-dimensional space sparse distance metric learning /l-norm one-class classifter

分类号 O174.12 [理学—基础数学]

引文网络
相关文献

参考文献20

1潘志松,陈斌,缪志敏,倪桂强.One-Class分类器研究[J].电子学报,2009,37(11):2496-2503. 被引量：37
2Mahadevan S, Shah S L. Fault detection and diagnosis in process data using one-class support vector machines[J]. Journal of Process Control, 2009, 19(10): 1627-1639.
3Mena L, Jesus A G. Symbolic one-class learning from imbalanced datasets: application in medical diagnosis~J]. International Journal on Artificial Intelligence Tools, 2009, 18(2): 273-309.
4D M J Tax, Duin R P W. Support vector data description[J]. Machine Learning, 2004, 54(1): 45-56.
5Lee K, Kim D W, Lee K H, et at. Density-induced support vector data description[J]. IEEE Transactions on Neural Networks, 2007, 18(1): 284-289.
6Guo S M. Chen L C, J S H Tsai.A boundary method for outlier detection based on support vector domain description[J]. Pattern Recognition, 2009, 42(1): 77-83.
7Hao Pei-Yi.Fuzzy one-class support vector machines[J]. Fuzzy Sets and Systems, 2008, 159(18): 2317-2336.
8Choi Y S.Least squares one-class support vector machine[J]. Pattern Recognition Letters, 2009, 30(13): 1236-1240.
9Piotr J, D M 3 Tax, Elzbieta P, et al.Nlinimum spanning tree based one-class classifier[J]. Neuro- computing, 2009, 72: 1859-1869.
10"Weinberger K Q, Saul L K.Distance metric learning for large margin nearest neighbor classifiea- tion[J].The Journal of Machine Learning Research, 2009, 10: 207-244.

二级参考文献55

1罗隽,丁力,潘志松,胡谷雨.异常检测中频率敏感的单分类算法研究[J].计算机研究与发展,2007,44(z2):235-239. 被引量：3
2燕继坤,王勇,曹春霞,郑辉.样本错误加权的支持向量数据描述[J].计算机工程,2005,31(2):24-26. 被引量：3
3潘志松,倪桂强,谭琳,胡谷雨.异常检测中单类分类算法和免疫框架设计[J].南京理工大学学报,2006,30(1):48-52. 被引量：5
4冯爱民,陈斌.基于局部密度的单类分类器LP改进算法[J].南京航空航天大学学报,2006,38(6):727-731. 被引量：3
5Markos M, Sameer S. Novelty Detection: A Review--Part Ⅰ: Statistical Approaches. Signal Processing, 2003, 83(12): 2481 -2497.
6Tax D. One-Class Classification--Concept-Learning in the Absence of Counter-Examples. Ph. D Dissertation. Holland, Netherlands : Delft University of Technology. Faculty of Electrical Engineering, 2001.
7Scholkopf B, Platt J, Shawe-Taylor J, et al. Estimating the Support of High-Dimensional Distribution. Neural Computation, 2001, 13 (7) : 1443 -1471.
8Tax D, Duin R. Support Vector Domain Description. Pattern Recognition Letters, 1999, 20( 11/12/13 ) : 1191 - 1199.
9Tax D, Duin R. Support Vector Data Description. Machine Learning, 2004, 54 ( 1 ) : 45 - 66.
10Quan Yong, Yang Jie. Modified Kernel Functions by Geodesic Distance. EURASIP Journal on Applied Signal Processing, 2004, 16 (1) : 2515 -2521.

共引文献38

1王俊,梅涛,孔斌,董翔.老年服务机器人视觉定位方法研究[J].华中科技大学学报（自然科学版）,2011,39(S2):255-258.
2胡正平,路亮,许成谦.基于高维空间稀疏最小生成树自适应覆盖模型的一类分类算法[J].模式识别与人工智能,2011,24(3):444-451.
3吴定海,张培林,任国全,傅建平.基于最大间隔超球分类器的柴油机异常检测研究[J].兵工学报,2011,32(7):790-794.
4胡正平,路亮,许成谦.基于高维空间典型样本Steiner最小树覆盖模型的一类分类算法[J].信号处理,2011,27(6):874-882. 被引量：1
5陈岳兵,冯超,张权,唐朝京.基于人工免疫系统的单类分类算法研究[J].计算机工程与设计,2011,32(9):3144-3147. 被引量：4
6王万良,王震宇,郑建炜,郑泽萍.密度诱导型数据描述单类分类机[J].控制与决策,2011,26(11):1665-1669. 被引量：1
7胡正平,路亮,许成谦.基于L1范数稀疏距离测度学习的单类分类算法[J].电子学报,2012,40(1):134-140. 被引量：4
8王洪波,赵光宙,齐冬莲,卢达.一类支持向量机的快速增量学习方法[J].浙江大学学报（工学版）,2012,46(7):1327-1332. 被引量：6
9杨森,孟晨,王成.基于最大分类间隔SVDD的电子装备状态监测模型研究[J].计算机测量与控制,2012,20(9):2335-2337.
10杨森,孟晨,王成.电子系统健康状态监测数据优化算法[J].计算机应用,2012,32(10):2927-2930. 被引量：1

1胡正平,路亮,许成谦.基于L1范数稀疏距离测度学习的单类分类算法[J].电子学报,2012,40(1):134-140. 被引量：4
2吴伟,高光来,聂建云.一种融合语义距离的最近邻图像标注方法[J].计算机科学,2015,42(1):297-302. 被引量：5
3Sailor.本本情报站[J].微型计算机,2003(18):68-68.
4sailor.本本情报站[J].微型计算机,2003(22):51-51.
5sailor.本本情报站[J].微型计算机,2003(21):57-57.
6本本情报站[J].微型计算机,2003(12):58-59.
7sailor.本本情报站[J].微型计算机,2003(13):56-57.
8本本情报站[J].微型计算机,2004(6):49-49.
92013年显卡性能大爆炸[J].新潮电子,2013(5):72-73.
10刘丛山,李祥宝,杨煜普.一种基于近邻元分析的文本分类算法[J].计算机工程,2012,38(15):139-141. 被引量：10

数学的实践与认识

2011年第6期

浏览历史

内容加载中请稍等...

高维空间L1范数约束的一类数据稀疏距离测度学习算法与应用

参考文献20

二级参考文献55

共引文献38

相关作者

相关机构

相关主题

浏览历史