Abstract
Existing Support Vector Data Description (SVDD) methods typically suffer from blindness and bias when applied to two-class classification problems. Building on information entropy and SVDD classification theory, an improved algorithm for two-class problems, E-SVDD, is proposed. First, the entropy of each of the two sample classes is computed; next, the entropy values determine which class is placed inside the hypersphere; finally, the penalty parameter C of the SVDD algorithm is redefined using the distribution information provided by the two classes' sample sizes and their entropy values. Experiments on artificial sample sets and UCI data sets verify the feasibility and effectiveness of the algorithm.
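The abstract does not give the exact formulas, but the two entropy-driven steps it describes (choosing the target class and reweighting the penalty C) can be sketched as follows. This is a minimal illustration, not the paper's E-SVDD: the histogram-based entropy estimator, the choice of the lower-entropy class as the hypersphere target, and the specific penalty weighting are all assumptions made here for illustration.

```python
import numpy as np

def shannon_entropy(samples, bins=10):
    """Estimate the Shannon entropy of a 1-D sample via a histogram
    (an assumed estimator; the paper does not specify one)."""
    counts, _ = np.histogram(samples, bins=bins)
    p = counts / counts.sum()
    p = p[p > 0]                      # drop empty bins to avoid log(0)
    return -np.sum(p * np.log2(p))

def choose_target_and_penalties(class_a, class_b, C=1.0):
    """Pick which class to describe with the SVDD hypersphere and
    assign per-class penalties from sample sizes and entropies.

    Assumption: the lower-entropy (more compactly distributed) class
    is placed inside the ball, and each class's penalty shrinks as its
    own size and entropy grow. This weighting is hypothetical.
    """
    h_a, h_b = shannon_entropy(class_a), shannon_entropy(class_b)
    n_a, n_b = len(class_a), len(class_b)
    target = 'A' if h_a <= h_b else 'B'
    # Hypothetical redefinition of C: each class's penalty is scaled
    # by the *other* class's entropy and size share.
    c_a = C * (h_b / (h_a + h_b)) * (n_b / (n_a + n_b))
    c_b = C * (h_a / (h_a + h_b)) * (n_a / (n_a + n_b))
    return target, c_a, c_b

# Usage: a tight Gaussian cluster vs. widely spread uniform noise.
rng = np.random.default_rng(0)
tight = rng.normal(0.0, 1.0, 500)
spread = rng.uniform(-5.0, 5.0, 500)
target, c_a, c_b = choose_target_and_penalties(tight, spread)
```

With these assumptions, the compact Gaussian cluster ends up as the target class, and the larger or more scattered class contributes a smaller penalty weight, mirroring the abstract's idea of letting the distribution information steer both choices.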
Source
Journal of Computer Applications (《计算机应用》)
Indexed in CSCD and the Peking University Core Journal list
2011, Issue 4, pp. 1114-1116 (3 pages)
Funding
National Natural Science Foundation of China (60874074)
Key Project of the Zhejiang Provincial Science and Technology Program (2009C14032)
Keywords
information entropy
distribution characteristics
Support Vector Data Description (SVDD)
classification