期刊文献+

基于信息熵的支持向量数据描述分类 被引量:3

Classification method for SVDD based on information entropy
下载PDF
导出
摘要 针对现有的支持向量数据描述(SVDD)在解决分类问题时通常存在盲目性和有偏性,在研究信息熵和SVDD分类理论的基础上,提出了改进两类分类问题的E-SVDD算法。首先对两类样本数据分别求出其熵值;然后根据熵值大小决定将哪类放在球内;最后结合两类样本容量以及各自的熵值所提供的分布信息,对SVDD算法中的C值重新进行定义。采用该算法对人工样本集和UCI数据集进行实验,实验结果验证了算法的可行性和有效性。 Most of Support Vector Data Description(SVDD) methods have blindness and bias issues when working on two-class problems.The authors proposed a new SVDD method based on information entropy.In this algorithm,firstly,the entropy values were resolved respectively of the two classes of samples.Secondly,according to the size of the value,one class was placed inside the ball.Finally,the penalty was given based on the information provided by the sizes of the two sample data and their entropy values.The efficiency of this algorithm was verified by using artificial data and UCI datasets for the data imbalanced classification problem.The experimental results on artificial data sets and UCI data sets show the feasibility and effectiveness of the proposed method.
出处 《计算机应用》 CSCD 北大核心 2011年第4期1114-1116,共3页 journal of Computer Applications
基金 国家自然科学基金资助项目(60874074) 浙江省科技计划重点项目(2009C14032)
关键词 信息熵 分布特性 支持向量数据描述 分类 information entropy distribution character Support Vector Data Description(SVDD) classification
  • 相关文献

参考文献8

  • 1TAX D M J, DUIN R P W. Support vector domain description [ J ]. Pattern Recognition Letters, 1999, 20(11/12/13) : 1191 - 1199.
  • 2TAX D M J. One-class classification [ D ]. Delft, Netherlands: Delft University of Technology, 2001.
  • 3MU T, NANDI A K. Multiclass classification based on extended support vector data description [ J ]. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 2009, 39(5): 1206-1216.
  • 4CHEN B, LI B, PAN Z S. SVDD regularized with area under theROC [ C ] //Proceedings of the 2nd Intemational Conference on Interaction Sciences: Information Technology, Culture and Human. Seoul: [ s. n. ], 2009:364 -368.
  • 5TAX D M J, DUIN R P W. Support vector data description [ J ]. Machine Learning, 2004, 54( 1):45-66.
  • 6熊家军,李庆华.信息熵理论与入侵检测聚类问题研究[J].小型微型计算机系统,2005,26(7):1163-1166. 被引量:14
  • 7BABAIE T, LUCAS C. Variable selection using information entropy in time series prediction [ C ] //Proceedings of the Seventh International Conference on Computer Science and Information Technolo- gies. Yerevan, Armenia: [s.n.], 2009:118-121.
  • 8TAX D M J, JUSZCZAK P. Kernel whitening for oneclass classification [ J ]. International Joumal of Pattern Recognition and Artificial Intelligence, 2003, 17(3) : 333-347.

二级参考文献10

  • 1Klaus Julish. Data mining for intrusion detection:a critical review[R]. Switzerland:IBM Research, Zurich Research Laboratory, 2001.
  • 2Portnoy L, Eskin E, Stolfo S J. Intrusion detection with unlabeled data using clustering[C]. In:Proceedings of the ACM CCS Workshop on Data Mining for Security Applications,2001.
  • 3Han J, Kamber M. Data mining: concepts and techniques[M]. Morgan Kaufmann Publisher,2000.
  • 4Jain A, Murty M, Flynn P. Data clustering: a review[J]. ACM Computing Surveys, 1999, 31(3):513-521.
  • 5Guha S, Rastogi R, Shim K. ROCK: A robust clustering algorithm for categorical attributes[J]. Information Systems, 2000, 25(5):345-366.
  • 6Daniel Barbara, Julia Couto, Yi Li. COOLCAT: An entropy-based algorithm for categorical clustering[D]. George Mason University, Information and Software Engineering Department, October 1,2001.
  • 7Periklis Andritsos, Panayiotis Tsaparas, Renee J.Miller et al. LIMBO:a scalable algorithm to cluster categorical data[R]. University of Toronto, Department of Computer Science, 2003,7.
  • 8Li Xiang-yang. Clustering and classification algorithm for computer intrusion detection[D]. Arizone State University,2001.
  • 9Wenke Lee, Dong Xiang. Information-theoretic measures for anomaly detection[D].Computer Science Department, North Carolina State University, 2000.
  • 10Garey M,Johnson D.Computers and intractability:a guide to the theory of NP-completeness[M]. W.H.Freeman,1979.

共引文献13

同被引文献36

  • 1汪景宁.安徽统计年鉴[M].北京:中国统计出版社,2008.
  • 2陈斌,冯爱民,陈松灿,李斌.基于单簇聚类的数据描述[J].计算机学报,2007,30(8):1325-1332. 被引量:18
  • 3Chames A, Cooper W W, Rhodes E. Measuring The Efficiency of Decision Making Units [ J ]. European Journal of Operational Research, 1978 (2) :429-444.
  • 4SCHOLKOPF B,SMOLA A.Learning with kernels[M].Cambridge:MIT Press,2002.
  • 5CRISTIANINI N,TAYLOR J S.An introduction to support vector machines[M].Cambridge:Cambridge University Prees,2000.
  • 6ZAFEIRIOU S,TEFAS A,PITAS I.Minimum class variance support vector machines[J].IEEE Transactions on Image Processing,2007,16(10):2551-2564.
  • 7HUANG K Z,YANG H Q,KING I,et al.Maxi-min margin machine:Learning large margin classifiers locally and globally[J].IEEE Transactions on Neural Networks,2008,19(2):260-272.
  • 8TAX D M J,DUIN R P W.Support vector data description[J].Machine Learning,2004,54(1):45-66.
  • 9BEN-HUB A,HORN D,SIEGELMANN H T,et al.Support vector clustering[J].Journal of Machine Learning Research,2001,2:125-137.
  • 10TSANG I W,KWOK J T,ZURADA J M.Generalized core vector machines[J].IEEE Transactions on Neural Networks,2006,17(5):1126-1140.

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部