期刊文献+

一种高效的高维异常数据挖掘算法 被引量:7

Efficient Data Mining Algorithm for High-dimensional Outlier Data
下载PDF
导出
摘要 针对高维异常数据的挖掘问题,提出一种基于最大间隔准则和最小最大概率机的高维异常数据挖掘算法。利用最大间隔准则算法将高维数据投影到低维特征空间,再利用最小最大概率机进行异常数据的挖掘。实验结果表明,该算法检测准确率较高。 To effectively cope with data mining problem for high-dimensional outlier data, a novel outlier data mining algorithm based on Maximum Margin Criterion(MMC) and MiniMax Probability(MMP) machine is proposed. The high dimensional data sets are first projected into lower-dimensional feature space by using MMC algorithm, and MMP machine is adopted to mine outlier data. Experimental results show that the proposed algorithm is feasible and has higher detection accuracy.
出处 《计算机工程》 CAS CSCD 北大核心 2010年第21期34-36,共3页 Computer Engineering
基金 教育部科学技术研究基金重点资助项目(107021)
关键词 异常数据 最大间隔准则 最小最大概率机 数据挖掘 outlier data Maximum Margin Criterion(MMC) MiniMax Probability(MMP) machine data mining
  • 相关文献

参考文献5

  • 1Li Haifeng,Jiang Tao,Zhang Keshu.Efficient and Robust Feature Extraction by Maximum Margin Criterion[J].IEEE Transactions on Neural Networks.2006,17(1):157-165.
  • 2王靖.基于鲁棒的全局流形学习方法[J].计算机工程,2008,34(9):192-194. 被引量:6
  • 3Lanckriet G R G,Ghaoui L E,Bhattacharyya C,et al.A Robust Minimax Approach to Classification[J].The Journal of Machine Learning Research,2002,25(3): 555-582.
  • 4Blake C L,Merz C J.UCI Repository of Machine Learning Databases[EB/OL].(1998-05-01).http://www.ics.uci.edu/mlearn/ MLRepository.html.
  • 5Hettich S,Bay S D.KDD CUP 1999 Data[EB/OL].(1999-10-28).http://kdd.ics.uci.edu/databases/kddcup99/kddcup.html.

二级参考文献6

  • 1Tenenbaum J.A Global Geometric Framework for Nonlinear Dimension Reduction[J].Science,2000,290(5500):2319-2323.
  • 2Roweis S.Nonlinear Dimensionality Reduction by Locally Linear Embedding[J].Science,2000,290(22):2323-2326.
  • 3Belkin M.Laplacian Eigenmaps and Special Techniques for Embedding and Clustering[M].Cambridge,MA,USA:MIT Press,2002:585-591.
  • 4Zhang Zhenyue,Zha Hongyuan.Principal Manifolds and Nonlinear Dimensionality Reduction via Tangent Space Alignment[J].SIAM Journal of Scientific Computing,2005,26(1):313-338.
  • 5Silva V D.Global Versus Local Methods in Nonlinear Dimensionality Reduction[C]//Proc.of Conference on Advances in Neural Information Processing Systems.Cambridge,MA,USA:MIT Press,2003:705-712.
  • 6Yeung D Y.Robust Locally Linear Embedding[J].Pattern Recognition,2006,39(6):1053-1065

共引文献5

同被引文献35

  • 1李德毅,刘常昱,杜鹢,韩旭.不确定性人工智能[J].软件学报,2004,15(11):1583-1594. 被引量:398
  • 2陆声链,林士敏.基于距离的孤立点检测研究[J].计算机工程与应用,2004,40(33):73-75. 被引量:44
  • 3蒋盛益,李庆华,王卉,孟中楼.一种增强的局部异常挖掘方法[J].计算机研究与发展,2005,42(2):210-216. 被引量:8
  • 4李庆华,李新,蒋盛益.一种面向高维混合属性数据的异常挖掘算法[J].计算机应用,2005,25(6):1353-1356. 被引量:3
  • 5E Hong, D W Cheung. Parallel mining of outliers inlarge database[J]. Distributed and Parallel Databases,2002,12(l):5-26.
  • 6Eslin Eleazar. Sparse sequence modeling with applica-tions to computational biology and intrusion detection[D]. USA:Colee MBIA Unversity,2003.
  • 7Li Deyi, Han Jiawei , Shi Xuemei,et al. Knowledgerepresentation and discovery based on linguistic atoms[J]. Knowledge-based Systems, 1998? 10 (7) : 431-440.
  • 8Blake C L,Merz C J. UCI repository of machine learn-ing databases[EB/OL]. (1998-05 - 01) [2012 - 09 -20]. http://www. ics. uci. edu/mlearn/MLReposito-ry. html.
  • 9HASTIE T;TIBSHIRANI R;FRIEDMAN J;范明;柴玉梅;昝红英.统计学习基础一数据挖掘、推理与预测[M]北京:电子工业出版社,2004.
  • 10B(U)HLMANN P,Van de GEER S. Statistics for High-Dimensional Data:Methods,Theory and Applications[M].Berlin.Springer-Verlag Berlin Heidelberg,2011.

引证文献7

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部