
An Outlier Mining Algorithm Based on Attribute Similarity (cited by: 1)
Abstract: Outlier mining is an active research topic in data mining. Building on an analysis of existing outlier mining techniques, and in combination with a density-based clustering algorithm, we propose an improved outlier detection method: outlier mining based on attribute similarity (ADBSCAN). The method first clusters the data with a density-based clustering algorithm. It then uses the attribute similarity between objects as a further check, to determine whether each object that does not belong to any cluster is a true outlier. Experiments verify the feasibility and effectiveness of the method.
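The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the authors' ADBSCAN implementation: the abstract does not define the attribute similarity measure or any thresholds, so the similarity function below (the fraction of attributes on which two objects differ by at most `tol`), the decision rule (a candidate is a true outlier if its best similarity to any clustered object is below `sim_min`), and all parameter names are assumptions made purely for illustration. The first stage is a plain textbook DBSCAN.

```python
from math import dist


def dbscan(points, eps, min_pts):
    """Minimal DBSCAN. Returns one label per point; -1 means noise."""
    labels = [None] * len(points)

    def neighbors(i):
        # Indices of all points within eps of point i (including i itself).
        return [j for j, q in enumerate(points) if dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1  # provisional noise; may become a border point later
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core point: border point
            if labels[j] is not None:
                continue  # already assigned, do not expand again
            labels[j] = cluster
            nb = neighbors(j)
            if len(nb) >= min_pts:  # j is a core point, so keep expanding
                queue.extend(nb)
    return labels


def attribute_similarity(a, b, tol):
    """ASSUMED measure: fraction of attributes that differ by at most tol."""
    matches = sum(1 for x, y in zip(a, b) if abs(x - y) <= tol)
    return matches / len(a)


def find_outliers(points, eps, min_pts, tol, sim_min):
    """Stage 1: cluster. Stage 2: re-check each unclustered object."""
    labels = dbscan(points, eps, min_pts)
    clustered = [p for p, lab in zip(points, labels) if lab != -1]
    outliers = []
    for i, (p, lab) in enumerate(zip(points, labels)):
        if lab != -1:
            continue
        # ASSUMED rule: compare the candidate with every clustered object.
        best = max((attribute_similarity(p, q, tol) for q in clustered),
                   default=0.0)
        if best < sim_min:
            outliers.append(i)
    return outliers


points = [(0, 0), (0, 1), (1, 0), (1, 1), (1, 5), (9, 9)]
print(find_outliers(points, eps=1.5, min_pts=3, tol=0.5, sim_min=0.5))  # prints [5]
```

In this toy data set DBSCAN leaves both `(1, 5)` and `(9, 9)` outside the single cluster. Under the assumed rule, `(1, 5)` still shares one of its two attributes with clustered objects, so only `(9, 9)` is reported as an outlier.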
Authors: Peng Ling (彭玲), Xu Tingrong (徐汀荣)
Source: Computer Applications and Software (《计算机应用与软件》), CSCD, 2010, No. 12, pp. 236-237, 246 (3 pages)
Keywords: outlier; data mining; attribute similarity
