期刊文献+

基于带权多维尺度变换的奇异值挖掘 被引量:3

Outliers Mining via Weighted Multidimensionality Scaling
下载PDF
导出
摘要 大量的高维数据在分布上表现为一低维流形,试图从这样的数据集中探测出奇异点,传统的奇异点挖掘算法可能失效。本文提出了一种带权重的多维尺度变化,算法通过局部的高维数据集和其低维重构的误差来设定数据点的局部权重,再利用权重之和得到的数据点置信度,以此来进行奇异值的判定。通过实验验证了算法的有效性。 Mining outliers from the data set which is distributed on a low dimensional manifold is a hard task. The existing algorithm may not be effective for the situation. So a novel approach called weighted multidimensionality scaling is proposed for outliers mining. It is based on multidimensionality scaling, MDS. Every data point will get a reliability score by the algorithm, then it can be determined whether it is a outlier through the value of its reliability score. The experiments show the efficiency of the algorithm.
出处 《计算机科学》 CSCD 北大核心 2008年第1期190-192,共3页 Computer Science
基金 国家自然科学基金项目(60495019) 教育部博士点专项基金(20060247039)
关键词 奇异值 多维尺度变换 带权多维尺度变换 流形学习 Outliers, MDS, Weighted MDS, Manifold learning
  • 相关文献

参考文献10

  • 1Han Jiawei, Kamber M. Data Mining: Concepts and techniques [M]. Morgan Kaufmann Publishers, 2001.
  • 2Grubbs F E. Procedures for detecting outlying observations in sampies [J]. Technometrics, 1969,11 : 1-21.
  • 3Barnett V, Lewis T. Outliers in Statistical Data [M]. John Wiley Sons, 1994.
  • 4Knorr E, Ng R. A unified notion of outliers: Properties and computation. In: Proc. 1997 Int. Conf. Knowledge Discovery and Data Mining (KDD'97), 1997. 219-222.
  • 5Knorr E, Ng R. Algorithms for mining distancebased outliers in large datasets. In; Proc. 1998 Int. Conf. Very Large Data Bases (VLDB'98), 1998. 392-403.
  • 6Arning A, Agrawai R, Raghavan P. A iinear method for deviation detection in iarge databases. In:Proc. 1996 Int. Conf. Data Mining and KnowiedgeDiscovery (KDD'96), 1996. 164-169.
  • 7Breunig M, Kriegei H P, Ng R, Sander J. LOF: Identifying Density-Based Local Outliers. In:Proc. ACM SiGMOD 2000 Int. Conf. on Management of Data, 2000. 93-104.
  • 8Chang Hong, Yeung D Y. Robust locally linear embedding [J]. Pattern Recognition, 2006,39 : 1053-1065.
  • 9Cox T, Cox M. Multidimensional Scaling [M]. London Chapman&Hali, 1994.
  • 10Huber P J. Robust regression: asymptotics, conjectures, and Monte Carlo, Ann. Statist,1973,1(5) :799-821.

同被引文献28

引证文献3

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部