期刊文献+

广义加权Minkowski距离及量子遗传聚类算法 被引量:3

General Weighted Minkowski Distance and Quantum Genetic Clustering Algorithm
下载PDF
导出
摘要 相异度和相似度度量是聚类算法中非常重要的一种因素,往往会影响到聚类分析的结果。很多聚类算法采用欧式距离作为计算数据相似度的度量。而欧式距离不能反映属性值的全局特性,且不顾及各属性之间的量纲差异,因此当不同属性间具有明显量纲或值域差异时,不能取得很好的效果。对此,提出了一种广义加权Minkowski距离,即由各属性的量纲和值域信息来确定各属性的广义权值,既考虑了整个数据集的特性,又消除了各属性之间的不和谐,同时分位数的引进在一定程度上减弱了噪声属性值对距离度量的影响。将提出的新的距离度量用于经典的k-means算法和量子遗传聚类算法,实验结果表明,采用新的距离度量和引进量子遗传算法的聚类是更加有效的。 Difference and similarity are very important factor in clustering algorithms, and always affect the results of clustering analysis. A lot of clustering algorithms use Euclidean distance as it' s similarity measure. Euclidean distance can't reflect the global information of attributes, and don't consider the unit differences between each attribute, so it can' t make a good result when there is obvious unit and domain differences. So, this paper put forward a generally weighted Minkowski distance which is determined by the unit and domain information of each attributes value. Not only charac- teristics of whole data are considered, but also dicord between attributes is removed, at the same time, using of fractional bits weakens the noise data influence. We used new distance measure in classic k-means. And quantum genetic k-means and the experimental result show that the new algorithm is effective.
出处 《计算机科学》 CSCD 北大核心 2013年第5期224-228,共5页 Computer Science
基金 国家水体污染控制与治理科技重大专项(2009ZX07318-003-01-02) 水利部公益性行业科研专项(201001031)资助
关键词 数据聚类 Minkowski距离 分位数 全局信息 量子遗传算法 Data clustering Minkowski distance Fractional bits Global information QGA
  • 相关文献

参考文献16

  • 1孙吉贵,刘杰,赵连宇.聚类算法研究[J].软件学报,2008(1):48-61. 被引量:1061
  • 2Nguyen C D, Cios K J. GAKREM: A novel hybrid clustering algorithm[J]. Information Sciences, 2008,178(22) : 4205-4227.
  • 3Liu Jing-wei, Xu Mei-zhi. Kernelized fuzzy attribute C-means clustering algorithm [J]. Fuzzy Sets and Systems, 2008, 159 (18) :2428-2445.
  • 4Graves D, Pedrycz W. Kernel-based fuzzy clustering and fuzzy clustering : A comparative experimental study[J]. Fuzzy sets and systems, 2010,161 (4) : 522-543.
  • 5Li M J, Ng M K, Cheung Y-M, et al. Agglomerative fuzzy K- Means clustering algorithm with selection of number of clusters [J] IEEE Transactions on Knowledge and Data Engineering, 2008,20(11) : 1519-1534.
  • 6Bobrowski L, Bezdek J C. c-means clustering with the L1 and Loo norms[J]. IEEE Trans on System, Man and Cybernetics, 1991, 21 (3): 545-554.
  • 7Hathaway R J, Bezdek J C, Hu Y. Generalized fuzzy c-means clustering strategies using Lp norm distances[J]. IEEE Trans on Fuzzy Systems, 2000,8 (5) : 576-582.
  • 8Weinberger K Q, Saul L K. Distance metric learning for large margin nearest neighbor classification[J]. J of Machine Learning Research,2009,10(2):207-244.
  • 9叶斌,胡修林,张蕴玉,陈海峰.基于3D Zernike矩的三维地形匹配算法及性能分析[J].宇航学报,2007,28(5):1241-1245. 被引量:9
  • 10Zhang D, ChenS. A comment on alternative c-means clustering algorithms[J]. Pattern Recognition, 2004,37 (2) : 173-174.

二级参考文献26

  • 1李洁,高新波,焦李成.基于特征加权的模糊聚类新算法[J].电子学报,2006,34(1):89-92. 被引量:113
  • 2李士勇,李盼池.基于实数编码和目标函数梯度的量子遗传算法[J].哈尔滨工业大学学报,2006,38(8):1216-1218. 被引量:60
  • 3李阳阳,焦李成.求解SAT问题的量子免疫克隆算法[J].计算机学报,2007,30(2):176-183. 被引量:45
  • 4徐利治 王兴华.数学分析的方法及例题选讲[M].北京:高等教育出版社,1984.120.
  • 5裴礼文.数学分析中的典型问题与方法[M].北京:高等教育出版社,2002..
  • 6Sobolev(崔志勇等译).泛函分析在数学物理中的应用[M].长春:吉林省大学出版社,1992..
  • 7[1]Lane DM.Underwater Robotics:Mission programming and navigation of underwater vehicles[J].International Journal of Systems Science,1998,29(6):294-300
  • 8[2]Larry D,Hostetler,Ronald D.Andreas.Nonlinear kalman filtering techniques for terrain-aided navigation[J].IEEE Transactions on Automatic Control,1983,28(3):315-323
  • 9[4]Kenneth R,Castleman.Digital Image Processing[M].Prentice Hall.Inc.1996:205-211
  • 10[5]Goldgof DB.Feature extraction and terrain matching[C]//Proc.IEEE-CS conf.Computer vision and Pattern Recogniton,1988:899-904

共引文献1201

同被引文献14

引证文献3

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部