期刊文献+

基于对象数量的宽度加权聚类kNN算法 被引量:1

Width-weighted clustering kNN algorithm based on number of objects
下载PDF
导出
摘要 传统k最近邻算法(k-Nearest Neighbor,kNN)作为一种非参数化分类技术在数据分析中具有广泛的应用,但该算法具有较多的冗余计算,致使处理数据时需要花费较多的计算时间。目前大量的研究都集中在数据的预处理阶段,通过为数据建立模型降低kNN查询的计算量。提出一种基于对象数量的宽度加权聚类kNN算法(NOWCkNN),该算法中数据集首先以全局宽度进行聚类,每个生成的子集群根据其对象数量递归计算其宽度的权值,然后算法根据其权值的大小和调和系数调节宽度值,最后生成不同宽度大小的集群用于kNN查询。这不仅减少了算法的聚类时间,还能平衡产生集群的大小,减少迭代次数,使三角不等式修剪率达到最大。实验结果表明,NOWCkNN算法与现有工作相比在各个维度的数据集中有较好的性能,尤其是在高维度、数据量较大的数据集中有更高的修剪效率。 The traditional k-Nearest Neighbor(kNN)algorithm has a wide range of applications as a non-parameterized data clustering algorithm.However,the algorithm has more redundancy calculations,which leads to more computation time when processing data.A large amount of research is currently focused on the preprocessing stage of data,and the computational complexity of kNN queries is reduced by modeling the data.This paper proposes a width-weighted clustering kNN algorithm based on the number of objects for kNN query(NOWCkNN).The algorithm performs width learning based on the number of objects.Firstly,data sets are clustered with global width,and then each generated cluster calculates the weight of its width recursively based on the number of objects.The algorithm can adjust the width value according to the number of cluster’s objects.In terms of clustering,the algorithm not only reduces clustering time and the number of iterations,but also balances cluster size and maximizes the trigonometric inequality.Experimental results show that the work outperforms than the existing works in all dimensions of datasets,especially in high-dimensional and large datasets.
作者 陈辉 关凯胜 李嘉兴 CHEN Hui;GUAN Kaisheng;LI Jiaxing(School of Computer Science and Technology,Guangdong University of Technology,Guangzhou 510006,China;Guangdong Key Laboratory of Big Data Analysis and Processing,Guangzhou 510006,China)
出处 《计算机工程与应用》 CSCD 北大核心 2018年第19期1-9,共9页 Computer Engineering and Applications
基金 国家自然科学基金(No.61702114) 广东省大数据分析与处理重点实验室开放基金(No.201805) 广东省省级科技计划(No.2017A040402009)
关键词 聚类 K-最近邻 三角不等式 宽度加权 高维数据 clustering k-Nearest Neighbors(kNN) trigonometric inequality width-weighted high-dimensional data
  • 相关文献

参考文献3

二级参考文献27

  • 1杨丽华,戴齐,郭艳军.KNN文本分类算法研究[J].微计算机信息,2006,22(07X):269-270. 被引量:24
  • 2王清,马华,孙静,韩忠东.改进的KNN算法及其在医学图像处理中的应用[J].泰山医学院学报,2006,27(6):564-566. 被引量:5
  • 3Wolfson Ouri.Moving objects information management:The database challenge//Proceedings of the International Workshop on Next Generation Information Technologies and Systems.Caesarea,Israel,2002:75-89.
  • 4Xiong Xiaopeng,Mohamed F M,Walid G A.SEA-CNN:Scalable processing of continuous k-nearest neighbor queries in spatio-temporal databases//Proceedings of the International Conference on Data Engineering.Tokyo,Japan,2005:643-654.
  • 5Yu Xiaohui,Ken Q P,Nick Koudas.Mointoring k-nearest neighbor queries over moving objects//Proceedings of the International Conference on Data Engineering.Tokyo,Japan,2005:631-642.
  • 6Mouratidis K,Hadjieleftheriou M,Papadias D.Conceptual partitioning:An efficient method for continuous nearest neighbor monitoring//Proceedings of the International Conference on Management of Data.Baltimore,USA,2005:634-645.
  • 7Hu Haibo,Xu Jianliang,Dik L L.A generic framework for monitoring continuous spatial queries over moving objects//Proceedings of the International Conference on Management of Data.Baltimore,USA,2005:479-490.
  • 8Hsueh Y L,Zimmermann R,Wang Haojun et al.Partition-based lazy updates for continuous queries over moving objects//Proceedings of the International Symposium on Advances in Geographic Information Systems.Seattle,USA,2007.
  • 9Mouratidis K,Yiu M L,Papadias D et al.Continuous nearest neighbor monitoring in road networks//Proceedings of the International Conference on Very Large Data Bases.Seoul,Korea,2006:43-54.
  • 10Demiryurek U,Banaei K F,Shahabi C.Efficient continuous nearest neighbor query in spatial networks using euclidean restriction//Proceedings of the International Symposium on Spatial and Temporal Database.Aalborg,Denmark,2009:25-43.

共引文献78

同被引文献11

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部