基于方差优化谱聚类的热点区域挖掘算法

Hot Region Mining Algorithm based on Variance Optimization Spectrum Clustering

下载PDF

导出

摘要为改善交通拥堵的情况,本文利用聚类分析方法对移动轨迹数据进行挖掘,识别居民出行的热点区域。传统的Ng-Jordan-Weiss(NJW)谱聚类算法常使用K-means聚类算法来实现最后的聚类操作,然而K-means聚类算法存在对初始值敏感、容易陷入局部最优的缺陷,影响对热点区域的挖掘结果。因此,本研究将方差优化初始中心的K-medoids聚类算法运用到谱聚类算法最后聚类阶段,提出基于方差优化谱聚类的热点区域挖掘算法(Hot Region Mining algorithm based on improved K-medoids Spectral Clustering,HRM-KSC),然后在真实的轨迹数据集上进行试验。试验结果发现,HRM-KSC算法聚类结果的轮廓系数更高,表明HRM-KSC算法改善了NJW谱聚类算法,提高了聚类质量。 In order to improve the traffic congestion,this article uses the cluster analysis approach to mine the mobile trajectory data and identify the hot region of residents'travel.The traditional Ng-Jordan-Weiss(NJW)spectral clustering algorithm often uses K-means clustering algorithm to achieve the final clustering operation.However,K-means clustering algorithm has the disadvantages of being sensitive to the initial value and easy to fall into the local optimum,which will affect the mining results of hotspot area.Therefore,the K-medoids clustering algorithm of variance optimization initial center is applied to the final clustering stage of the spectral clustering algorithm,and a Hot Region Mining algorithm based on improved K-medoids Spectral Clustering(HRM-KSC)is proposed,and then experiment on real trajectory data sets.The experiment results find that the HRM-KSC algorithm clustering results have higher silhouette coefficient,which indicates that the HRM-KSC algorithm improves the NJW spectral clustering algorithm and the clustering quality.

作者梁卓灵元昌安覃晓 LIANG Zhuoling;YUAN Chang'an;QIN Xiao(Guangxi University,Nanning,Guangxi,530004,China;Guangxi Academy of Sciences,Nanning,Guangxi,530007,China;Nanning Normal Universety,Nanning,Guangxi,530001,China)

机构地区广西大学广西科学院南宁师范大学

出处《广西科学》 CAS 2020年第6期616-621,I0010,共7页 Guangxi Sciences

基金国家自然科学基金项目(61962006,61802035,61772091) 广西科技开发项目(AA18118047,AD18126015) 广西自然科学基金项目(2018GXNSFDA138005)资助。

关键词 K-medoids算法谱聚类热点区域停留点交通拥堵 K-medoids algorithm spectral clustering hot region stop point traffic congestion

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1谢娟英,高瑞.方差优化初始中心的K-medoids聚类算法[J].计算机科学与探索,2015,9(8):973-984. 被引量：13
2覃晓,梁伟,元昌安,唐涛.基于遗传优化谱聚类的图形分割方法[J].计算机科学,2017,44(1):100-102. 被引量：4
3谢娟英,郭文娟,谢维信.基于邻域的K中心点聚类算法[J].陕西师范大学学报（自然科学版）,2012,40(4):16-22. 被引量：32
4谢娟英,高瑞.Num-近邻方差优化的K-medoids聚类算法[J].计算机应用研究,2015,32(1):30-34. 被引量：11

二级参考文献63

1张惟皎,刘春煌,李芳玉.聚类质量的评价方法[J].计算机工程,2005,31(20):10-12. 被引量：60
2Han J W,Kamber M. Data Mining: Concepts and Techniques[M]. Beijing: China Machine Press, 2000:383-466.
3Theodoridis S, Koutroumbas K. Pattern tecognition[M]. Boston: Academic Press, 2009 : 745-748.
4Kaufman L, Rousseeuw P J. Finding groups in data: An introduction to cluster analysis[M]. New York: Wiley, 1990 : 126-163.
5Lucasius C B, Dane A clustering of large data algorithm: Background, Analytica Chimica Acta, D, Kateman G. On k-medoid sets with the aid of a genetic feasibility and comparison[J]. 1993, 282(3): 647-669.
6Ng R, Han J. Efficient and effective clustering methods for spatial data mining[C] // In Proceedings of the 20th International Conference on very Large Databases, Santiago, 1994: 144-155.
7Wei C P, Lee Y H, Hsu C M. Empirical comparison of fast partitioning-based clustering algorithms for large data sets[J]. Expert Systems with Applications, 2003, 24(4) 351-363.
8Zhang Q, Couloigner I. A new and efficient K-medoid algorithm for spatial clustering[J]. Lecture Notes in Computer Science, 2005, 3482:181-189.
9Park H S, Jun C H. A simple and fast algorithm for K-medoids clustering[J]. Expert Systems with Applications, 2009, 36 (2): 3336-3341.
10Frank A, Asuncion A. UCI machine learning repository[EB/OL]. Irvine, CA: University of California, School of Information and Computer Science, 2010. http:// archive, ics. uci. edu/ml.

共引文献50

1刘博,安建成.基于关键姿势的人体动作识别[J].电视技术,2014,38(5):38-41. 被引量：8
2殷樱,张玉冰,刘家诚,高昆.基于邻域互信息和K均值的基因选择算法[J].电脑知识与技术,2014(2):821-823.
3吴军,王龙龙.基于双鸟群混沌优化的otsu图像分割算法[J].微电子学与计算机,2018,35(12):119-124. 被引量：9
4路浩,倪世宏,查翔,张鹏.基于递减概率初始点选择K中心点进化算法[J].计算机仿真,2014,31(9):314-318. 被引量：3
5谢娟英,高瑞.Num-近邻方差优化的K-medoids聚类算法[J].计算机应用研究,2015,32(1):30-34. 被引量：11
6唐涛,覃晓,易宗剑,韩冬越.基于k中心点聚类的图像二值化方法[J].计算机科学与探索,2015,9(2):234-241. 被引量：10
7谢娟英,周颖.一种新聚类评价指标[J].陕西师范大学学报（自然科学版）,2015,43(6):1-8. 被引量：13
8谢娟英,屈亚楠.密度峰值优化初始中心的K-medoids聚类算法[J].计算机科学与探索,2016,10(2):230-247. 被引量：27
9赵翠芹,易云飞.无线传感网中分簇分层k-medoids协议研究[J].云南民族大学学报（自然科学版）,2016,25(2):157-162. 被引量：2
10王兵,王轲.基于密度指标的大样本数据集聚类方法[J].计算机工程与设计,2016,37(5):1245-1248.

1梁卓灵,元昌安,覃晓,乔少杰,韩楠,范勇强.基于改进谱聚类的热点区域挖掘方法[J].重庆理工大学学报（自然科学）,2021,35(1):129-137. 被引量：4
2魏道培.协作研发的力量[J].中国纤检,2021(1):104-106.
3张倩宜,常晓润,孟兆娜,李妍.基于GoF-EM算法的图像分割[J].计算机应用研究,2020,37(S02):372-374.
4金澄,安晓亚,陈占龙,马啸川.矢量居民地多边形多级图划分聚类方法[J].武汉大学学报（信息科学版）,2021,46(1):19-29. 被引量：5
5Ivo B.Hammer.White,everything white?Josef Frank’s Villa Beer(1930)in Vienna,and its materiality in the context of the discourse on‘white cubes’[J].Built Heritage,2020,4(2):40-55.

广西科学

2020年第6期

浏览历史

内容加载中请稍等...

基于方差优化谱聚类的热点区域挖掘算法

参考文献4

二级参考文献63

共引文献50

相关作者

相关机构

相关主题

浏览历史