期刊文献+

基于方差优化谱聚类的热点区域挖掘算法

Hot Region Mining Algorithm based on Variance Optimization Spectrum Clustering
下载PDF
导出
摘要 为改善交通拥堵的情况,本文利用聚类分析方法对移动轨迹数据进行挖掘,识别居民出行的热点区域。传统的Ng-Jordan-Weiss(NJW)谱聚类算法常使用K-means聚类算法来实现最后的聚类操作,然而K-means聚类算法存在对初始值敏感、容易陷入局部最优的缺陷,影响对热点区域的挖掘结果。因此,本研究将方差优化初始中心的K-medoids聚类算法运用到谱聚类算法最后聚类阶段,提出基于方差优化谱聚类的热点区域挖掘算法(Hot Region Mining algorithm based on improved K-medoids Spectral Clustering,HRM-KSC),然后在真实的轨迹数据集上进行试验。试验结果发现,HRM-KSC算法聚类结果的轮廓系数更高,表明HRM-KSC算法改善了NJW谱聚类算法,提高了聚类质量。 In order to improve the traffic congestion,this article uses the cluster analysis approach to mine the mobile trajectory data and identify the hot region of residents'travel.The traditional Ng-Jordan-Weiss(NJW)spectral clustering algorithm often uses K-means clustering algorithm to achieve the final clustering operation.However,K-means clustering algorithm has the disadvantages of being sensitive to the initial value and easy to fall into the local optimum,which will affect the mining results of hotspot area.Therefore,the K-medoids clustering algorithm of variance optimization initial center is applied to the final clustering stage of the spectral clustering algorithm,and a Hot Region Mining algorithm based on improved K-medoids Spectral Clustering(HRM-KSC)is proposed,and then experiment on real trajectory data sets.The experiment results find that the HRM-KSC algorithm clustering results have higher silhouette coefficient,which indicates that the HRM-KSC algorithm improves the NJW spectral clustering algorithm and the clustering quality.
作者 梁卓灵 元昌安 覃晓 LIANG Zhuoling;YUAN Chang'an;QIN Xiao(Guangxi University,Nanning,Guangxi,530004,China;Guangxi Academy of Sciences,Nanning,Guangxi,530007,China;Nanning Normal Universety,Nanning,Guangxi,530001,China)
出处 《广西科学》 CAS 2020年第6期616-621,I0010,共7页 Guangxi Sciences
基金 国家自然科学基金项目(61962006,61802035,61772091) 广西科技开发项目(AA18118047,AD18126015) 广西自然科学基金项目(2018GXNSFDA138005)资助。
关键词 K-medoids算法 谱聚类 热点区域 停留点 交通拥堵 K-medoids algorithm spectral clustering hot region stop point traffic congestion
  • 相关文献

参考文献4

二级参考文献63

  • 1张惟皎,刘春煌,李芳玉.聚类质量的评价方法[J].计算机工程,2005,31(20):10-12. 被引量:60
  • 2Han J W,Kamber M. Data Mining: Concepts and Techniques[M]. Beijing: China Machine Press, 2000:383-466.
  • 3Theodoridis S, Koutroumbas K. Pattern tecognition[M]. Boston: Academic Press, 2009 : 745-748.
  • 4Kaufman L, Rousseeuw P J. Finding groups in data: An introduction to cluster analysis[M]. New York: Wiley, 1990 : 126-163.
  • 5Lucasius C B, Dane A clustering of large data algorithm: Background, Analytica Chimica Acta, D, Kateman G. On k-medoid sets with the aid of a genetic feasibility and comparison[J]. 1993, 282(3): 647-669.
  • 6Ng R, Han J. Efficient and effective clustering methods for spatial data mining[C] // In Proceedings of the 20th International Conference on very Large Databases, Santiago, 1994: 144-155.
  • 7Wei C P, Lee Y H, Hsu C M. Empirical comparison of fast partitioning-based clustering algorithms for large data sets[J]. Expert Systems with Applications, 2003, 24(4) 351-363.
  • 8Zhang Q, Couloigner I. A new and efficient K-medoid algorithm for spatial clustering[J]. Lecture Notes in Computer Science, 2005, 3482:181-189.
  • 9Park H S, Jun C H. A simple and fast algorithm for K-medoids clustering[J]. Expert Systems with Applications, 2009, 36 (2): 3336-3341.
  • 10Frank A, Asuncion A. UCI machine learning repository[EB/OL]. Irvine, CA: University of California, School of Information and Computer Science, 2010. http:// archive, ics. uci. edu/ml.

共引文献50

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部