期刊文献+

DHSSA优化的K均值互补迭代车型信息数据聚类 被引量:3

K-means Complementary Iterative Vehicle Information Data Clustering Based on DHSSA Optimization
下载PDF
导出
摘要 针对传统方法在车型信息数据聚类过程中受初始化中心点的影响较大导致聚类精度低、鲁棒性差以及在迭代过程中求取均值选择聚类中心受离群点影响大的问题,提出了一种DHSSA优化的K均值互补迭代车型信息数据聚类方法。首先,针对SSA算法中发现者位置更新不足和种群多样性不足的问题,设计了一种扰动因子-领头雀优化策略,通过自适应领头雀策略加强了最优个体的影响力,利用扰动因子扩大搜索空间,提升了寻找聚类中心的准确率;其次,设计了基于筛选最大最小距离积方法 SMMP优化聚类中心的初始化,在MMP基础上增加了筛选机制,使初始化的中心尽可能更均匀地分布在每个簇中;最后,融合DHSSA和SMMP来优化K均值互补迭代,在减小迭代次数的同时增加搜索效率,得到较好的聚类结果。利用多种数据集进行测试,通过试验结果中的收敛曲线和性能指标可以看出,提出的DHSSA-KMC方法相对于SSA-KMC、IMFO-KMC、KMC和KMC++具有更高的搜索精度、收敛速度和更低的聚类代价,并且耗时相对于SSA-KMC和IMFO-KMC有所减少,证明了算法的有效性和优越性。在车型信息数据处理过程中,DHSSA-KMC可以高效聚类生成竞品车型供消费者选择,应用价值明显。 For the problems that the traditional method is greatly affected by the initialization center in the process of vehicle information data clustering,resulting in low clustering accuracy and poor robustness,and the selection of clustering center by calculating the mean in the iterative process is greatly affected by the outliers,a Kmeans complementary iterative vehicle information data clustering optimized by DHSSA is proposed. Firstly,for the problem of insufficient update of discoverer position and insufficient population diversity in SSA algorithm,a disturbance factor-head optimization strategy is designed. The influence of the optimal individual is strengthened by the adaptive head strategy,and the search space is expanded by the disturbance factor,which improves the accuracy of cluster center searching. Secondly,the initialization of cluster centers optimized by screening maximum and minimum distance product method(SMMP)is designed,and the screening mechanism is added on the basis of MMP,so that the initial centers are more evenly distributed in each cluster as much as possible. Finally,DHSSA and SMMP are integrated to optimize the K-means complementary iteration,which reduces the number of iterations and increases the search efficiency to obtain better clustering results. Using a variety of data sets for testing,through the convergence curve and performance indicators in the experimental results,it can be seen that the proposed DHSSAKMC method is of higher search accuracy,convergence speed and lower clustering cost than SSA-KMC,IMFOKMC,KMC and KMC++,and the time consumption is reduced compared with SSA-KMC and IMFO-KMC,which proves the effectiveness and superiority of the algorithm. In the process of vehicle information data processing,DHSSA-KMC can efficiently cluster and generate competitive models for consumers to choose,with obvious application value.
作者 黄鹤 李文龙 杨澜 王会峰 王飚 茹锋 Huang He;Li Wenlong;Yang Lan;Wang Huifeng;Wang Biao;Ru Feng(Chang’an University,Xi’an 710064;Xi’an Key Laboratory of Intelligent Expressway Information Fusion and Control,Xi’an 710064)
出处 《汽车工程》 EI CSCD 北大核心 2022年第5期691-700,729,共11页 Automotive Engineering
基金 国家重点研发计划(2018YFB1600600) 国家自然科学基金面上项目(52172324) 陕西省重点研发计划(2021SF-483) 陕西省自然科学基础研究计划项目(2021UM-184) 陕西省博士后科研项目(2018BSHYDZZ64) 西安市智慧高速公路信息融合与控制重点实验室(长安大学)开放基金项目(300102321502) 中央高校基本科研业务费资助项目(300102240203)资助。
关键词 K均值聚类 筛选最大最小距离积法 麻雀搜索算法 数据集 车型信息数据 KMC screening maximum and minimum distance product SSA data sets car type information data
  • 相关文献

参考文献8

二级参考文献91

  • 1陈兴蜀,吴小松,王文贤,王海舟.基于特征关联度的K-means初始聚类中心优化算法[J].四川大学学报(工程科学版),2015,47(1):13-19. 被引量:29
  • 2肖春景,张敏.基于减法聚类与模糊c-均值的模糊聚类的研究[J].计算机工程,2005,31(B07):135-137. 被引量:22
  • 3张文君,顾行发,陈良富,余涛,许华.基于均值-标准差的K均值初始聚类中心选取算法[J].遥感学报,2006,10(5):715-721. 被引量:57
  • 4孙吉贵,刘杰,赵连宇.聚类算法研究[J].软件学报,2008(1):48-61. 被引量:1073
  • 5Jain A K, Murty M N, Flynn P J. Data clustering: Areview. ACM Computing Surveys (CSUR), 1999,31 (3) : 264- 323.
  • 6Manning C E. Fractal clustering of metamorphic veins. Geology, 1994,22 (4) : 335- 338.
  • 7Wang J, Wu X, Zhang C. Support vector machines hased on K-means clustering for real-time business intelligence systems. International Journal of Business Intelligence and Data Mining, 2005,1(1) :54-64.
  • 8Szolovits P. Artiaicia: intelligence in medicine. Boulder Colorado : Westview Press, 1982,25 - 60.
  • 9McQuitty L L. Elementary linkage analysis for isolating orthogonal and oblique types and typal relevancies. Educational and Psychological Meas- urement, 1957,17:207- 229.
  • 10Bezdek J C. Pattern recognition with fuzzy objective function algorithms. Springer Science Business Media,2013,24-29.

共引文献436

同被引文献18

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部