期刊文献+

基于加权投票的主动密度峰值数据聚类算法

ACTIVE DENSITY PEAK DATA CLUSTERING ALGORITHM BASED ON WEIGHTED VOTING
下载PDF
导出
摘要 针对传统方法缺乏对查询标准的全面考虑,并且反复运行半监督聚类来更新标签等问题,提出一种基于加权投票一致性的主动密度峰值数据聚类算法。提出一个同时考虑代表性和信息性的主动密度峰聚类算法,选取有代表性的实例捕捉数据模式,查询信息实例以减少聚类结果的不确定性。设计快速更新策略,有效地更新了标签。提出一个结合局部和全局不确定性的主动聚类集成框架,并引入加权投票一致性方法对聚类结果进行优化整合。在数据集上的实验结果证明了该方法能有效解决传统方法问题,并具有良好的聚类性能。 Aimed at the problem that traditional methods lack of comprehensive consideration of query criteria and run semi supervised clustering repeatedly to update tags,an active density peak data clustering algorithm based on weighted voting consistency is proposed.An active density peak clustering algorithm considering both representative and informative was proposed.Representative instances were selected to capture data patterns and query information instances to reduce the uncertainty of clustering results.A fast update strategy was designed to update the label effectively.An active clustering integration framework combining local and global uncertainties was proposed,and weighted voting consensus method was introduced to optimize the integration of clustering results.The experimental results on data sets show that the proposed method can effectively solve the problems of traditional methods,and has good clustering performance.
作者 张苏颖 许曙青 Zhang Suying;Xu Shuqing(Nanjing Engineering Branch of Jiangsu Union Technical Institute,Nanjing 210035,Jiangsu,China;School of Computer Science,Nanjing University of Technology,Nanjing 210000,Jiangsu,China)
出处 《计算机应用与软件》 北大核心 2023年第10期297-306,共10页 Computer Applications and Software
基金 2016年度江苏省第五期“333工程”科研资助立项项目(BRA2016432)。
关键词 半监督 主动峰值聚类 加权投票 查询选择 Semi supervision Active peak clustering Weighted voting Query selection
  • 相关文献

参考文献17

二级参考文献70

  • 1龙军,殷建平,祝恩,赵文涛.主动学习研究综述[J].计算机研究与发展,2008,45(z1):300-304. 被引量:31
  • 2李元臣,刘维群.基于Dijkstra算法的网络最短路径分析[J].微计算机应用,2004,25(3):295-298. 被引量:70
  • 3唐伟,周志华.基于Bagging的选择性聚类集成[J].软件学报,2005,16(4):496-502. 被引量:95
  • 4王玲,薄列峰,焦李成.密度敏感的半监督谱聚类[J].软件学报,2007,18(10):2412-2422. 被引量:94
  • 5Motro A, Marks D G, Jajodia S. Aggregation in relational data-bases: Controlled disclosure of sensitive information[M] //Com-puter Security—ESORICS 94. Springer Berlin Heidelberg, 1994:429-445.
  • 6Accorsi R,Muller G. Preventive inference control in data-centricbusiness models[C] // 2013 IEEE Security and Privacy Work-shops (SPW). IEEE, 2013 : 28-33.
  • 7Wagstaff K,Cardie C. Clustering with instance-level constraints[C]//Proc. of the 17th Int,l Conf. on Machine Learning. 2000:1103-1110.
  • 8Jain A K. Data clustering: 50 years beyond K-means[J]. PatternRecognition Letters,2010,31(8) :651-666.
  • 9Goyal P,Behera L,Mcginnity T M_ A novel neighborhood baseddocument smoothing model for information retrieval[J]. Infor-mation retrieval,2013,16(3) :391-425.
  • 10Settles B. Active learning literature survey [R]. University ofWisconsin-Madison.2010.

共引文献123

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部