A Categorical Attribute Clustering Algorithm Based on Trust Value

Abstract: To address the shortcomings of the K-Modes algorithm, this paper proposes TrustCCluster, a trust-value-based clustering algorithm for categorical attributes. The algorithm does not require the number of clusters to be specified in advance, and its clustering results are stable and do not depend on the choice of initial values. TrustCCluster is validated on real data and compared with the K-Modes and P-Modes algorithms; the experimental results show that it is effective and feasible.
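For context, the K-Modes limitations mentioned in the abstract can be seen in a minimal sketch of the baseline as it is commonly described: simple matching dissimilarity plus attribute-wise modes. This is not the TrustCCluster method, which the abstract does not specify. The function names (`matching_distance`, `mode_of`, `k_modes`) and the toy data are illustrative assumptions.

```python
from collections import Counter


def matching_distance(a, b):
    """Simple matching dissimilarity: count of attributes where a and b differ."""
    return sum(x != y for x, y in zip(a, b))


def mode_of(records):
    """Attribute-wise most frequent value over a non-empty list of records."""
    return tuple(Counter(column).most_common(1)[0][0] for column in zip(*records))


def assign(data, modes):
    """Label each record with the index of its nearest mode."""
    return [
        min(range(len(modes)), key=lambda j: matching_distance(record, modes[j]))
        for record in data
    ]


def k_modes(data, initial_modes, max_iter=100):
    """Baseline K-Modes sketch. The caller must supply the initial modes,
    which also fixes the number of clusters k = len(initial_modes)."""
    modes = list(initial_modes)
    for _ in range(max_iter):
        labels = assign(data, modes)
        new_modes = []
        for j, old_mode in enumerate(modes):
            members = [r for r, label in zip(data, labels) if label == j]
            # An empty cluster keeps its previous mode.
            new_modes.append(mode_of(members) if members else old_mode)
        if new_modes == modes:  # converged: no mode changed
            break
        modes = new_modes
    return assign(data, modes), modes


# Hypothetical toy data: four records with two categorical attributes each.
data = [("a", "x"), ("a", "y"), ("b", "z"), ("b", "z")]
labels, modes = k_modes(data, initial_modes=[("a", "x"), ("b", "z")])
```

Both the cluster count and the starting modes are inputs here, so a different choice of `initial_modes` can lead to a different partition. According to the abstract, this is the dependence TrustCCluster is designed to avoid.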

Authors: Li Zi (李梓), Jiang Qingfeng (蒋庆丰), Cheng Xiaoxu (程晓旭), Jia Meijuan (贾美娟)

Affiliation: College of Computer Science and Technology, Daqing Normal University

Source: Microcomputer & Its Applications (《微型机与应用》), 2012, No. 22, pp. 57-59, 63 (4 pages)

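Funding: Natural Science Foundation of Heilongjiang Province (F200923); Science and Technology Research Project of the Heilongjiang Provincial Department of Education (11553001)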

Keywords: trust value; clustering; K-Modes algorithm; P-Modes algorithm

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]
