核K-Means聚类在Folksonomy标签模糊和冗余中的应用被引量：3

Application of K-Means clustering of kernel to ambiguity and redundancy of tag in Folksonomy

下载PDF

导出

摘要现有的Folksonomy标签推荐系统中,标签模糊会导致系统推荐不准确,并且影响用户建模的准确性,而标签冗余妨碍了对系统的评估。利用K-Means聚类结果抽取模糊和冗余标签时,聚类效果较差导致抽取不准确。提出使用核K-Means聚类处理标签模糊和冗余,通过非线性映射能够较好地分辨、提取并放大样本中有用的特征,提高抽取模糊标签和冗余标签的准确度。实验结果表明:核K-Means聚类对标签和资源的聚类效果更好,抽取的模糊标签和冗余标签也更准确。 Ambiguity of tag may give a false impression of success when the recommended tags ofter little utility. Redundancy of tag can hamper the effort to judge recommendations as well. When using K-Means clustering to deal with this problem, the extraction of ambiguity tags and redundancy tags was inaccurate because the clustering effect was ineffective. Therefore, the K-Means clustering of kernel algorithm was used to deal with the problem of ambiguity and redundancy on tags. This approach improved the clustering effect because it could identify, extract and enlarge useful features of the sample by non- linear mapping. The experimental results show that, the K-Means clustering of kernel algorithm has better performance in the clustering of tag and resource, and the extraction of ambiguity tag and redundancy tag is more accurate.

作者张新伦苏一丹惠刚刚

机构地区广西大学计算机与电子信息学院

出处《计算机应用》 CSCD 北大核心 2011年第3期680-682,697,共4页 journal of Computer Applications

关键词 Folksonomy标签推荐系统标签模糊标签冗余核K-Means聚类 tag recommendation system for Folksonomy tag ambiguity tag redundancy K-Means clustering of kernel

分类号 TP18 [自动化与计算机技术—控制理论与控制工程] TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1LIPCZAK M. Tag recommendation for folksonomies oriented towards individual users [ C]// European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Antwerp, Belgium: [ s. n. ], 2008:84 -89.
2杨丹,曹俊.基于Web2.0的社会性标签推荐系统[J].重庆工学院学报（自然科学版）,2008,22(7):51-55. 被引量：14
3VIG J, EN S, RIEDL J. Tagsplanations: Explaining recommenda- tions using tags [ C]//IUI'09: Proceedings of the 13th International Conference on Intelligent User Interfaces. New York: ACM Press, 2009:47 - 56.
4GEMMELL J, RAMEZANI M, SCHIMOLER T. A fast effective multi-channeled tag recommender [ C]// European Conference on Machine Learning and Principles and Practice of Knowledge Discov- ery in Databases. Bled, Slovenia: [s. n. ], 2009:35 -42.
5ZHANG NING, ZHANG YUAN, TANG JIE. A tag recommendation system based on contents [ C]/! European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Da- tabases. Bled, Slovenia: [s.n.], 2009:19-25.
6ZHANG YUAN, ZHANG NING, TANG JIE. A collaborative filtering tag recommendation system based on graph [ C]/! European Conference on Machine I.earning and Principles and Practice of Knowledge Discov- ery in Databases. Bled, Slovenia: [s.n.], 2009: 152-158.
7GEMMELL J, RAMEZANI M, SCHIMOLER T, et al. The impact of ambiguity and redundancy on tag recommendation in folksonomics [ C]//Proceedings of the 2009 ACM Conference on Recommender Systems. New York: ACM Press, 2009:23-25.
8孔锐,张国宣,施泽生,郭立.基于核的K-均值聚类[J].计算机工程,2004,30(11):12-13. 被引量：46
9张莉,周伟达,焦李成.核聚类算法[J].计算机学报,2002,25(6):587-590. 被引量：195
10姜园,张朝阳,仇佩亮,周东方.用于数据挖掘的聚类算法[J].电子与信息学报,2005,27(4):655-662. 被引量：67

二级参考文献66

1刘静,钟伟才,刘芳,焦李成.免疫进化聚类算法[J].电子学报,2001,29(z1):1868-1872. 被引量：43
2刘健庄,谢维信,黄建军,李文化.聚类分析的遗传算法方法[J].电子学报,1995,23(11):81-83. 被引量：27
3钱云涛,谢维信.一种由模糊逻辑神经元网络实现的聚类分析方法[J].西安电子科技大学学报,1995,22(1):1-7. 被引量：12
4钱云涛,谢维信.聚类神经网络的通用设计方法[J].西安电子科技大学学报,1997,24(1):15-21. 被引量：3
5[1]Vapnik V N. The Nature of Statistical Learning Theory. Springer Verlag New York, 1995
6[2]Scholkopf B, Smola A, Muller K. Non-linear Component Analysis as a Kernel Eigenvalue Problem. Neural Network,1998:1299-1319
7[3]Muller K, Mika S, Ratsch G, et al. An Introduction to Kernel-based Learning Algorithms. IEEE Trans. on Neural Networks ,2001
8[4]Sch lkopf B. The Kernel Trick for Distances. Technical Report MSR- TR-2000-51, 19 May 2000.
9[1]Web 2.0 Principles and Best Practices[Z].
10[2]Goldberg D E.Genetic Algorithms in Search,Optimization,and Machine Learning[M].MA:Addison-Wesley,1989.

共引文献308

1吕佳,熊浩.一种新城市气温模式分类的聚类算法[J].数学的实践与认识,2007,37(8):55-60.
2梁久祯.核函数参数优化的聚类算法[J].仪器仪表学报,2005,26(z1):678-680. 被引量：2
3赵大伟,肖周芳.一种改进的基于密度和样本数量的K-means算法[J].科技信息,2008(28):170-172. 被引量：1
4梁敏君,倪志伟,倪丽萍,杨葛钟啸.基于网格与分形维数的聚类算法[J].计算机应用,2009,29(3):830-832. 被引量：4
5宋启祥,张明玉,张锏.基于核聚类的MRI和PET医学图像分割方法[J].宿州学院学报,2005,20(1):88-90. 被引量：1
6秦亮,张文广,周绍磊,史贤俊.基于Parzen窗估计的核k-means聚类方法[J].计算机工程,2011,37(S1):217-219. 被引量：1
7沈红斌,王士同,吴小俊.离群模糊核聚类算法[J].软件学报,2004,15(7):1021-1029. 被引量：37
8伍忠东,高新波,谢维信.基于核方法的模糊聚类算法[J].西安电子科技大学学报,2004,31(4):533-537. 被引量：75
9陈才扣,高林,高秀梅,杨静宇.基于聚类的核矩阵维度缩减[J].数据采集与处理,2004,19(3):250-253.
10赵姝,张燕平,张媛,陈传明.基于交叉覆盖算法的改进算法——核平移覆盖算法[J].微机发展,2004,14(11):1-3. 被引量：6

同被引文献34

1邓乃阳,田英杰.数据挖掘中的新方法-支持向量机[M].北京:科学出版社,2004.
2Zhang Ning, Zhang Yuan, Tang Jie. A Tag Recommendation System based on contents[J]. ECML PKDD Discovery Challenge 2009 (DC09), 2009,497 : 285-295.
3Zhang Yuan, Zhang Ning, Tang Jie. A Collaborative Filtering Tag Recommendation System based on Graph[J]. ECML PKDD Discovery Challenge 2009(DCA39), 2009,497: 297-306.
4Gemmell J, Ramezani M, Schimoler T. The Impact of Ambiguity and Redundancy on Tag Recommendation in Folksonomies[M]. NewYork, USA, ACM, 2009: 45-52.
5Lu Yan-ping. Particle Swarm Optimizer for Variable Weighting in Clustering High-dimensional Data[J]. IEEE. Mach Learn, 2011,82:43-70.
6Huang J Z, Ng M, Rong H, et al. Automated dimension weighting in k-means type clustering[J]. IEEE Trans. on Pattern Analysis and Machine Intelligellee, 2005,27 (5) : 1-12.
7Domeniconi C, Gunopulos D, Ma S, et al. Locally adaptive metrics for clustering high dimensional data[J]. Data Mining and Knowledge Discovery Journal, 2007,14 : 63-97.
8Jing L,Ng M K, Huang J Z. An entropy weighting k-means algorithm for subspace clustering of high-dimensinoal sparese data [J]. IEEE Trans. on Knowledge and Data Engineering, 2007,19 (8) : 1026-1041.
9Liang J J,Qin A K,Suganthan P N,et al. Comprehensive learning particle swarm optimizer for global optimization of multimodal functions[J]. IEEE Transactions on Evolutionary Computation, 2006,10(3): 281-295.
10宋洪鑫,李蕾,刘冬雪中文博客标签调查分析及标签推荐模型的研究[C]//中国中文信息学会第五届全国青年计算语言学研讨会论文集.武汉,2010:320-326.

引证文献3

1王晓帅,覃华,丁立朵,马翩翩.用子空间粒子群聚类算法识别Folksonomy标签冗余的研究[J].计算机科学,2012,39(B06):283-287.
2乔绿茵,张敏.我国基于Folksonomy的标签推荐方法研究综述[J].信息资源管理学报,2012,2(4):41-46. 被引量：4
3习扬,苏一丹,覃希.用KPCA-SVM的方法检测垃圾标签的研究[J].计算机技术与发展,2014,24(5):65-69.

二级引证文献4

1陈然,杨成.大学生绿色网络学习环境的构建研究[J].江苏开放大学学报,2014,25(3):27-32. 被引量：2
2孙玲芳,冯遵倡.基于特征加权张量分解的标签推荐算法研究[J].江苏科技大学学报（自然科学版）,2015,29(6):574-579. 被引量：5
3熊回香,窦燕.基于LDA主题模型的标签混合推荐研究[J].图书情报工作,2018,62(3):104-113. 被引量：20
4田伟,龚磊.基于共现图的混合标签推荐算法[J].现代计算机,2020,26(19):40-44.

1张新伦,苏一丹,覃希.标签模糊和冗余在标签推荐中的研究及应用[J].计算机应用研究,2011,28(8):2971-2973.
2李春英,汤庸,汤志康,黄泳航,袁成哲,赵剑冬.面向大规模学术社交网络的社区发现模型[J].计算机应用,2015,35(9):2565-2568. 被引量：10
3贾红梅,李文杰.面向仓储管理的RFID数据过滤模型研究[J].计算机应用与软件,2014,31(2):74-76. 被引量：2
4刘喜平,万常选,刘德喜.XML关键词搜索结果的多样化[J].计算机科学与探索,2012,6(10):935-947. 被引量：1
5王晓帅,覃华,丁立朵,马翩翩.用子空间粒子群聚类算法识别Folksonomy标签冗余的研究[J].计算机科学,2012,39(B06):283-287.
6赵亚楠,董晶,董佳梁.基于社会化标注的博客标签推荐方法[J].计算机工程与设计,2012,33(12):4609-4613. 被引量：10
7徐荣飞.Python正则表达式研究[J].电脑编程技巧与维护,2015(9):45-45. 被引量：4
8周津,陈超,俞能海.采用对象特征向量表示法的标签聚类算法[J].小型微型计算机系统,2012,33(3):525-530. 被引量：8
9李春英,汤庸,林海,袁成哲,麦辉强.基于标签传播的可并行复杂网络重叠社区发现算法[J].中国科学：信息科学,2016,46(2):212-227. 被引量：11
10李唯,王玉皞,孙宇,曾舰.改进型MBI防碰撞算法研究[J].传感技术学报,2016,29(11):1711-1717.

计算机应用

2011年第3期

浏览历史

内容加载中请稍等...

核K-Means聚类在Folksonomy标签模糊和冗余中的应用被引量：3

参考文献10

二级参考文献66

共引文献308

同被引文献34

引证文献3

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

核K-Means聚类在Folksonomy标签模糊和冗余中的应用 被引量：3

参考文献10

二级参考文献66

共引文献308

同被引文献34

引证文献3

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

核K-Means聚类在Folksonomy标签模糊和冗余中的应用被引量：3