期刊文献+

基于权重与共现的标签聚类算法的研究

Tag Clustering Algorithm Research Based on Weight and Co-occurrence
下载PDF
导出
摘要 在社交网络中,标签对资源的描述在一定程度上存在着准确性不高和结构紊乱等现象。为了改善这些问题,提出了一种新的基于权重与共现的标签凝聚型层次聚类算法:首先在社交网络中收集网页标签的相关数据,然后计算标签与网页的权重,再计算标签共现的相似度,并以此为初始数据进行凝聚型层次聚类,最后把聚类结果与人工分类结果比较,计算出精确度、召回率以及加权调和平均数F1。通过实验结果体现了这种算法的可行性。 ln the SNS (Social Networking Service),there are some phenomena,which tag description of resources is not high and the structure is mussy.ln order to improve these problems,a new kind of condensed type hierarchy clustering algorithm of the tag based on weight and co-occurrence is proposed:first,col ect the tags from the SNS.Then calculate the weight of tags with the page.Calculate the similarity of the tags' co-occurrence,and use them as the initial data in type hierarchy clustering.Final y,calculate the accuracy,recal rate and the weighted harmonic mean F1 through comparing the clustering results with the results of manual classification.There is a good effect through the experiment of this new algorithm.
作者 高宏宾 杨翠
出处 《工业控制计算机》 2014年第6期116-117,120,共3页 Industrial Control Computer
基金 广东省自然科学基金S2011010003681~~
关键词 标签 聚类 社交网络 权重 共现 tag cIustering SNS weight co-occurrence
  • 相关文献

参考文献8

  • 1Garcia-Plaza A P, Zubiaga Arkaitz, Fresno Victor, et al. Re- organizing clouds:A study on tag clustering and evaluation [J].Expert Systems with Applications, 2012,39(10): 9483-9493.
  • 2Pandurang M B, P.Dwivedi N K, Chaudhari Y S, et al. Dy- namic tag positioning using cohesion based hierarchical clus- tering [J]. International Journal of Computer Technology andElectronics Engineering, 2012, 12 (1): 120-125.
  • 3韩敏,唐常杰,段磊,李川,巩杰.基于TF-IDF相似度的标签聚类方法[J].计算机科学与探索,2010,4(3):240-246. 被引量:22
  • 4周津,陈超,俞能海.采用对象特征向量表示法的标签聚类算法[J].小型微型计算机系统,2012,33(3):525-530. 被引量:8
  • 5Sultana Fouzia, Charles Stephen, A.Govardhan. Spam com- ment detection in blog comments from blog rss feed by modified TF-IDF algorithm [J]. Interna[ional Journal of Engi- neering Science and Technology, 2012, 4(3): 985-989.
  • 6Lee,Sihyoung, De Neve Wesley, Ro Y M. Tag refinement in an image folksonomy using currence statistics [J].Signal tion, 2012, 25(10): 761-773.
  • 7S.Wu Andrew, H.Do Bao, Kim Jinsuh, et al. Evaluation of negation and uncertainty detection and its impact on preci- sion and recall in search[J]. Journal of Digital Imaging, 2011, 24 (2): 234-242.
  • 8吴志媛,钱雪忠.基于PLSI的标签聚类研究[J].计算机应用研究,2013,30(5):1316-1319. 被引量:4

二级参考文献41

  • 1李良俊,张斌,杨明.基于LSA降维的KNN文本分类算法[J].东北师大学报(自然科学版),2007,39(2):33-36. 被引量:7
  • 2Shirky C Ontology is overrated:Categories,links,and tags[EB/OL].Clay Shirky's Writings about the Internet Website.http://www.shirky.com/writings/ontology_overrated.htral.
  • 3Cattuto C,Loreto V,Pietronero L.Semiotic dynamics and collaborative tagging[J].Proceedings of the National Academy of Sciences,2007,104(5):1461-1464.
  • 4Lawrence K F,Schraefel M C.Freedom and restraint:Tags,vocabularies and ontologies[C]//Proceedings of the 2nd IEEE International Conference on ICTIA,2006:1745-1750.
  • 5Haveliwala T H,Gionis A,Klein D,et al.Evaluating strategies for similarity search on the Web[C]//Proc of 11th International Conference on World Wide Web,2002:432-442.
  • 6Niwa S,Doi T,Honiden S.Folksonomy tag organization method based on the tripartite graph analysis[C]//IJCAI Workshop on Semantic Web for Collaborative Knowledge Acquisition,Hyderabad,India,2007.
  • 7Niwa S,Doi T,Honiden S.Web page recommender system based on Folksonomy mining[C]//Proceedings of the 3rd Int'l Conf on Information Technology New Generations,April,2006:388-393.
  • 8Xu K,Chen Y,Jiang Y.A comparative study of correlation measurements for searching similar tags[C]//Proc of the 4th International Conference on Advanced Data Mining and Applications,2008:709-716.
  • 9Colder S,Huberman B A.The structure of collaborative tagging systems[EB/OL].HP labs:Advanced Research at HP.http://www.hpl.hp.corrd research/idl/papers/tagsdtags.pdf.
  • 10Cattuto C,Schmitz C,Baldassarri A.Network properties of Folksonomies[J].Network Analysis in Natural Sciences and Engineering,2007,20(4):245-262.

共引文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部