期刊文献+

增强覆盖度与非相似性的标签选择多样化方法

Diversifying Tag Selection Result by Improving Both Coverage and Dissimilarity
下载PDF
导出
摘要 标签云是社交网站提供在线资源说明与导航功能的一种流行机制.标签选择即从大量标签中选出有代表性的有限标签,是创建标签云的核心任务.标签选择结果的多样性,是影响用户满意度的一个重要因素.信息覆盖度与标签非相似性是在标签选择中引入多样性的两个主要角度.为了进一步提高标签选择结果的信息覆盖度与标签非相似性,提出了3种标签选择方法.在每种方法中,定义了目标函数以同时量化标签集合的信息覆盖度与标签非相似性,并设计了近似算法以求解相应的最大化问题;同时,还分析了近似算法的近似比.利用CiteULike网站与Last.fm网站的标注数据集,将所提出的方法与已有方法进行了比较.实验结果表明,所提出的方法在信息覆盖度与标签非相似性方面都具有较好的效果. Tag cloud has been a popular facility used by social networks for online resource summarization and navigation. Tag selection, which aims to select a limited number of representative tags from a large set of tags, is the core task for creating tag clouds. Diversity of tag selection result is an important factor that affects user satisfaction. Information coverage and tag dissimilarity are two major perspectives for introducing diversity in tag selection. To improve information coverage and tag dissimilarity of tag selection result, this paper proposes three new tag selection approaches. In each approach, an objective function is defined to quantify both information coverage and tag dissimilarity of tags, and an approximate algorithm is designed to solve the corresponding maximization problem. Further the approximate ratio for each approximate algorithm is analyzed. The proposed and existing approaches are compared using tagging datasets extracted from the websites of CiteULike and Last.fm. The experimental results show that the new approaches perform better in terms of both information coverage and tag dissimilarity.
出处 《软件学报》 EI CSCD 北大核心 2015年第9期2326-2338,共13页 Journal of Software
基金 国家自然科学基金(61100067) 中国科学院先导专项(XDA06010600)
关键词 标签云 标签选择 结果多样化 信息覆盖度 非相似性 tag cloud tag selection result diversification information coverage dissimilarity
  • 相关文献

参考文献1

二级参考文献21

  • 1Smeulders A W M, Worring M, Santini S, Gupta A, Jain R. Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelli- gence, 2000, 22(12): 1349- 1380.
  • 2Ames M, Naaman M. Why we tag: Motivations for annota- tion in mobile and online media//Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. San Jose, USA, 2007: 971- 980.
  • 3Wu L, Yang L, Yu N, Hua X S. Learning to tag//Proceed- ings of the 18th International Conference on World Wide Web. Madrid, Spain, 2009:361-370.
  • 4Akbas E, Yarman Vural F T. Automatic image annotation by ensemble of visual descriptors//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, USA, 2007: 1-8.
  • 5Sigurbj6rnsson B, van Zwol R. Fliekr tag recommendation based on collective knowledge//Proeeedings of the 17th International Conference on World Wide Web. Beijing, China,2008:327-336.
  • 6Freund Y, Iyer R, Schapire R E, Singer Y. An efficient boosting algorithm for combining preferences. The Journal of Machine Learning Research, 2003, 4:933-969.
  • 7Liu D, HuaXS, YangL, WangM, ZhangHJ. Tag rank- ing//Proceedings of the 18th International Conference on World Wide Web. Madrid, Spain, 2009:351-a60.
  • 8Wu L, Li M, Li Z, Ma W Y, Yu N. Visual language model- ing for image classification/Proceedings of the international workshop on multimedia information retrieval. Augsburg, Germany, 2007:115-124.
  • 9Sivic J, Zisserman A. Video Google: A text retrieval approach to object matching in videos//Proceedings of the 9th IEEE International Conference on Computer Vision. Nice, France, 2003z 1470 1477.
  • 10Katz S. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Transactions on Acoustics, Speech and Signal Processing, 1987, 35(3): 400-401.

共引文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部