Diversifying Tag Selection Result by Improving Both Coverage and Dissimilarity (增强覆盖度与非相似性的标签选择多样化方法)

Abstract: Tag clouds are a popular mechanism that social websites use to summarize and navigate online resources. Tag selection, which picks a limited number of representative tags from a large tag set, is the core task in creating a tag cloud. The diversity of the tag selection result is an important factor in user satisfaction. Information coverage and tag dissimilarity are the two main perspectives from which diversity is introduced into tag selection. To further improve both the information coverage and the tag dissimilarity of tag selection results, this paper proposes three tag selection approaches. In each approach, an objective function is defined to quantify both the information coverage and the tag dissimilarity of a tag set, and an approximation algorithm is designed to solve the corresponding maximization problem; the approximation ratio of each algorithm is also analyzed. The proposed approaches are compared with existing ones on tagging datasets from the CiteULike and Last.fm websites. The experimental results show that the proposed approaches perform well in terms of both information coverage and tag dissimilarity.
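The abstract does not give the objective functions themselves, so the following is only a minimal sketch of the general idea: score a tag set by combining coverage and dissimilarity, then grow the set greedily. The weighted-sum objective, the use of mean pairwise Jaccard distance, the `weight` parameter, and all function names are assumptions made for illustration. They are not the paper's three approaches, and no approximation ratio is claimed for this loop.

```python
from itertools import combinations


def coverage(selected, tag_resources, all_resources):
    """Fraction of all resources annotated by at least one selected tag."""
    if not all_resources:
        return 0.0
    covered = set()
    for tag in selected:
        covered |= tag_resources[tag]
    return len(covered) / len(all_resources)


def jaccard_distance(a, b):
    """1 - |a & b| / |a | b|; defined as 0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def dissimilarity(selected, tag_resources):
    """Mean pairwise Jaccard distance; 0 for fewer than two tags."""
    pairs = list(combinations(selected, 2))
    if not pairs:
        return 0.0
    total = sum(
        jaccard_distance(tag_resources[s], tag_resources[t]) for s, t in pairs
    )
    return total / len(pairs)


def objective(selected, tag_resources, all_resources, weight=0.5):
    """Hypothetical objective: weighted sum of coverage and dissimilarity."""
    return (
        weight * coverage(selected, tag_resources, all_resources)
        + (1.0 - weight) * dissimilarity(selected, tag_resources)
    )


def greedy_select(tag_resources, k, weight=0.5):
    """Pick up to k tags, each time adding the tag that maximizes the objective."""
    all_resources = set().union(*tag_resources.values())
    candidates = sorted(tag_resources)  # sorted for deterministic tie-breaking
    selected = []
    while candidates and len(selected) < k:
        best = max(
            candidates,
            key=lambda t: objective(
                selected + [t], tag_resources, all_resources, weight
            ),
        )
        selected.append(best)
        candidates.remove(best)
    return selected


if __name__ == "__main__":
    # Toy tagging data (invented): tag -> set of resource ids it annotates.
    sample = {
        "python": {1, 2, 3},
        "code": {1, 2, 3},
        "music": {4, 5},
        "rock": {5},
    }
    print(greedy_select(sample, k=2))
```

In the toy data, "python" and "code" annotate exactly the same resources, so a selection that contains one of them gains neither coverage nor dissimilarity by adding the other. That redundancy is what a diversity-aware objective is meant to penalize.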

Authors: Wang Meiling (汪美玲), Zhou Xiang (周翔), Tao Qiuming (陶秋铭), Zhao Chen (赵琛)

Affiliations: Institute of Software, Chinese Academy of Sciences; Graduate University of Chinese Academy of Sciences

Source: Journal of Software (《软件学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2015, No. 9, pp. 2326-2338 (13 pages)

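Funding: National Natural Science Foundation of China (61100067); Strategic Priority Research Program of the Chinese Academy of Sciences (XDA06010600)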

Keywords: tag cloud; tag selection; result diversification; information coverage; dissimilarity

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]

