基于社会标注的Web资源语义聚类研究被引量：2

Semantic clustering of web resources based on social annotation

下载PDF

导出

摘要在深入分析社会标注系统中用户、标签及被标注Web资源之间的关联关系的基础上，提出了基于用户标签的Web资源语义描述获取算法，并基于所获取的Web资源语义描述及其与用户之间的关联关系，利用一种迭代的聚类算法对社会标注系统中的Web资源进行基于语义的聚类，该聚类算法通过迭代不断加强被聚类资源间的一致性信息，从而能够克服传统聚类算法所面临的数据稀疏以及性能问题。研究表明，对Web资源所处环境的各种关联关系的深入分析，能够帮助用户更好地理解和操作相关Web资源，尤其是对于本身特征不充分或难以获取的Web资源来说，关联关系的分析研究具有十分重要的意义。 By analyzing the correlations between users, tags and Web resources in social annotation systems, this paper proposes an algorithm to acquire the semantic descriptions of Web resources based on users＇ tags. And based on the acquired semantic descriptions and the correlations between the descriptions and users, an iterative algorithm is proposed for semantic clustering of the Web resources in social annotation systems. By mutually reinforcing the agreed information between Web resources during the clustering process, the clustering algorithm can tackle, to some extent, the challenges faced by traditional clustering algorithms such as the data sparseness and the performance constraints. The research illustrates the importance of the analysis of the correlations in the environment of Web resources, especially to those whose features are not sufficient or difficult to acquire.

作者杨鲲马慧芳史忠植

机构地区中国科学院计算技术研究所智能信息处理重点实验室中国科学院研究生院中国计量科学研究院

出处《高技术通讯》 CAS CSCD 北大核心 2012年第1期48-54,共7页 Chinese High Technology Letters

基金 863计划（2007AA01Z132）,国家自然科学基金（60435010）,973计划（2007CB311004）和国家科技支撑计划（2006BAC08B06）资助项目.

关键词社会标注语义抽取语义聚类算法广义关联 social annotation, semantic extraction, semantic clustering algorithm, general correlation

分类号 TP311.13 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献20

1Golder S A, Huberman B A. Usage patterns of collaborative tagging systems. Journal of Information Science, 2006,32 : 198-208.
2Halpin H, Robu V, Shepherd H. The complex dynamics of collaborative tagging, In: Proceedings of the 2007 International Conference on World Wide Web, Banff, Canada, 2007. 211-220.
3Zhou D, Bian J, Zheng S, et al. Exploring social annotations for information retrieval. In: Proceedings of the 2008 International Conference on World Wide Web, Beijing, China, 2008. 715-724.
4Hotho A, Jaschke R, Schmitz C, et al. Information retrieval in Folksonomies: search and ranking. In: Proceedings of the 2006 European Semantic Web Conference, Berlin, Germany, 2006:411-426.
5Bao S, Xue G, Wu X, et al. Optimizing web search using social annotations, In: Proceedings of the 2007 International Conference on World Wide Web, Banff, Canada, 2007. 501-510.
6Noll M G, Meinel C. Web search personalization via social bookmarking and tagging. In: Proceedings of the 2007 International Semantic Web Conference and Asia Semantic Web Conference, Busan, South Korea, 2007. 367-380.
7Xu S L, Bao S H, Fei B, et al. Exploring folksonomy for personalized search. In : Proceedings of the 31 st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Singapore, 2008. 155-162.
8Zhou M W, Bao S H, Wu X, et al. An unsupervised model for exploring hierarchical semantics from social annotations. In: Proceedings of the 2007 International Semantic Web Conference and Asia Semantic Web Conference, Busan, South Korea, 2007. 680-693.
9Li R, Bao S, Yu Y, et al. Toward effective browsing of large scale social annotations, In: Proceedings of the 2007 International Conference on World Wide Web, Banff, Canada, 2007. 943-952.
10Brooks C H, Montanez N. Improved annotation of the blogosphere via autotagging and hierarchical clustering. In: Proceedings of the 2006 International Conference on World Wide Web, Edinburgh,UK, 2006. 625-632.

同被引文献14

1谢艳玲,何丕廉,于鷃,孙越恒.一种高效的网页聚类方法[J].计算机工程与设计,2007,28(17):4229-4232. 被引量：7
2李云,田素方,李拓,徐涛.基于概念格的Web文本聚类[J].计算机工程与应用,2008,44(23):169-171. 被引量：3
3李星毅,曾路平,施化吉.基于单词相似度的文本聚类[J].计算机工程与设计,2009,30(8):1966-1968. 被引量：9
4毛嘉莉.基于K-means的文本聚类算法[J].计算机系统应用,2009,18(10):85-87. 被引量：9
5吴夙慧,成颖,郑彦宁,潘云涛.文本聚类中文本表示和相似度计算研究综述[J].情报科学,2012,30(4):622-627. 被引量：23
6李鹏,王斌,晋薇.Improving Web Document Clustering through Employing User-Related Tag Expansion Techniques[J].Journal of Computer Science & Technology,2012,27(3):554-566. 被引量：5
7何琳.基于多策略的领域本体术语抽取研究[J].情报学报,2012,31(8):798-804. 被引量：16
8李鹏,王斌,石志伟,崔雅超,李恒训.Tag-TextRank:一种基于Tag的网页关键词抽取方法[J].计算机研究与发展,2012,49(11):2344-2351. 被引量：56
9贺秋芳,曾启杰,蔡延光.挖掘用户标签的增强型社区网页聚类算法[J].微电子学与计算机,2013,30(2):74-77. 被引量：4
10何文静,何琳.基于社会标签的文本聚类研究[J].现代图书情报技术,2013(7):49-54. 被引量：8

引证文献2

1顾晓雪,章成志.结合内容和标签的Web文本聚类研究[J].现代图书情报技术,2014(11):45-52. 被引量：8
2郭红建,陈一飞.社会标注系统自适应网页聚类算法研究[J].电子科技,2018,31(8):73-76.

二级引证文献8

1黄凌云.图书馆数字资源自动推荐优化算法研究[J].情报探索,2016(2):25-29. 被引量：1
2洪文,聂延平,青巧.馆藏资源自动推荐模型结构与处理流程优化分析[J].情报理论与实践,2016,39(5):130-133. 被引量：1
3毕强,刘健,鲍玉来.基于语义相似度的文本聚类研究[J].现代图书情报技术,2016(12):9-16. 被引量：8
4钟学燕,陈国青,孙磊磊,张明月,刘澜.基于多视角特征融合的移动信息服务模式挖掘[J].系统工程理论与实践,2018,38(7):1853-1861. 被引量：5
5郭红建,陈一飞.社会标注系统自适应网页聚类算法研究[J].电子科技,2018,31(8):73-76.
6郭蕾蕾,俞璐,段国仑,陶性留.基于伴随文本信息的Web图像批量标注方法[J].信息技术与网络安全,2018,37(9):70-75.
7林淑贞.基于读者信息挖掘的图书馆资源推荐自动模型研究[J].情报探索,2018(4):6-10. 被引量：1
8郭蕾蕾,俞璐,段国仑,陶性留.基于AP聚类的多特征融合方法[J].计算机技术与发展,2019,29(8):47-52. 被引量：3

1朱新宁,冯辉.基于鱼群算法的异构数据库语义聚类的研究[J].计算机与数字工程,2013,41(1):12-13.
2潘钧.面向Web日志的语义聚类算法[J].计算机应用研究,2007,24(7):267-269. 被引量：1
3成欣,李扬.一种基于本体的异构数据语义抽取方法[J].计算机与现代化,2014(6):1-6. 被引量：2
4曹玉辉,王卫红,覃征,张森.基于有色Petri网的动态资源模型[J].微电子学与计算机,2006,23(5):63-65. 被引量：1
5王海东,张梁,王学光,李娜,刘继文.资源共享关联的语义相似度算法[J].福建电脑,2010,26(3):17-17. 被引量：2
6许峰,张定华,王明微,陈冰,李山.基于云制造平台的云资源语义描述研究[J].计算机工程与应用,2014,50(15):255-260. 被引量：7
7邓小清.基于网格的资源语义模型研究[J].四川文理学院学报,2011,21(2):61-63. 被引量：1
8康晓予,邓贵仕.基于概念模型语义描述的仿真模型资源搜索框架[J].计算机应用与软件,2011,28(3):32-36. 被引量：1
9关丽红.基于特定数学模型的语义抽取研究[J].白城师范学院学报,2012,26(3):11-13.
10冯明星,袁再龙.基于领域本体的P2P网络检索算法研究[J].中小企业管理与科技,2010(36):290-290.

高技术通讯

2012年第1期

浏览历史

内容加载中请稍等...

基于社会标注的Web资源语义聚类研究被引量：2

参考文献20

同被引文献14

引证文献2

二级引证文献8

相关作者

相关机构

相关主题

浏览历史

基于社会标注的Web资源语义聚类研究 被引量：2

参考文献20

同被引文献14

引证文献2

二级引证文献8

相关作者

相关机构

相关主题

浏览历史

基于社会标注的Web资源语义聚类研究被引量：2