期刊文献+

基于Web页面结构和主色调的聚类算法 被引量:1

Clustering Algorithm Based on Web Pages Structure and Dominant Color
下载PDF
导出
摘要 针对目前Web聚类准确率不高的问题,提出一种基于Web页面链接结构和页面中图片主色调特征的聚类算法。通过分析Web页面中的链接结构和Web页面中所显示图片的主色调来比较页面之间的相似度,对Web站点中的Web页面进行聚类。聚类过程兼顾Web页面结构和页面的主要色彩特征。系统实验结果表明,该算法能有效提高聚类的准确性。 Aiming at the problem that the efficiency is low in Web clustering, this paper proposes a clustering algorithm based on linkage structure and the character of the dominant color on Web pages. It compares the similarity between Web pages by analyzing the linkage and the dominant color on them. It can cluster the Web pages on Web sites. In this procedure, the clustering has both the structure and the main character of tone. Experimental results of the system prove that it has made the clustering become more efficient and it has improved a lot than before.
出处 《计算机工程》 CAS CSCD 北大核心 2010年第3期1-3,共3页 Computer Engineering
基金 国家自然科学基金资助项目(60773004) 山西省自然科学基金资助项目(2006011030 2007011050)
关键词 聚类 WEB挖掘 链接结构 主色调 clustering Web mining linkage structure dominant color
  • 相关文献

参考文献5

  • 1刘远超,王晓龙,徐志明,关毅.文档聚类综述[J].中文信息学报,2006,20(3):55-62. 被引量:65
  • 2Menczer F. Lexical and Semantic Clustering by Web Links[J]. Journal of the American Society for Information Science and Technology, 2004, 55(14): 1261-1269.
  • 3Crescenzi V, Merialdo P, Missier P. Clustering Web Pages Based on Their Structure[J]. Data & Knowledge Engineering, 2005, 54(3): 279-299.
  • 4冯霞,黄亚楼.基于色彩主特征的快速图像检索[J].数据采集与处理,2005,20(2):198-202. 被引量:2
  • 5石林.基于对象的Web图像检索研究[D].济南:山东师范大学,2007.

二级参考文献48

  • 1陈浩,何婷婷,姬东鸿.基于k-means聚类的无导词义消歧[J].中文信息学报,2005,19(4):10-16. 被引量:16
  • 2Smeulders A, Worring M, Santini S, et al.Content-based image retrieval at the end of the early years[J].IEEE Trans Pattern Analysis and Machine Intelligence, 2000,22(12):1349~1380.
  • 3Rui Y,Huang T S,Ortega M,et al.Relevance feedback: a power tool for interactive content-based image retrieval[J].IEEE Trans Circuits and Systems for Video Technology,1998,8(5):644~655.
  • 4Swain M J, Ballard D H.Color indexing[J].International Journal of Computer Vision,1991,7(1):11~32.
  • 5Rui Y, Huang T S, Chang S F.Image retrieval: current techniques, promising directions, and open issues[J].Journal of Visual Communication and Image Representation, 1999,10:39~63.
  • 6Chen J, Bouman C, Dalton J.Hierarchical browsing and search of large image databases[J].IEEE Trans on Image Processing, 2000,9(3):442~455.
  • 7Rubner Y.Perceptual metrics for image databasenavigation[D].Stanford University, 1999.
  • 8Carson C, Belongie S, Greenspan H, et al.Blob-world: image segmentation using expectation-maximization and its application to image querying[A].IEEE Trans PAMI[C].2002.1026~1038.
  • 9Simoncelli E, Olshausen B.Natural image statistics and neural representation[J].Annual Review of Neuroscience, 2001,24:1193~1215.
  • 10Qiu G.Bipartite graph partitioning and content-based image clustering[A].CVMP 2004, 1st European Conference on Visual Media Production (CVMP), the IEE[C].Savoy Place, London, 2004.15~16.

共引文献65

同被引文献6

  • 1Han J,Kamber M.Data Mining: Concepts and Techniques[M].San Francisco,USA: Morgan Kaufmann Publishers,2006.
  • 2Cohn D,Hofmann T.The Missing Link——A Probabilistic Model of Document Content and Hypertext Connectivity[C]//Proc.of Advances in Neural Information Processing Systems.Cambridge,USA: MIT Press,2001: 430-436.
  • 3Weiss R,Velez B,Sheldon M,et al.Hypursuit: A Hierarchical Network Search Engine That Exploits Content Link Hypertext Clustering[C]//Proc.of the 7th ACM Conference on Hypertext.New York,USA: ACM Press,1996: 180-193.
  • 4Modha D,Spangler W.Clustering Hypertext with Applications to Web Searching[C]//Proc.of the 11th ACM Conference on Hypertext and Hypermedia.San Antonio,USA: ACM Press,2000: 123-132.
  • 5王立敏,高学东,武森.基于属性-链接的聚类算法综述[J].计算机应用研究,2008,25(6):1622-1625. 被引量:2
  • 6谌志群,王小华,王荣波.一种结构与内容相结合的XML文档聚类方法[J].情报学报,2009,28(5):693-699. 被引量:4

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部