期刊文献+

综合多重评价因素的Web用户聚类算法 被引量:4

An Algorithm of Integrating Multiple Evaluating-Factors for Clustering Web-Users
下载PDF
导出
摘要 文章提出了综合多重评价因素的Web用户聚类算法;首先从评价因素的数学特征出发,提出了Web资源偏爱度与Web资源关联度的概念,然后运用Kruskal算法的基本原理在由Web资源和Web访问行为所构成的无向图内寻找寻频繁路径,再根据频繁路径和Web资源偏爱度与关联度阈值对Web用户进行聚类处理。该算法在一定程度上提高了Web用户聚类算法的准确性与执行效率。 A new algorithm for Clustering Web-Users with Integrating Evaluating-Factors is proposed in this paper. First,by analyzing the mathematical characteristics of evaluating-factors,this paper proposes the concepts of preference and relation of Web resources.Then,we use the basic principle of Kruskal to search for frequent traversal paths in the undigraph composed of web resources and web-users' activities,and cluster Web-Users according to the frequent traversal path and the thresholds of preference and relation.This algorithm improves the accuracy and efficiency of the algorithms for clustering web-users.
作者 吴跃进
出处 《计算机工程与应用》 CSCD 北大核心 2006年第28期147-149,210,共4页 Computer Engineering and Applications
关键词 评价因素 偏爱度 关联度 频繁路径 用户聚类 evaluating-factor,preference,relation,frequent path,cluster Web-user
  • 相关文献

参考文献8

  • 1Ming-Syan Chen,Jong Soo Park,Philip S Yu.Data Mining for Path Traversal Patterns in a Web Environment[C].In:Proc of the 16th Int Conf on Distributed Computing Systems,Hong Kong:IEEE cs Press,1996:385~392
  • 2Borges J,Levene M.Mining Association Rules in Hypertext Database[C].In:Proc the 4th Int'l Conf on Knowledge Discovery and Data Mining,Menlo Park:AAA I Press,1998:149~ 153
  • 3M Fayyad,G Piatetsky-Shapiro,Smyth.Advances in Knowledge Discovery and Data Mining[M].Menlo Park:AAA I Press,1996:1~34
  • 4J Han,O R Zaiane,M Xin.Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs[C].In:Proc Advances in Digital Libraries Conf(ADL'98),Santa Barbara,CA,1998:19~29
  • 5Alejandro A Vaisman,Gabriel DanDretta,Mariela Sapia.Enhancing Web Access Using Data Mining Techiques[C].In:Proceeding of the 14th International Workshop on Database and Expert Systems Application (DEXA'03),2003
  • 6邢东山,沈钧毅.一个可以准确反映Web浏览兴趣的度量值——偏爱度[J].控制与决策,2004,19(3):307-310. 被引量:10
  • 7吴妮娅,张健沛.Web日志模糊聚类算法的研究[J].哈尔滨师范大学自然科学学报,2003,19(5):63-66. 被引量:3
  • 8宋擒豹,沈钧毅.Web日志的高效多能挖掘算法[J].计算机研究与发展,2001,38(3):328-333. 被引量:115

二级参考文献19

  • 1[1]Paliours G, Papatheodorou C, Karkaletsis V, et al. Clustering the users of large Web sites into communities [A]. International Conference on Machine Learning, California, 2000
  • 2[2]Fu Y, Sandhu K, Shih M, Clustering of Web users based on access patterns [ A]. International Workshop on Web Usage Analysis and User Profiling, San Diego, 1999
  • 3[3]C. Shahabe, A. M. Zarkesh, J. Abidi and V. Shah, Knowledge discovery from user's web - page navigation, Proc. Seventh IEEE Intl. Workshop on Research Issue in Data Engineering, 1997
  • 4[4]Fu Y, Sandhu K, Shih M. A generalization - based approach to clustering of Web usage session [ A ]. International Workshop on Web Usage Analysis and User Profiling, 2000
  • 5[5]Colley R, Mobasher B, Srivastava J. Grouping Web page references into transactions for mining world wide web browsing patterns [ A ]. 1997 IEEE Knowledge and Data Engineering Exchange Workshop [ C ]. Newport Beach, CA: IEEE Computer Soc, 1997
  • 6[6]Yongjian Fu, Kanwalpreet Sandhu and Ming-Yi Shih, ″Clustering of Web Users Based on Access Patterns″
  • 7[7]J. C. Bezdek. ″Pattern Recognition with Fuzzy Objective Function Algorithms.″ Plenum Press, New York, 1981
  • 8[8]K. S. Fu. ″Syntactic Pattern Recognition and Applications.″Academic Press, San Diego, CA. 1982
  • 9Zaiane O R,Proc Advances Digital Libraries Conf,1998年,19页
  • 10Chen M S,Proc of the 16th Int Conf Distributed Computing Systems,1996年,385页

共引文献124

同被引文献45

引证文献4

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部