期刊文献+

基于路径聚类的页面访问次序的挖掘 被引量:2

Mining based on page visiting sequence of path clustering
下载PDF
导出
摘要 为了发现用户的行为模式以实现Web站点的结构优化,提出了基于用户访问路径的K-PathSearch算法。在对网页实施预处理后,结合页面链接参数,建立用户访问事务处理模型,形成有用数据集。提取样本分析用户的兴趣度,主要影响因素体现在访问次序、次数以及停留时间三方面,并利用重新定义的相似度将兴趣取向相类似的用户划分为一类;在此基础上,定义用户访问最长拟合路径,进而计算路径聚类中心。经计算,聚类数和聚类中心平均长度增比显著,表明模型和算法是可行和有效的。 In order to find the user behavior patterns to achieve the optimization of website structure. K-PathSearch algorithm is proposed based on user access path. First, combined with the page link parameters after web-page preprocessing, user access transaction processing model is established and useful data set is formed. Furthermore, user interest degree is analyzed on sam- ples. The three main affect factors reflected in access order, frequency and length of stay. Users which have the same interest are divided into a class after similarity of interest degree is redefined. Based on the user access, we can define the longest fitting path of user access and then calculate the path clustering center. The calculation shows that, the growth of the number of cluster and the cluster center average length is significant. It is proved that the model and algorithm are feasible and effective.
出处 《计算机工程与设计》 CSCD 北大核心 2013年第1期303-306,313,共5页 Computer Engineering and Design
关键词 聚类 路径聚类 用户访问事物 K-PathSearch算法 聚类中心 cluster path cluster user access transaction K-PathSearch algorithm cluster center
  • 相关文献

参考文献6

二级参考文献37

  • 1刘国营.基于路径聚类的Web用户访问模式发现算法[J].情报杂志,2005,24(7):18-19. 被引量:2
  • 2吴俊杰,陈俊杰,赵栓柱.基于用户访问兴趣的路径聚类研究[J].计算机工程与应用,2005,41(36):170-171. 被引量:7
  • 3Srivastava J,Cooley R,Deshpande M,et al.Web usage mining:Discovery and applications of usage patterns from web data[J].SIGKDD Explorations,2000,1(2):12 -23.
  • 4Sarwar B,Karypis G,Konstan J,et al.Analysis of recommendation algorithm for E-commerce[A].Proceedings of the 2nd ACM Conference on Electronic Commerce[C].New York:ACM Press,2000.158 - 167.
  • 5Brin S,Page L.The anatomy of large-scale hypertextual web search engine[J].Computer Networks and ISDN Systems,1998,30(1 -7):107 - 117.
  • 6Pirolli P,Pitkow J E.Distribution of surfers' paths through the world wide web:Empirical characterization[J].World Wide Web,1999,2(1 -2):29 -45.
  • 7Billsus D,Pazzani M J.Learning collaborative information filters[A].Proceedings of the Fifteenth International Conference on Machine Learning[C].San Francisco,CA,USA:Morgan Kaufmann Publishers,1998.46 - 54.
  • 8Fu X,Budzik J,Hammond K J.Mining navigation history for recommendation[A].Proceedings of the 2000 International Conference on Intelligent User Interfaces[C].New York,USA:ACM Press,2000.106 - 112.
  • 9Mobasher B,Dai H H,Luo T,et al.Discovery of aggregate usage profiles for Web personalization[A].Proceedings of the Web Mining for E-Commerce Workshop[C].New York,USA:ACM Press,2000.255 -264.
  • 10Agawal R,Strikant R.Fast algorithm for mining association rules[A].Proceedings of the 20th International Conference on Very Large Data[C].New York,USA:ACM Press,1994.487-499.

共引文献73

同被引文献19

  • 1张玉连,王权.基于浏览行为和浏览内容的用户兴趣建模[J].现代图书情报技术,2007(6):52-55. 被引量:24
  • 2RESNICK P, VARIAN H. R. Recommender systems [ J ].Communications of the ACM, 1997,40 ( 3) : 56-58.
  • 3EUZENAT J, SHVAIKO R Ontology matching [ M ].[S. 1. ] : Springer, 2007: 101-102.
  • 4KOUTSONIKOLA V A, VAKALI A I. A fuzzy bi-clustering ap-proach to correlate Web users and pages [ J ]. InternationalJournal of Knowledge and Web Intelligence, 2009, 1 ( 1/2);3-23.
  • 5CHANG Y I,SHEN J H,CHEN T I. Adata mining basedmethod for the incremental update of supporting personalized in-formation filtering [ J]. Journal of Information Science and En-gineering, 2008 , 24 (1) ; 129-142.
  • 6LIU R R, JIA C X, ZHOU T,et al. Personal recommendationvia modified collaborative filtering [ J ]. Physica A-StatisticalMechanics and ITS Applications, 2009, 388 (4) : 462-468.
  • 7吕善国,吴效葵,曹义亲.基于网络结构的推荐算法[J].实验室研究与探索,2012 (7): 278-280,368.
  • 8朱琳可.基于社交网络的个性化推荐服务研究[D].西安:西北大学,2012.
  • 9班志杰,古志民,金瑜.Web预取技术综述[J].计算机研究与发展,2009,46(2):202-210. 被引量:19
  • 10王昊,苏新宁.CSSCI本体概念模型的构建和描述[J].中国图书馆学报,2009,35(3):43-51. 被引量:6

引证文献2

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部