期刊文献+

基于双层聚类方法的网页推荐模型 被引量:6

Scheme of web page usage prediction based on two-layer clustering method
下载PDF
导出
摘要 研究web用户访问模式的聚类问题,提出了双层的用户访问模式的聚类方法.第一层采用简单易实现的LVQ(学习向量量化)神经网络方法对日志中的用户访问模式进行简单聚类,在第二层的聚类中,采用加权的模糊c-均值的方法对第一层的聚类结果进行聚类.最后根据聚类结果产生描述该类用户行为的加权访问模式,并以此作为网页推荐依据.实验结果验证了该算法的有效性和可行性. Clustering web users' access patterns is discussed, and a two-layer clustering approach of user access patterns is proposed in this paper. At the first layer, the learning vector quantization (LVQ) approach is exploited to group the patterns from web logs into some clusters. At the second layer, the weighted fuzzy c-means approach is developed to deal with the clustering results of the first layer. Finally weighted access patterns are created to describe the surfing behaviors of web users in the class. Then the scheme of predicting web page usage could be built. The effectiveness and feasibility of the approach are testified by the experiment results.
作者 吴瑞
出处 《系统工程学报》 CSCD 北大核心 2013年第2期265-270,共6页 Journal of Systems Engineering
基金 国家自然科学基金资助项目(70802043) 山西省自然科学基金资助项目(2008011029-2)
关键词 WEB挖掘 WEB聚类 用户访问模式 模糊C-均值 web mining web clustering user access patterns fuzzy c-means
  • 相关文献

参考文献11

  • 1Eirinaki O. The world-wide web: Quagmire or gold mine[J]. Communications of the Association for Computing Machinery, 1996, 39(11): 65-68.
  • 2Facca F, Lanzi E Mining interesting knowledge from web logs: A survey[J]. Data & Knowledge Engineering, 2005, 53(3): 225-241.
  • 3Runkler T, Beadek J. Web mining with relational clustering[J]. International Journal of Approximate Reasoning, 2003, 32(2): 217- 236.
  • 4陈富赞,刘青,李敏强,寇纪淞.一种基于会话聚类算法的Web使用挖掘方法[J].系统工程学报,2012,27(1):129-136. 被引量:4
  • 5Krishnapram R, Joshi A. Low compexity fuzzy relational clustering algorithms for web mining[J]. IEEE Transactions on Fuzzy Systems, 2001, 9(1): 595-607.
  • 6Lingras P, West C. Interval set clustering of web users with rough k-means[J]. Journal of Intelligent Information Systems, 2004, 23(1): 5-16.
  • 7De S, Krishna E Clustering web transactions using rough approximation[J]. Fuzzy Sets and Systems, 2004, 1480): 131-138.
  • 8吴瑞,宁玉富,郭长友.基于模糊粗糙近似的web浏览模式的聚类[J].系统工程学报,2010,25(1):132-136. 被引量:3
  • 9Liu B, Liu Y, Expected value of fuzzy variable and fuzzy expected value model[J]. IEEE Transactions on Fuzzy Systems, 2002 10(4): 445-450.
  • 10Liu B. Theory and Practice of Uncertain Programming[M]. Heidelberg: Physica-Verlag, 2002: 102-108.

二级参考文献19

  • 1吴瑞,宁玉富,郭长友.基于模糊粗糙近似的web浏览模式的聚类[J].系统工程学报,2010,25(1):132-136. 被引量:3
  • 2Shahabi C,Zarkesh A,Adibi J,et al.Knowledge discovery from users Web-page navigation[C]// Proceedings of the 7th International Workshop on Research Issues of Data Mining,Birmingham:IEEE Computer Society,1997:20-29.
  • 3Chen M,Yu P.Data mining for path traversal patterns in a Web environment[C]// Proceedings of the 16th International Conference on Distributed Computing Systems.Washington:IEEE Computer Society,1996:27-30.
  • 4Yan T,Jacobsen M,Garcia-Molina H,et al.From user access patterns to dynamic hypertext linking[J].Computer Networks and ISDN Systems,1996,28(7):1007-1014.
  • 5Smith K,Ng A.Web page clustering using a self-organizing map of user navigation patterns[J].Decision Support Systems.2003, 35(2):245-256.
  • 6Baldi P,Frasconi P,Smyth P.Modeling the Internet and the Web[M].New York:Wiley,2003.
  • 7Alam S,Dobbie G,Riddle P.Particle swarm otimization based clustering of Web usage data[C]// Proceedings of IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.Washington:IEEE Computer Society.2008.3: 451-454.
  • 8Niknam T,Amiri B.An effcient hybrid approach based on PSO,ACO and K-means for cluster analysis[J].Applied Soft Computing. 2010,10(1):183-197.
  • 9Poli R,Kennedy J,Blackwell T.Particle swarm optimization:An overview[J].Swarm Intelligence,2007,1(1):33-57.
  • 10Spiliopoulou M,Mobasher B,Berendt B,et al.A framework for the evaluation of session reconstruction heuristics in Web-usage analysis[J].Informs Journal on Computing,2003,15(2):171-190.

共引文献5

同被引文献46

  • 1殷贤亮,张为.Web使用挖掘中的一种改进的会话识别方法[J].华中科技大学学报(自然科学版),2006,34(7):33-35. 被引量:27
  • 2吕佳.基于兴趣度的Web用户访问模式分析[J].计算机工程与设计,2007,28(10):2403-2404. 被引量:8
  • 3吴瑞,宁玉富.基于模糊粗糙k-均值的用户访问模式的聚类[J].系统工程理论与实践,2007,27(7):116-121. 被引量:4
  • 4Yfacca F,Lanzi P.Mining interesting knowledge from web logs:a survey[J].Data and Knowledge Engineering,2005,53(3):225-241.
  • 5Runker T,Beadek J.Web mining with relational clustering[J].International Journal of Approximate Reasoning,2003,32(2):217-236.
  • 6Liao T W.Clustering of time series data-a survey[J].Pattern Recognition,2005,38:1857-1874.
  • 7Rees J,Koehler G.Learning genetic algorithm parameters using hidden Markov models[J].European Journal of Operational Research,2006,175(2):806-820.
  • 8Kullback S,Leibler R A.On information and sufficiency[J].Annuals of Mathematical Statistics,1951,22(1):79-86.
  • 9De Angelis L,Dias J G.Mining categorical sequences from data using a hybrid clustering method[J].European Journal of Operational Research,2014,234(1):720-730.
  • 10Dempster A P,Laiard N M,Rubin D B.Maximum likelihood from incomplete data via the EM algorithm[J].Journal of the Royal Statistical Society Series B-Methodological,1977,39(1):1-38.

引证文献6

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部