
A Distributed Web Usage Pattern Mining Model and Algorithms (cited by: 2)

Construction and algorithms of distributed web usage pattern mining
Abstract: A distributed Web log mining system model (DWLMS) is presented. Based on an analysis of the Web frequent access pattern mining procedure and its algorithms, two updating algorithms built on DWLMS are proposed: LFP for local frequent paths and GFP for global frequent paths. These algorithms address the difficulties that pattern analysis faces when Web access data is stored at remote sites, grows in real time, and incurs communication cost in a distributed algorithm. The method was implemented in a simple form in the laboratory and tested on real Web log data; the results indicate that the algorithms are effective.
Source: Journal of University of Science and Technology Beijing (indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2006, No. 9, pp. 896-901 (6 pages)
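Funding: National Natural Science Foundation of China (No. 70431002); Key Laboratory Project of Beijing Electronic Science and Technology Institute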
Keywords: distributed data mining; Web access pattern mining; Web log mining; frequent path
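
The abstract names local and global frequent paths but does not give the details of LFP or GFP. As a rough illustration of the general idea only, the sketch below counts contiguous page sequences at each site and merges the per-site counts to find globally frequent paths. It is not the paper's algorithm: the function names `count_paths` and `global_frequent_paths`, the sample sessions, and the support threshold are all assumptions made for this example.

```python
from collections import Counter

def count_paths(sessions, length):
    """Count each contiguous page sequence of the given length, once per session."""
    counts = Counter()
    for pages in sessions:
        # A set ensures a path repeated inside one session is counted once.
        seen = {tuple(pages[i:i + length]) for i in range(len(pages) - length + 1)}
        counts.update(seen)
    return counts

def global_frequent_paths(site_counts, total_sessions, min_support):
    """Sum per-site counts and keep paths whose global support meets the threshold."""
    merged = Counter()
    for counts in site_counts:
        merged.update(counts)
    return {p: c for p, c in merged.items() if c / total_sessions >= min_support}

# Hypothetical sessions recorded at two sites (illustrative data only).
site_a = [["/", "/news", "/news/1"], ["/", "/news", "/about"]]
site_b = [["/", "/news", "/news/1"], ["/about", "/"]]

local = [count_paths(site_a, 2), count_paths(site_b, 2)]
frequent = global_frequent_paths(local, total_sessions=4, min_support=0.5)
print(frequent)
```

When new sessions arrive at a site, only those sessions need to be counted and added to that site's existing counter, for example `local[0].update(count_paths(new_sessions, 2))`. This sketch ships every local count to the merge step; it makes no attempt to reduce communication volume, which is one of the difficulties the abstract says the paper addresses.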


