期刊文献+

Web日志挖掘中会话识别方法研究 被引量:4

ON METHOD OF SESSION IDENTIFICATION IN WEB LOG MINING
下载PDF
导出
摘要 提出一种新的基于时间阈值会话识别算法,在时间阈值的计算上,既考虑了站点页面内容和结构的差异性,同时也考虑了访问者的个体差异性。相对于所有用户使用单一先验阈值和使用统计方法结合页面内容确定阈值的方法,方法能更准确地确定页面访问时间阈值,进行会话识别时具有更高的效率和真实性。 This paper presents a new kind of session identification algorithm based on time threshold.When calculating the time threshold value,we have considered the difference of the content and the structure of the website pages,the individual difference of visitors are also considered simultaneously.Contrasting to traditional methods that define the uniform threshold for all users with the priori threshold and with the statistical method in conjunction with page contents,the approach presented in this paper can determine the webpage access time threshold more accurately.It has a higher efficiency and reality when identifying sessions.
作者 张毅
机构地区 浙江万里学院
出处 《计算机应用与软件》 CSCD 2010年第6期92-94,共3页 Computer Applications and Software
基金 浙江省教育厅科研计划基金项目(200070733)
关键词 WEB日志挖掘 会话识别 阈值 数据预处理 Web log mining Session identification Threshold Data pre-processing
  • 相关文献

参考文献5

二级参考文献17

  • 1欧阳一鸣,汪曦东,郭骏,刘红樱.Web使用挖掘数据预处理中的会话构造[J].计算机工程与应用,2005,41(25):148-151. 被引量:11
  • 2金松河,钱慎一,张素智.Frame页面过滤算法在Web日志挖掘预处理中的应用[J].云南民族大学学报(自然科学版),2006,15(1):63-65. 被引量:2
  • 3陈子军,王鑫昱,李伟.一种Web日志会话识别的优化方法[J].计算机工程,2007,33(1):95-97. 被引量:18
  • 4Spiliopoulou M,Mobasher B,Berendt B,et al.The impact of site structure and user environment on session reconstruction in web usage analysis [J]. Informs Journal of Computing, Special Issue on Web Based Data for E-Business Applications, 2003,15 (2): 171-190.
  • 5Srivastava J, Cooley R, Deshpande M,et al.Web usage mining: Discovery and applications of usages patterns from web data[C]. SIGKDD Explorations,2000.
  • 6Facca F M,Lanzi P L.Mining interesting knowledge from web-logs:A survey[J].Data and Knowledge Engineering,2005,53(3): 225-241.
  • 7熊忠阳,周亚峰.Web访问挖掘的预处理技术的研究[J].计算机技术与发展,2007,17(8):11-14. 被引量:19
  • 8Yang Qiang, Zhang Haining, Li Tianyi. Mining Web logs for prediction models in WWW caching and prefecting[C]//The Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining KDD'01. San Francisco: ACM SIGKDD, 2001.
  • 9Mikroyannidis A, Theodoulidis B. A theoretical framework and an implementation architecture for self adaptive Web sites[C]// Prodeedings of the IEEE/WIC/ACM International Conference on Web Intelligence(WI'04), Beijing: IEEE Press, 2004.
  • 10Berendt B, Mobasher B, Nakagawa M, et al. The impact of site structure and user environment on session reconstruction in Web usage analysis[C]// Proceedings of the 4th WebKDD 2002 Workshop at the ACM-SIGKDD Conference on Knowledge Discovery in Database. Edmonton, Alberta: ACM SIGKDD,2002.

共引文献37

同被引文献21

引证文献4

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部