Abstract
Web log mining is the process of applying data mining techniques to discover and extract information from Web logs. Data preprocessing is a key stage of Web log mining. This paper studies each stage of data preprocessing and introduces some special handling methods used in those stages. Based on an analysis of the Web server log data format, it gives a formal description of the concept of a session. Then, building on an analysis of current session construction algorithms, it proposes a heuristic method based on time and referrer for constructing sessions.
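The abstract names a heuristic based on time and referrer but gives no details, so the following is only a minimal sketch of what such a heuristic could look like, not the paper's algorithm. The `Request` record, the 30-minute timeout, and the rule for requests without a referrer are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Request:
    """Hypothetical cleaned log record; field names are assumptions."""
    user: str                 # e.g. IP address plus user agent
    time: float               # request time in seconds
    url: str
    referrer: Optional[str]   # None when the log field is empty ("-")


ASSUMED_TIMEOUT = 30 * 60     # assumed threshold in seconds, not from the paper


def build_sessions(requests: List[Request],
                   timeout: float = ASSUMED_TIMEOUT) -> List[List[Request]]:
    """Group requests into sessions using a time rule and a referrer rule.

    A request joins one of its user's existing sessions only if
    (a) it arrives within `timeout` of that session's last request, and
    (b) its referrer is missing or is a URL already in that session.
    Otherwise it starts a new session.
    """
    sessions: List[List[Request]] = []
    by_user: Dict[str, List[List[Request]]] = {}

    for req in sorted(requests, key=lambda r: r.time):
        target = None
        # Check the user's most recent sessions first.
        for session in reversed(by_user.get(req.user, [])):
            recent = req.time - session[-1].time <= timeout
            linked = req.referrer is None or any(
                prev.url == req.referrer for prev in session
            )
            if recent and linked:
                target = session
                break
        if target is None:
            target = []
            by_user.setdefault(req.user, []).append(target)
            sessions.append(target)
        target.append(req)

    return sessions


example_log = [
    Request("u1", 0, "/index", None),
    Request("u1", 60, "/a", "/index"),
    Request("u1", 120, "/x", "/other"),    # referrer not seen in any session
    Request("u1", 5000, "/b", "/index"),   # exceeds the assumed timeout
]
example_sessions = build_sessions(example_log)
```

In this sketch the third request opens a new session because its referrer was never visited, and the fourth opens another because the time gap exceeds the assumed threshold.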
Source
《计算机技术与发展》
2007, No. 8, pp. 11-14, 18 (5 pages in total)
Computer Technology and Development