摘要
在详细介绍ELF日志文件格式的基础上定义了会话表,并对预处理过程中几个主要步骤进行深入讨论,总结已有的各种处理手段提出新的改进方法,其中重点针对会话识别进行了改进并给出了新的算法。
Based on extended log file format,the session table was defined.Through the in-depth research of several major steps in the preprocessing,existed various means have been summarized to propose a new method.Especially,a new algorithm in session identification was improved and thus presented.
出处
《微型电脑应用》
2007年第10期50-53,6,共4页
Microcomputer Applications
关键词
WEB日志挖掘
数据预处理
用户识别
会话识别
事务识别
Web log mining
ELF
Data preprocessing
User identification
Session identification
Affairs identification