摘要
会话识别是Web日志数据预处理中的重要步骤,直接影响着Web日志挖掘的效率和准确性。在给出会话识别定义的基础上.对传统的预先设定时间间隔方法进行了优化,并具体描述了数据结构及其算法。实验结果证明会话质量得到了提高。
Session identification is the important process of data preprocessing in web log mining, which directly affects the impact and accuracy of web log mining. The definition of session identification is given, the traditional method of preestablished time interval is optimized and the algorithm is described concretely based on the data structure. The empirical analysis prove that the quality of session is improved.
作者
李瑞
朱鹤祥
LI Rui, ZHU He-xiang (School of Software,Dalian Jiaotong University,Dalian 116028,China)
出处
《电脑知识与技术》
2009年第11期8616-8618,共3页
Computer Knowledge and Technology
基金
辽宁省基金项目,绿色制造模式下智能优化算法应用研究(20072161)