摘要
数据预处理是Web使用挖掘的一个关键环节,其结果直接影响到后续的事务识别、路径分析、关联规则挖掘和序列模式挖掘的结果。提出了一种用户识别的通用算法、路径补充的启发式策略和基于主题规约的方法,并用实验证明了其高效性。
Data Preprocessing is a critical step in web usage mining.The results of Data Preprocessing is relevant to the next steps,such as transaction identification,path analysis,association rules mining,sequential patterns mining,and so forth.This text presents a currency algorithm for user identification、an heuristic rule for path completion and a method based theme statute.It is experimentally evaluated that not only its efficiency is high,but also it can identify user and session exactly.
出处
《计算机工程与应用》
CSCD
北大核心
2006年第A01期101-104,共4页
Computer Engineering and Applications
关键词
WEB使用挖掘
数据预处理
用户识别
会话识别
路径补充
主题规约
Web Usage Mining
data preprocessing
user identification
session identification
path completion
theme statute