摘要
Web日志挖掘技术是 Web数据挖掘中最重要的应用。通过对挖掘服务器日志文件的分析和研究 ,可以对网站的组织结构及其性能进行改进 ,增加个性化服务 ,发现潜在的读者群体。数据预处理关系到 Web日志挖掘的质量。数据预处理包括数据清理、识别用户、识别用户会话、格式化 ,目的是分割服务器日志为多个独一无二的用户的一次访问序列 。
Web log mining is the most important application in Web data mining. We can improve the organization structure of Web site and its function ,increase personalized service and discover the potential reader group on the basis of the analysis and research of Web log mining documents. Data preprocessing decides the quality of Web log mining. It includes data clearing, user identifying, user session identifying, format, etc. and its aim is to separate Web server log into multi user reference strings and also give the reference type realization.
出处
《现代图书情报技术》
CSSCI
北大核心
2004年第12期55-57,17,共4页
New Technology of Library and Information Service