期刊文献+

Web日志挖掘数据预处理方法研究 被引量:2

Research on Data Preprocessing Method in Web Log Mining
下载PDF
导出
摘要 Web日志挖掘技术是 Web数据挖掘中最重要的应用。通过对挖掘服务器日志文件的分析和研究 ,可以对网站的组织结构及其性能进行改进 ,增加个性化服务 ,发现潜在的读者群体。数据预处理关系到 Web日志挖掘的质量。数据预处理包括数据清理、识别用户、识别用户会话、格式化 ,目的是分割服务器日志为多个独一无二的用户的一次访问序列 。 Web log mining is the most important application in Web data mining. We can improve the organization structure of Web site and its function ,increase personalized service and discover the potential reader group on the basis of the analysis and research of Web log mining documents. Data preprocessing decides the quality of Web log mining. It includes data clearing, user identifying, user session identifying, format, etc. and its aim is to separate Web server log into multi user reference strings and also give the reference type realization.
作者 柳胜国
出处 《现代图书情报技术》 CSSCI 北大核心 2004年第12期55-57,17,共4页 New Technology of Library and Information Service
关键词 WEB日志挖掘 数据挖掘 数据预处理 方法研究 Web log mining Data mining Data preprocessing Research method
  • 相关文献

参考文献4

二级参考文献17

  • 1(加)HanJ KamberM.数据挖掘概念与技术[M].北京:机械工业出版社,2001..
  • 2[1]R Agrwal,R Srikant. Fast algorithms for mining association rules[C].In:Proc of the 20th VLDB conference,1994
  • 3[2]R Cooley,B Mobasher,J Srivastava. Web mining:Inoformation and pattern disvovery on the World Wide Web[C].In:International Conference on Tools with Artificial Intelligence,Newport Beach,CA, 1997:558~567
  • 4[3]Open Market Inc. Open Market Web reporter. http://www. openmarket.com, 1996
  • 5[4]T Bray,J Paoli,C M Sperberg-McQueen. Extensible markup Language (XML)1.0 W3c recommendation[R].Technical report,W3c,1998
  • 6[5]T Joachims,d Freitag,T Mitchell.Webwatcher:A tour guide for the World Wide Web[C].In:Proc of the 15 th Conference on Artificial Intelligence, Nagoya, Japan, 1997: 770~775
  • 7[6]L E Baum,T Petrie. Statistical inference for probabilistic functions of finite state[J].Ann Math Stat, 1996
  • 8[7]R Agrawal. Data mining:Crossing the chasm[R].Invited talk at the 5th ACM SiGKDD Int conference on Knowledge Discovery and Data Mining, 1996
  • 9[8]Ralph Kimball,Richard Merz.The Data Webhouse Toolkit.John Wiley and Sons,Inc.2000
  • 10[9]Thorsten Joachinms. Text categorization with support vector machines:Learning with many relevant features[C].In:European Conference onMachine Leaning(ECML), 1998

共引文献101

同被引文献17

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部