期刊文献+

Web使用挖掘的数据采集技术探究 被引量:3

Research on Techniques of Data Collecting for Web Usage Mining
下载PDF
导出
摘要 如何准确、及时、全面地采集用户使用数据是Web使用挖掘的重要前提和基础。基于Web的基本结构,Web使用挖掘的数据源可以从Web服务器端、应用服务器端、代理服务器端和客户端进行采集。文中分析了传统的基于Web日志进行Web使用挖掘所面临的问题,讨论了建立在用户浏览行为基础上的客户端数据采集技术,重点讨论了其中的JavaApplet技术。通过Java Applet技术可以获取客户端IP,可以自动完成用户浏览信息的准确采集,可以广泛用于各类网站的个性化和智能化服务、站点结构改进、商业智能等。 How to collect users' data accurately and quickly and ensure data integrity is an important precondition and foundation for Web usage mining research. Based on the Web structure, the data source of Web usage mining can be collected from Web server, application server, agent server and client. In this paper, the problems facing traditional Web usage mining based on the Web log are analysed, the data collection techniques of client are discussed which is based on the users' browsing behaviours, and the Java Applet technique is much emphasized, which can help get the IP address of client, automaticly complete the accurate collection of users' browsing information, can he widely used for the Web sites' personal and intelligent service, for the improvement of Web structure, and for the business intelligence, etc.
出处 《计算机技术与发展》 2010年第3期225-229,共5页 Computer Technology and Development
基金 国家"十一五"计划项目(FIB070335-B8-08)
关键词 数据采集 WEB使用挖掘 WEB日志 JAVA APPLET data collecting Web usage mining Web Log Java Applet
  • 相关文献

参考文献9

  • 1涂承胜,陆玉昌.Web使用挖掘技术研究[J].小型微型计算机系统,2004,25(7):1177-1184. 被引量:37
  • 2向坚持,刘相滨,徐选华.基于用户行为的Web使用挖掘数据采集技术研究[J].计算机与现代化,2007(12):59-62. 被引量:8
  • 3Chen M S, Park J S, Yu P S. Data Mining for Path Traversal Patterns in a Web Environment[C] // In: Proceedings of the 16th International Conference on Distributed Computing Systems. Hong Kong: [ s. n. ], 1996: 385 - 392.
  • 4Yan Tak, Jacobsen M, Gareia Molina H, et al. From User Access Patterns to Dynamic Hyper text Linking[C] //In: Proceedings of the 5th International World Wide Web Corderence. Paris, Franee:[s.n.], 1996:1007-1014.
  • 5BoNes J, Levene M. Data Mining of User Navigation Patterns[C] // In: Proceedings of the WEBKDD' 99 Workshop on Web Usage Analysis and User Profiling. San Diego, CA, USA:[s.n. ], 1999:31-39.
  • 6SEIGERM MADSENMR LANGSTONJ etal 陆昌辉 张光剑 陈佐 译.点击流数据仓库[M].北京:电子工业出版社,2004..
  • 7朱志国,邓贵仕.Web使用挖掘技术的分析与研究[J].计算机应用研究,2008(1):29-32. 被引量:23
  • 8刘立军,周军,梅红岩.Web使用挖掘的数据预处理[J].计算机科学,2007,34(5):200-201. 被引量:22
  • 9Schildt H.Java2参考大全[M].第5版.周志彬,吕建宁,章小莉译.北京:电子工业出版社,2004.

二级参考文献35

  • 1刘洪涛,张平,黄智兴,程静,刘革平.用户浏览行为数据采集方法综述[J].西南科技大学学报,2004,19(2):45-49. 被引量:6
  • 2向坚持,陈晓红,刘相滨,徐选华.基于Web Log的数据预处理研究[J].湖南师范大学自然科学学报,2004,27(4):33-36. 被引量:4
  • 3Mark Sweiger, Mark R Mandsen, Jimmy Langston, Howard Lombard. 点击流数据仓库[M].北京:电子工业出版社,2004.
  • 4Pyle D.Data Preparation for Data Mining.Morgan Kaufmann Publishers Inc,San Francisco,CA,1999.540
  • 5Cooley R,Mobasher B,Srivastava J.Data preparation for mining World Wide Web browsing patterns.Journal of Knowledge and Information Systems,1999,1(1):5~32
  • 6Tan P,Kumar V.Discovery of Web robot sessions based on their navigational patterns.Data Mining and Knowledge Discovery,2002,6:9~35
  • 7Jetal S.Web Usage Mining:Discovery and application of usage patterns from Web data[J].SIGKDD Explorations,2000,1(2):12~23
  • 8Cooley R,Mobasher B,Srivastava J.Data Preparation for Mining World Wide Web Browsing Patterns[J].Journal of Knowledge and Information Systems,1999,1(1):5~32
  • 9Chen MS,Park J S,Yu PS.Data Mining for Path Traversal Patterns[A].In:Proc.of the 16th Int'l Confon Distributed Computing System[C].Hong Kong,1996
  • 10Perkowitz M,Etzioni O.Towards adaptive Web sites:Coneeptual framework and case study[J].Artificial Intelligence,2000,118:245~275

共引文献80

同被引文献36

引证文献3

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部