期刊文献+

Web数据挖掘系统的设计及实现研究 被引量:17

Research on the design and implementation of web mining system
下载PDF
导出
摘要 在全球信息化进程中,信息超载已经成为一个大问题。Web上信息虽多,但想找到需要的信息却很困难。人们通过点击和搜索引擎与Web进行交互,但是都不能从中准确快捷地获取需要的信息,Web数据挖掘技术就是解决此问题的好方法。讲述了Web数据挖掘的基本理论,根据挖掘对象的不同将其划分为Web内容挖掘、Web链接结构挖掘和Web访问信息挖掘;利用HTML网页的特殊结构性质,提出了一种Web数据挖掘系统的通用框架,并讨论了一些实现的具体技术。 Information overloading is a big problem in the global informatization. The web is huge, but it is difficult to find what we needed on it. We interactive with the web by click or search engines, however neither can helps us get what we want ac-curately and immediately. The web mining techniques can resolve these problems. The theories of web mining are discussed in this paper. Based on the objects, we classify web mining into three categories, namely web content mining, web structure mining, and web usage mining. Finally, we proposed a general frame of web mining systems based on the specific structures of HTML pages. The implementation is also discussed in details.
出处 《计算机工程与设计》 CSCD 2002年第7期36-38,45,共4页 Computer Engineering and Design
关键词 WEB 数据挖掘 数据库 设计 www web mining VSM HITS HTML
  • 相关文献

参考文献7

  • 1[1]Steve Lawrence, Lee Giles C. Searching the World Wide Web [J]. Science. 1998,280:98-100.
  • 2[2]Steve Lawrence, Lee GilesC. Accessibility of Information on the Web [J]. Nature. 1999,400:107-109.
  • 3[3]http://www.eefind.com[EB].
  • 4[4]Etzioni O. The World Wide Web: quagmire or gold mine? [J]. Communications of ACM, 1996 ,39(11), 65-68.
  • 5[5]Robert Walker Cooley. Web Usage Mining: Discovery and Application of Interesting Patters from Web Data [D]. University of Minnesota ,2000,5.
  • 6[6]Kleinberg J. Authoritative Sources in a Hyperlinked Environment[C].In ACM-SIAM Symposium on Discrete Algorithms , 1998.
  • 7[7]Brin S , Page L. The Anatomy of a Large-scale Hypertextual Web Search Engine[C]. In 7th International World Wide Web Conference , Brisbane, Australia, 1998.

同被引文献104

引证文献17

二级引证文献71

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部