期刊文献+

电子文档信息自动挖掘技术中的预处理研究 被引量:2

Pre - processing in Automatic Information Mining for Electronic Documents
下载PDF
导出
摘要 基于Internet的信息挖掘是数据挖掘技术中的重要组成部分,也是网络信息处理领域中的一项新课题。本文介绍了Internet上的电子文档信息自动挖掘的概念和系统的体系结构,并给出了文档结构图解析、文档分类检索等电子文档自动挖掘的预处理过程及处理程序。 Internet information mining is an important data mining techniques, also a new problem in the domain of net information processing. This paper describes the concept and system structure of automatic information mining based on internet electronic documents, The pre -processing procedure and programs are given in the paper, for automatic information mining of electronic documents, such as the analysis of the documental strctural drawing and documental classified index,etc.
出处 《计算技术与自动化》 2002年第2期92-96,共5页 Computing Technology and Automation
基金 湖南省教育厅资助项目(项目编号:01C012)
关键词 电子文档 信息自动挖掘 预处理 数据挖掘 INTERNET DM Internet Electronic document Analysis Pre-processing
  • 相关文献

参考文献1

  • 1W. H. Innom.Building the Data Warehouse[].nd.2000

同被引文献15

  • 1Piatetsky-Shapiro G,Fayyad U,Smity P.From data mining to knowledge discovery:an overview[A].In:Advances in Knowledge Discovery and Data Mining[C].Cambridge,Mass:AAA/MIT Press,1996.1-34
  • 2Innom W H. Building the Data Warehouse, 2nd[M], 2000.
  • 3Wu K L, Yu P S, Ballman A. SpeedTracer: A Web usage mining and analysis tool.[J] IBM System Journal,1998,37(1):89~105.
  • 4Postel J. Simple mail transfer protocol. STD 10, RFC831, USC/Information Sciences Institute, Aug. 1983.
  • 5Crocker D. Standard for the format of ARPA Internet text message. STD 11, RFC 522, UDEL, Aug. 1982.
  • 6Myers J. Post office protocol- version 3. RFC1725, Dover Beach Consulting, Inc., Nov. 1994.
  • 7Borenstein N, Freed N. MIME (Multipurpose Internet Mall Extentions) part one:mechanisms for specifying and describing the format of Internet message bodies. RFC 1521, Bellcore,Innosoft, Sep. 1993.
  • 8Moore K. MIME (Multipurpose Internet Marl Extentions) part two: message header extensions for non--ASCⅡ text. RFC1522,University of Tennessee, Sep. 1993.
  • 9Rarnsdell B. S/MIME Version 3 Message Specification. IETF RFC 2633 ,Jun. 1999.
  • 10Sahami M, Dumais S, Heckerrnan D. A Baysian Approach to Filtering Junk E-Mail. AAAI 98 Workshop on Text Categorization. July 1998.

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部