
Machine Learning and Web Information Processing (cited by 3)
Abstract: Machine learning plays an important role in web information processing. GHunt is an intelligent system for acquiring and processing web information that applies several machine learning techniques. First, the system supports distributed, parallel searching and content filtering of web information. Second, it uses machine learning techniques, including text classification, clustering, and concept extraction, to identify the domain of a web page and to understand its text at the concept level. Third, it manages text information in a unified way on the basis of a semantic concept space. Finally, it provides efficient text retrieval based on concept semantics, together with personalized topic organization and information push services. The paper focuses on the machine learning techniques used in the system.
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list, 2004, No. 33, pp. 189-191 (3 pages)
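Funding: Supported by the National Natural Science Foundation of China (Nos. 90104021, 60173017, 60073019) and the Beijing Key Natural Science Foundation (No. 4011003)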
Keywords: web information; machine learning; semantic concept space; classification; clustering