
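A Dynamically Updated Knowledge System Model and Its Application to Thematic Information Collection (cited by: 2)

Published English title: A Knowledge System Model of the Dynamic Update Application in Collecting Thematic Information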
Abstract: Thematic information collection usually refers to the process of obtaining specifically needed information from massive web information resources on the basis of the concepts that define a topic's content. These concepts are mainly expressed through a systematic domain knowledge system. However, collecting information according to a domain knowledge system requires the domain knowledge to be updated by hand, which is inefficient and yields low recall. This paper introduces a semi-automatic scheme for dynamically updating the domain knowledge system to guide thematic information collection. An improved query-expansion algorithm based on association rules discovers new domain knowledge keywords, which are then screened semi-automatically with human involvement to form a knowledge description model. The model is designed for and applied in the network information discovery system of EMALS. Finally, experiments show that the dynamic knowledge evolution scheme is feasible.
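Authors: 汪维熙 (Wang Weixi), 马静 (Ma Jing)
Source: 《情报学报》 (Journal of the China Society for Scientific and Technical Information), CSSCI, PKU Core, 2012, No. 6, pp. 583-588 (6 pages)
Funding: This paper is one of the outputs of a national defense technology foundation project.
Keywords: dynamic update, knowledge system model, thematic information gathering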
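The abstract names association-rule-based query expansion followed by human screening, but this record does not give the details of the paper's improved algorithm. The following is only a minimal sketch of the generic idea, under stated assumptions: documents are represented as sets of terms, rules have the form "seed term -> candidate term", and a candidate is suggested when the rule's support and confidence pass thresholds. The function names `expand_keywords` and `review`, the thresholds, and the toy data are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations


def expand_keywords(documents, seed_terms, min_support=0.3, min_confidence=0.6):
    """Suggest new keywords using rules of the form seed -> candidate.

    documents  : list of sets of terms, one set per collected page (assumed input).
    seed_terms : set of terms already in the knowledge system.
    Returns (candidate, seed, support, confidence) tuples, best confidence first.
    """
    n_docs = len(documents)
    if n_docs == 0:
        return []

    term_counts = Counter()   # number of documents containing each term
    pair_counts = Counter()   # number of documents containing both terms of a pair
    for terms in documents:
        term_counts.update(terms)
        pair_counts.update(frozenset(p) for p in combinations(sorted(terms), 2))

    candidates = []
    for pair, count in pair_counts.items():
        a, b = tuple(pair)
        # Try the rule in both directions; keep it only if it leads
        # from a known seed term to a term not yet in the knowledge system.
        for seed, other in ((a, b), (b, a)):
            if seed not in seed_terms or other in seed_terms:
                continue
            support = count / n_docs                 # P(seed and other)
            confidence = count / term_counts[seed]   # P(other | seed)
            if support >= min_support and confidence >= min_confidence:
                candidates.append((other, seed, support, confidence))

    candidates.sort(key=lambda c: (-c[3], -c[2], c[0], c[1]))
    return candidates


def review(candidates, approve):
    """Semi-automatic screening: keep candidates a human reviewer approves.

    approve is a callback standing in for the manual decision (hypothetical).
    """
    return {cand for cand, *_ in candidates if approve(cand)}


# Toy data, invented for illustration only.
docs = [
    {"knowledge", "ontology", "crawler"},
    {"knowledge", "ontology", "query"},
    {"knowledge", "index"},
    {"crawler", "robot"},
]
suggestions = expand_keywords(docs, {"knowledge"}, min_support=0.5, min_confidence=0.6)
accepted = review(suggestions, approve=lambda term: term != "index")
```

In this sketch the automatic step only proposes terms; nothing enters the knowledge system until the reviewer callback accepts it, which mirrors the semi-automatic screening the abstract describes.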
