期刊文献+

面向茶学领域本体的概念自动提取方法研究

Research on Automatic Concept Extraction for Tea Domain Ontology
下载PDF
导出
摘要 针对目前手工构建本体耗时费力这一难题,以茶学领域知识为研究对象,提出了一种本体的概念自动提取方法。该方法利用中文分词技术对茶学语料进行切分,使用互信息技术从切分后的语料中得出候选概念(合成词)集合,通过判断候选概念和非合成词的领域相关性,自动提取出茶学领域本体概念。以该方法为基础开发了相应的原型系统,实验结果表明,该方法是有效的。 Aimed at reducing the time and labor required in current manual ontology construction,an automatic method of extracting ontology concept was proposed in this paper,based on the knowledge of tea domain.Firstly,tea domain corpus was processed by Chinese word segmentation,and the set of candidate concepts(compound words) was obtained by mutual information technology,and then the ontology concepts of tea domain were automatically extracted by judging the domain coherence of the candidate concepts.Experiment was performed by related prototype system,and the results proved that the method was effective.
出处 《农业网络信息》 2010年第8期13-15,24,共4页 Agriculture Network Information
基金 国家863计划项目"农业知识网格的构建及关键技术研究"(编号2006AA10Z249)
关键词 茶学本体 概念提取 互信息 领域相关性 tea domain ontology concept extraction mutual Information domain coherence
  • 相关文献

参考文献8

二级参考文献32

  • 1黄昌宁.中文信息处理中的分词问题[J].语言文字应用,1997(1):74-80. 被引量:83
  • 2耿骞,耿崇.利用词语共现进行Ontology的概念获取[J].现代图书情报技术,2006(2):43-45. 被引量:10
  • 3姜韶华,党延忠.基于长度递减与串频统计的文本切分算法[J].情报学报,2006,25(1):74-79. 被引量:14
  • 4肖红,许少华,李欣.具有三级索引词库结构的中文分词方法研究[J].计算机应用研究,2006,23(8):49-51. 被引量:16
  • 5张利,张立勇,张晓淼,耿铁锁,岳宗阁.基于改进BP网络的中文歧义字段分词方法研究[J].大连理工大学学报,2007,47(1):131-135. 被引量:12
  • 6Maedche A, Staab S. An ontology learning for the Semantic Web[J]. IEEE Intelligent Systems, 2001, 16(2) :72-79.
  • 7Philipp Cimiano, Johanna VSlker. Text2Onto-a framework for ontology learning and data-driven change discovery[ C] // Proceedings of the 10th International Conference on Applications of Natural Language to Information Systems (NLDB). Lecture Notes in Computer Science, Springer, Alicante, Spain, June 2005, 3513: 227-238.
  • 8Patrick Pantel, Dekang Lin. A statistical corpus-based term extractor[ C ]//Proceedings of AI. Ottawa, Canada: Lecture Notes in Artificial Intelligence, 2001: 36-46.
  • 9Buitelaar P, Olejnik D, Sintek M. OntoLT: A protege plugin for ontology extraction from text[ C]//Proceedings of the International Semantic Web Conference (ISWC). Florida,USA, 2003: 17-22.
  • 10Dan Crow, John DeSanto. A hybrid approach to concept extraction and recognition-based matching in the domain of human resources [ C ] // Proceedings of the 16^th IEEE International Conference on Tools with Artificial Intelligence (ICTAI) ,2004 : 535-541.

共引文献44

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部