期刊文献+

基于规则与统计的本体概念自动获取方法研究 被引量:10

Research on AUtomatic Ontological Concept Extraction Based on Rules and Statistics
下载PDF
导出
摘要 为获取中文领域本体的概念提出了基于规则匹配和统计方法相结合的学习模型,充分利用现有的自然语言处理技术和统计学习方法,从领域文本中通过语义串切分、规则匹配、领域归属度分析和概念约简算法自动获取领域概念。该方法解决了现有中文本体学习方法对领域词典的依赖以及无法获得短语式特定领域概念的问题,同时解决了领域概念筛选问题。实验证明了该方法的有效性。 A methodology based on rules and statistics is proposed to acquire Chinese ontological concepts. This methodology uses NLP techniques and statistical algorithms to extract domain-specific concepts from texts through semantic string segmentation, rules matching, analysis of domain consensus and relevance, and concept reduction. This methodology resolves current problems in Chinese ontology learning, such as dependency on domain lexicons, difficulties for phrasal concepts acquisition, and domain-specific concepts filtering, which can be demonstrated by experimental results.
作者 张新 党延忠
出处 《情报学报》 CSSCI 北大核心 2007年第6期813-820,共8页 Journal of the China Society for Scientific and Technical Information
基金 国家自然科学基金资助项目(项目编号:70431001,70271046).
关键词 领域本体构建 概念抽取规则匹配领域归属度 概念约简 domain ontology building, concept extraction, rules matching, domain consensus & relevance, concept reduction
  • 相关文献

参考文献14

  • 1Maedche A, Staab S. An ontology learning for the Semantic Web[J]. IEEE Intelligent Systems, 2001, 16(2) :72-79.
  • 2Philipp Cimiano, Johanna VSlker. Text2Onto-a framework for ontology learning and data-driven change discovery[ C] // Proceedings of the 10th International Conference on Applications of Natural Language to Information Systems (NLDB). Lecture Notes in Computer Science, Springer, Alicante, Spain, June 2005, 3513: 227-238.
  • 3Patrick Pantel, Dekang Lin. A statistical corpus-based term extractor[ C ]//Proceedings of AI. Ottawa, Canada: Lecture Notes in Artificial Intelligence, 2001: 36-46.
  • 4Buitelaar P, Olejnik D, Sintek M. OntoLT: A protege plugin for ontology extraction from text[ C]//Proceedings of the International Semantic Web Conference (ISWC). Florida,USA, 2003: 17-22.
  • 5Dan Crow, John DeSanto. A hybrid approach to concept extraction and recognition-based matching in the domain of human resources [ C ] // Proceedings of the 16^th IEEE International Conference on Tools with Artificial Intelligence (ICTAI) ,2004 : 535-541.
  • 6耿骞,耿崇.利用词语共现进行Ontology的概念获取[J].现代图书情报技术,2006(2):43-45. 被引量:10
  • 7Thomas R Gruber. A translation approach to portable ontologies[ J] . Knowledge Acquisition, 1993,5 (2) : 199- 220.
  • 8姜韶华,党延忠.基于长度递减与串频统计的文本切分算法[J].情报学报,2006,25(1):74-79. 被引量:14
  • 9李萍,黄崇岭.IT领域的专业术语构词特点及功能意义——从词汇层面分析IT领域的专业术语[J].桂林电子工业学院学报,2004,24(2):48-51. 被引量:2
  • 10ICTCLAS-汉语词法分析系统[OL].[2006-09-23].http ://www. nip. org. crdproject/project.php? proj-id = 6.

二级参考文献39

  • 1Halliday M A K. Expressions in the Functions of Language[M]. London : Edward Arnold, 1973.103-104.
  • 2Leech O N and Short M. Style in Fiction [M]. London:Longman, 1981.33.
  • 3Mukarovsky, Standard J. Literary Structure and Style [M].Washington:Geogetown University Press, 1998. 17-30.
  • 4Halliday M A K. Essays in Modern Stylistics [M]. New York Methuen, 1981. 325-360.
  • 5戴炜栋 何兆熊 华钧.简明英语语言学教程[M].上海:上海外语教育出版社,1998.20-32.
  • 6Halliday M A K. Expressions in the Functions of Language[M].London: Edward Arnold,1973.103-104.
  • 7Leech G N and Short M.Style in Fiction[M].London: Longman,1981.33.
  • 8Mukarovsky,Standard J.Literary Structure and Style[M].Washington: Geogetown University Press,1998.17-30.
  • 9Halliday M A K.Essays in Modern Stylistics[M].New York Methuen,1981.325-360.
  • 10Brian Roark and Eugene Charniak, Noun - phrase co - occurrence statistics for semi - automatic semantic lexicon construction. In : Proceedings of ACL-98, Montreal, Quebec, Canada, 1998

共引文献23

同被引文献163

引证文献10

二级引证文献55

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部