期刊文献+

一个全文词义自动标注系统的实现 被引量:3

Implement a full-text automatic system for word sense tagging
下载PDF
导出
摘要 为研究在给定上下文中如何确定多义词的词义,介绍了一种无指导的词义消歧技术和一个汉语全文词义标注系统的设计实现过程.该系统基于贝叶斯模型,使用大规模语料进行训练,较好地解决了知识获取中数据稀疏的问题.该系统具有标注正确率高和运行速度快等特点,适合大规模文本的词义标注工作. Word sense disambiguation has been a very active research topic in the NLP field, which studies how to determine which of the senses of an ambiguous word is invoked in a particular context using sense classifiers. This paper presents a technique for unsupervised word sense disambiguation and implements the process of a full - text word sense tagging system. This system performs word sense disambiguation based on the Nave Bayesian Model, uses largescale corpora as training data, and it is able to preferentially conquer the problem of Sparse Data in Knowledge Acquisition. In addition, this system has the characteristics of high accuracy and quick running speed. Thus, this system is competent for word sense tagging on large - scale, real - word text.
出处 《哈尔滨工业大学学报》 EI CAS CSCD 北大核心 2005年第12期1603-1605,1649,共4页 Journal of Harbin Institute of Technology
基金 国家自然科学基金资助重点项目(60435020)
关键词 词义 梢歧 自然语言处理 无指导学习算法 贝叶斯模型 依存文法 word sense disambiguation natural language processing unsupervised learning algorithm Nave-Bayesian Model dependency grammar
  • 相关文献

参考文献5

  • 1NANCY I, JEAN V. Introduction to the special issue on word sense disamibguation: The state of the art [ J ].Computational Linguistics, 1998, 24 ( 1 ): 1 -40.
  • 2DAGAN I, ITAI A. Two languages are more informative than one[ A]. Proceedings of the 29^th Annual Meeting of Association for Computation Linguistics [ C ]. Berkeley:Association for Computational Lintuistics, 1991.
  • 3YAROWSKY D. Word sense disambiguation using statistical methods of Roget's categories trained on large corpora [ A ]. Computation Linguistic' 92 [ C ]. Nantas: Association for Computational Linguistics, 1992. 454-460.
  • 4SCHUTZE H. Automatic word sense discrimination [ J ].Computational Linguistics, 1998,24( 1 ) :97 - 124.
  • 5鲁松,白硕,黄雄,张健.基于向量空间模型的有导词义消歧[J].计算机研究与发展,2001,38(6):662-667. 被引量:36

二级参考文献2

  • 1李娟子.汉语词义消歧方法研究:博士论文[M].北京:清华大学,1999..
  • 2李娟子,博士论文,1999年

共引文献35

同被引文献31

引证文献3

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部