期刊文献+

基于依存适配度的知识自动获取词义消歧方法 被引量:11

Word Sense Disambiguation Based on Dependency Fitness with Automatic Knowledge Acquisition
下载PDF
导出
摘要 针对困扰词义消歧技术发展的知识匮乏问题,提出一种基于依存适配度的知识自动获取词义消歧方法.该方法充分利用依存句法分析技术的优势,首先对大规模语料进行依存句法分析,统计其中的依存元组信息构建依存知识库;然后对歧义词所在的句子进行依存句法分析,获得歧义词的依存约束集合;并根据WordNet获得歧义词各个词义的各类词义代表词;最后,根据依存知识库,综合考虑词义代表词在依存约束集合中的依存适配度,选择正确的词义.该方法在SemEval 2007的Task#7粗粒度词义消歧任务上取得了74.53%的消歧正确率;在不使用任何人工标注语料的无监督和基于知识库的同类方法中,取得了最佳的消歧效果. A word sense disambiguation (WSD) method based on dependency fitness is proposed to solve the problem of knowledge acquisition bottleneck in the development of WSD techniques. The method achieves automatic knowledge acquisition in WSD by taking full advantage of dependency parsing. First, a large-scale corpus is parsed to obtain dependency cells whose statistics information is utilized to build a dependency knowledge base (DKB); then, the ambiguous sentence is parsed to obtain the dependency constraint set (DCS) of ambiguous words. For each sense of ambiguous word, sense representative words (SRW) are obtained through WordNet. Finally, based on DKB, dependency fitness of all kinds of SRW on DCS is computed to judge the right sense. Evaluation is performed on coarse-grained English all-words task dataset of SemEval 2007. Compared with unsupervised and knowledge-based methods which don't utilize any sense-annotated corpus, the proposed method yields state-of-the-art performance with F1-measure of 74.53%.
出处 《软件学报》 EI CSCD 北大核心 2013年第10期2300-2311,共12页 Journal of Software
基金 国家自然科学基金(61132009) 国家重点基础研究发展计划(973)(2013CB329303)
关键词 词义消歧 依存句法分析 知识获取 依存适配度 word sense disambiguation dependency parsing knowledge acquisition dependency fitness
  • 相关文献

参考文献3

二级参考文献115

共引文献37

同被引文献64

  • 1朱雪峰,金芝.关于软件需求中的不一致性管理[J].软件学报,2005,16(7):1221-1231. 被引量:24
  • 2施海虎,邢宣宇,李冬梅.面向主题会话的扩展语义框架[J].中文信息学报,2005,19(5):44-51. 被引量:3
  • 3罗贝,吴洁,曹存根,邵志清.从文本中获取植物知识方法的研究[J].计算机科学,2005,32(10):6-13. 被引量:13
  • 4卢志茂,刘挺,李生.统计词义消歧的研究进展[J].电子学报,2006,34(2):333-343. 被引量:27
  • 5魏伟.汉语离合词研究综述[J].锦州医学院学报(社会科学版),2006,4(4):80-83. 被引量:4
  • 6NAVIGLI R. Word sense disambiguation: a survey [ J]. ACM Com- puting Surveys, 2009, 41(2) : 1 -69.
  • 7CHAN Y S, NG H T. Scaling up word sense disambiguation via par- allel texts[ C]//AAAI 2005: Proceedings of the 20th National Con- ference on Artificial Intelligence. Menlo Park: AAAI Press, 2005, 3:1037 - 1042.
  • 8PILEHVAR M T, JURGENS D, NAVIGLI R. Align, disambignate and walk: a unified approach for measuring semantic similarity [ C]//Proceedings of the 51 st Annual Meeting of the Association for Computational Linguistics. Stroudsburg: Association for Computa- tional Linguistics, 2013, 1:1341 - 1351.
  • 9NAVIGLI R, PONZETTO S P. Joining forces pays off: Multilingnal joint word sense disambiguation[ C]// Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Stroudsburg: Asso- ciation for Computational Linguistics, 2012:1399 - 1410.
  • 10STEVENSON M, AGIRRE E, SOROA A. Exploiting domain infor- mation for word sense disambiguation of medical documents[ J]. Journal of the American Medical Informatics Association, 2012, 19 (2) : 235 - 240.

引证文献11

二级引证文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部