期刊文献+

基于最大熵的汉语介词短语识别研究 被引量:7

Identification of Chinese Prepositional Phrase
原文传递
导出
摘要 汉语介词短语识别的方法是基于最大熵的统计模型,通过最大熵的介词短语边界自动识别和依存语法错误校正两个处理阶段:先由最大熵模型对介词短语进行识别,然后利用依存树库中介词短语的左右边界词语的依存语法知识,对介词短语右边界的错误识别进行校正,完成了对经过分词和词性标注的句子进行介词短语界定的任务,为进一步的句法分析工作打下良好的基础。实验表明该方法是行之有效的。 This paper describes an automatic prediction model of Chinese prepositional phrase boundary location based on maximum entropy.It consists of two stages:first automatically identifying the phrase boundary by using the statistic of maximum entropy,and then post-tuning the results with dependent grammar knowledge.Firstly,the maximum entropy is applied to identifying the prepositional phrase,then the results are fine-tuned with dependent grammar knowledge generated by dependent treebank.Thus finishing the identification of Chinese prepositional phrase through the word segmented and word-of-speech tagged sentences,and laying a good foundation for the further analysis of the sentences.The experiment result indicates that the method is feasible and effective.
出处 《通信技术》 2010年第5期181-183,186,共4页 Communications Technology
基金 教育部科学技术重点资助项目(No.03081)
关键词 汉语介词短语 短语识别 最大熵 依存语法 Chinese prepositional phrase phrase identification maximum entropy dependence grammar
  • 相关文献

参考文献11

二级参考文献52

  • 1吕琳,周世斌,刘玉树.一种高性能英文词性标注器的设计与实现[J].北京理工大学学报,2005,25(10):876-879. 被引量:5
  • 2周强.汉语语料库的短语自动划分和标注研究.北京大学博士研究生学位论文[M].-,1996..
  • 3赵军.汉语基本名词短语识别及结构分析研究.清华大学工学博士学位论文[M].-,1998..
  • 4孙宏林.现代汉语非受限文本的实语块分析.北京大学博士研究生学位论文[M].-,2001..
  • 5[1]Abney Steven. Partial Parsing Via finite-state Cascades[C]. In Proceedings of the ESSLLI'96 Robust Parsing Workshop,1996.
  • 6[2]Cardie Claire,Pierce David. Error-driven Pruning of Treebank Grammars for Base Noun Phrase Identification[C]. In Proceedings of COLING-ACL'98, 1998.218-224.
  • 7[3]Eric Brill. Transformation-Based Error-Driven Learning and Natural Language Processing:A Case Study in Part-of-Speech Tagging[J].Computational Linguistics, 1995,21(4).
  • 8[4]Church K.A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text[C]. In Proceedings of the Second Conference on Applied Natural Language Processing, 1988.136-143.
  • 9[5]Voutilainen A. Nptool, a Detector of English Noun Phrases[C]. In Proceedings of the First Workshop on Very Large Corpora,1993.
  • 10[6]Walter Daelemans, Sabine Buchholz, Jorn Veenstra. Memory-Based Shallow Parsing[C]. The CoNLL-99 Workshop.

共引文献111

同被引文献64

引证文献7

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部