期刊文献+

基于中心驱动模型的宾州中文树库(CTB)句法分析 被引量:3

Parsing Penn Chinese treebank (CTB) with head-driven model
下载PDF
导出
摘要 报告了依托宾州中文树库进行句法分析研究的最新进展。以著名的中心驱动模型为基础,首次在宾州中文树库5.0上进行了句法分析实验。同前人的工作相比,这次实验取得了更加成功的结果,极大缩小了中、英文句法分析的差距。在公共的测试集上对句法分析器的性能进行了评价,对于正确分词和词性标注的句子,句法分析的精确率和召回率分别达到85.89%和85.61%。介绍了模型的实现过程,并进一步分析了模型中决策表和基本名词短语(BNP)两个关键环节在句法分析器中所起到的作用。本文的工作对于研制实用化句法分析系统具有一定参考价值。 This paper reports the new improvement of the work on parsing the Penn Chinese treebank (CTB), one of the most important technologies of Chinese information processing. The well-known head,driven model was applied to the new available CTB5.0 and the parsing experiment was performed for the first time. Compared with the previous work on CTB, the experiment achieved more promising result and greatly narrowed the performance gap between Chinese parsing and English parsing. The parser was evaluated on the standard test set with PARSEVAL metric. It performed with the precision of 85.89% and the recall rate of 85.61% on the sentences with gold segmentation and POS tagging. The construction of the parser was described, and the functions of the two important technologies that can significantly improve the parsing performance were analyzed. This work is referential to the development of Chinese parser for real applications.
出处 《高技术通讯》 CAS CSCD 北大核心 2007年第1期15-20,共6页 Chinese High Technology Letters
基金 国家自然科学基金(60302021、60375019)和863计划(2004AA117010-08)资助项目.
关键词 中心驱动模型 宾州中文树库 句法分析 结构模式识别 head-driven model, Penn Chinese treebank, parsing, syntactic pattern recognition
  • 相关文献

参考文献14

  • 1Uszkoreit H,Flickinger D,Kasper W,et al.Deep linguistic analysis with HPSG.In:Verbmobil:Foundations of speechto-speech translation.Heidelberg:Springer,2000.216-237
  • 2Zhou Q.A statistics-based Chinese parser.In:Proceedings of the 5th Workshop on Very Large Corpora.1997,4-15
  • 3Zhou M.A block-based dependency parser for unrestricted Chinese text.In:Proceedings of the 2nd Chinese Language Processing Workshop.2000,78-84
  • 4Zhang Y,Xu B,Zong C Q.Chinese syntactic parsing based on extended GLR parsing algorithm with PCFG *.In:Proceedings of the 19th International Conference on Computational Linguistics.2002,1308-1332
  • 5Xue N W,Xia F,Chiou F D,et al.The Penn Chinese treebank:phrase structure annotation of a large corpus.Natural Language Engineering,2004,10(4):1-30
  • 6Collins M.Head-driven statistical models for natural language parsing:[Ph.D.thesis].Pennsylvania:University of Pennsylvania,1999
  • 7Xia F.Automatic grammar generation from two different perspective:[Ph.D.thesis].Pennsylvania:University of Pennsylvania,1999
  • 8Bikel D,Chang D.Two statistical parsing models applied to Chinese treebank.In:Proceedings of the 2nd Chinese language processing workshop.Hong Kong,2000.1-6
  • 9Chiang D,Bikel D.Recovering latent information in treebanks.In:Proceedings of the 19th International Conference on Computational Linguistics.2002,183-189
  • 10Levy R,Manning C.Is it harder to parse Chinese,or the Chinese treebank? In:Proceedings of Association of Computational Linguistic.2003,439-446

同被引文献27

  • 1党政法,周强.短语树到依存树的自动转换研究[J].中文信息学报,2005,19(3):21-27. 被引量:12
  • 2冯志伟.自然语言处理中的概率语法[J].当代语言学,2005,7(2):166-178. 被引量:10
  • 3冀铁亮,穗志方.词汇化句法分析与子语类框架获取的互动方法[J].中文信息学报,2007,21(1):120-126. 被引量:3
  • 4周强.汉语语料库的短语自动划分和标注研究[D].北京:北京大学,2002.
  • 5CHENG Yu-ehang, ASAHARA M, MATSUMOTO Y. Machine learning-based dependency analyzer for Chinese [C] // MINGHUI D, HAIZHOU L, MIN Z, eds. Proceedings of the International Conference on Chinese Computing 2005. Singapore: COLIPS Publication, 2005:66-73.
  • 6XUE Nian-wen, XIA Fei, CHIOU Fu-dong, et al. The Penn Chinese Treebank.. phrase structure annotation of a large corpus [J]. Natural Language Engineering, 2005, 11 (2):207-238.
  • 7CHENG Yu-chang, ASAHARA M, MATSUMOTO Y. Chinese deterministic dependency analyzer: examining effects of global features and root node finder [C] // Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing. Korea: SIGHAN, 2005:17-24.
  • 8LIN De-kang. A dependency-based method for evaluating broad-coverage parsers [J]. Natural Language Engineering, 1998, 4(2): 97-114.
  • 9XIA Fei. Automatic grammar generation from two different perspectives [D]. Philadelphia: University of Pennsylvania, 1999.
  • 10CHOMSKY N. Remarks on nominalization [C] // JACOBS R, ROSENBAUM P, eds. Reading in English Transformational Grammar. Waltham (MA) :Ginn and Co. , 1970:184-221.

引证文献3

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部