期刊文献+

基于启发式搜索与预标注的中文CCG句法分析

CHINESE CCG PARSING BASED ON A*SEARCH AND SUPERTAGGING
下载PDF
导出
摘要 针对中文组合范畴语法(CCG)分析困难的特点,研究如何将两种彼此相互独立的技术共同应用在中文CCG句法分析上。首先使用预标注算法,使用对数线性模型通过去除那些概率较低的词汇范畴来对句子的潜在分析空间进行剪枝。然后应用启发式搜索算法进一步加速分析过程。最后从时间效率和分析精度两个维度对所使用的方法进行验证。实验表明,基于启发式搜索与预标注的句法分析算法可以显著地提高分析效率与分析精度。 Chinese CCG is difficult to parse, in light of this character, in the paper we investigate the way to integrate two independent techniques on Chinese CCG parsing. Firstly the supertagging is used, and by eliminating with log-linear model those words categories whose possibility is low, the latent parsing space of sentences is pruned, Secondly, A * search is applied to further accelerate the parsing procedure. At last the verifications are done on the approach used from the dimensions of both time efficiency and parsing accuracy. Experiments indicate that the parsing algorithm based on A * search and supertagging can significantly improve the efficiency and accuracy.
出处 《计算机应用与软件》 CSCD 北大核心 2014年第9期231-235,共5页 Computer Applications and Software
基金 国家自然科学基金项目(61003091)
关键词 中文句法分析 组合范畴语法 启发式搜索 预标注 Chinese parsing Combinatory categorial grammar (CCG) A * search Supertagging
  • 相关文献

参考文献15

  • 1Steednian M.The syntactic process[M].Cambridge:MlT Press,2000.
  • 2Hockenmaier J,Steedman M.Generative models for statistical parsingwith Combinatory Categorial Grammar[C]//Proceedings of 40th Annu-al Meeting of the Association for Computational Linguistics(ACL02),Philadelphia,2002:335-342.
  • 3Bangalore SjJoshi A K.Supertagging:An Approach to Almost Parsing[J].Computational Linguistics,1999,25(2):238-265.
  • 4Clark S,Curran J K.The importance of supertagging for wide-coverageCCG parsing[C]//Proceedings of 20th International Conference onComputational I linguistics(COLING(H),Geneva,2004:282-288.
  • 5Clark S,Curran J R.Wide-coverage efficient statistical parsing withCCG and log-linear models[J].Computational Linguistics,2007,33(4):493-552.
  • 6Hart P,Nilsson N,Raphael B.A formal basis for the heuristic determi-nation of minimum cost paths[J].Transactions on Systems Science andCybernetics,1968,4(2):100-107.
  • 7Xue N,Xia F,Chiou F D,et al.The Penn Chinese TreeBank:Phrasestructure annotation of a large corpus[J].Natural Language Engineer-ing,2005,11(2):207-238.
  • 8Tse D,Curran J K.Chinese CCGbank:extracting CCG derivations fromthe Penn Chinese Treebank[C]//Proceedings of 23rd InternationalConference on Computational Linguistics(COLINGIO),Beijing,2010:1083-1091.
  • 9Hockenmaier J.Data and Models for Statistical Parsing with Combinato-ry Categorial Grammar[D]. University of Edinburgh,2003.
  • 10宋彦,黄昌宁,揭春雨.中文CCG树库的构建[J].中文信息学报,2012,26(3):3-8. 被引量:12

二级参考文献1

共引文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部