期刊文献+

基于句子跨度的哈萨克语句法分析研究 被引量:1

Research of Kazakh parsing based on span
下载PDF
导出
摘要 由于目前哈萨克语句法分析准确率较低并缺乏基于神经网络的哈萨克语句法分析的相关研究,针对哈萨克语短语结构的句法分析,使用基于移进—归约的方法,采用在栈中存储句子跨度而不是部分树结构,从而在进行句法树解析时不需要对句法树进行二叉化。该研究在句子特征提取时使用双向LSTM对句子跨度特征进行提取,得到句子跨度在整个句子上下文中信息,再使用多层感知机对句法分析模型进行训练,最后在解码时使用动态规划选取最优句法分析结果;最终使得哈萨克语短语句法分析准确率达到了76.92%。研究成果对哈萨克语句法分析准确率有了进一步的提高,并为后续的哈萨克语机器翻译及语义分析奠定良好的基础。 Due to the low accuracy of Kazakh parsing and the lack of correlation research based on neural network Kazakh parsing,this paper focused on the parsing of Kazakh phrase structure,based on the shift-reduce method,but by the stack elements were sentence spans rather than partial tree,then it didn’t need to carry out the binary tree in parsing.The research used the bi-directional LSTM to extract the features of sentence span,and obtained the sentence span in the whole sentence context,using the multilayer perceptron to train the parsing model.In the end,the Kazakh parsing accuracy achieved 76.92%.The research results improved the accuracy of Kazakh parsing and built a good foundation for Kazakh machine translation and semantic analysis.
作者 柴伟 古丽拉·阿东别克 Chai Wei;Gulila Altenbek(College of Information Science&Engineering,Xinjiang University,Urumqi 830046,China;Xinjiang Laboratory of Multi-language Information Technology,Urumqi 830046,China;The Base of Kazakh&Kirghiz Language of National Language Resource Monitoring&Research Center on Minority Language,Urumqi 830046,China)
出处 《计算机应用研究》 CSCD 北大核心 2020年第3期731-733,753,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(61363062)。
关键词 双向LSTM 句子跨度 动态规划 Bi-LSTM span dynamic oracle
  • 相关文献

参考文献3

二级参考文献22

  • 1冯志伟.基于短语结构语法的自动句法分析方法[J].当代语言学,2000,2(2):84-98. 被引量:16
  • 2Booth T L,Thompson R A. Applying Probabihty Measures to Abstract Languages.IEEE Tmnsactiom on Computers, 1973, C-22(5) : 442-450.
  • 3D. Mckee,K.Krebsbach.A learning Natural Language Parser[J],2004. https://www2.1awrence.edu/fast/krebsbak/Research/Publications/ pdf/mics08-mckee.pdf.
  • 4周强.汉语句法知识的自动获取研究.中国中文信息学会二十周年学术会议,2001[C].
  • 5Stenven Bird, Ewan Klein Edward Loper [M].Natural Language Processing with Python. O'Reilly Media, Inc.2009:291-322.
  • 6Ahmad Al-Taani, Mohammed Msallam, Sana Wedian. A top- down chart parser for analyzing Arabic sentences [J]. Interna- tional Arab Journal of Information Technology, 2012, 9 (2):109-116.
  • 7Shihadeh Alqrainy, Hasan Muaidi, Mahmud S Alkoffash. Context-free grammar analysis for Arabic sentences [J]. Inter- national Journal of Computer Applications, 2012, 53 ( 3 ) : 7-10.
  • 8Krasimir Angelov, Peter Ljunglef. Fast statistical parsing with parallel multiple context-free grammars [C] //Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014: 368-376.
  • 9Duncan Mckee, Kurt Krebsbach. A leaming natural language par- ser [EB/OL]. [2015-04-09]. https://www2, lawrence, edu/fast/ krebsbak/Research/Publications/pdf/mics08-mckee. pdf.
  • 10Mark-Jan Nederhof, Martin McCaffery. Determinic parsing using PCFGs [C] //Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014: 338-347.

共引文献3

同被引文献1

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部