Abstract
This paper studies phrase-structure syntactic parsing of Kazakh with a two-stage, coarse-to-fine approach. In the first stage, a coarse parser generates the 20 best candidate trees for each sentence. In the second stage, a perceptron is trained on features extracted from those candidates and used to rerank them; the final parse is selected by weighting the first-stage candidate results and the reranking results in proportion. Across the two stages the method captures both the structural information of a sentence and detailed feature information, and it reaches a parsing accuracy of 71.4%.
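The second stage described in the abstract can be illustrated with a minimal sketch of perceptron reranking over an n-best list. Everything concrete here is an assumption made for illustration and is not taken from the paper: the sparse feature dictionaries, the function names `train_perceptron` and `rerank`, and the interpolation weight `alpha`, which is only one possible reading of combining the two stages "in proportion".

```python
from collections import defaultdict


def score(weights, features):
    """Dot product of a sparse feature dict with the weight vector."""
    return sum(weights.get(name, 0.0) * value for name, value in features.items())


def train_perceptron(training_sets, epochs=5):
    """Train a simple (unaveraged) reranking perceptron.

    training_sets: list of (candidates, gold_index) pairs, where candidates
    is a list of sparse feature dicts, one per n-best tree, and gold_index
    marks the candidate closest to the reference parse (hypothetical setup).
    """
    weights = defaultdict(float)
    for _ in range(epochs):
        for candidates, gold_index in training_sets:
            predicted = max(range(len(candidates)),
                            key=lambda i: score(weights, candidates[i]))
            if predicted != gold_index:
                # Move the weights toward the gold candidate's features
                # and away from the wrongly preferred candidate's features.
                for name, value in candidates[gold_index].items():
                    weights[name] += value
                for name, value in candidates[predicted].items():
                    weights[name] -= value
    return weights


def rerank(candidates, base_scores, weights, alpha=0.5):
    """Pick the candidate maximizing alpha * base + (1 - alpha) * reranker.

    base_scores are the first-stage parser scores; alpha is an assumed
    interpolation weight, not a value reported by the paper.
    """
    combined = [alpha * base + (1.0 - alpha) * score(weights, feats)
                for feats, base in zip(candidates, base_scores)]
    return max(range(len(combined)), key=combined.__getitem__)
```

With `alpha=1.0` the sketch falls back to the first-stage ranking, and with `alpha=0.0` it trusts the reranker alone; intermediate values blend the two.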
Authors
梁金莲
古丽拉·阿东别克
LIANG Jinlian; Gulila Altenbek (College of Information Science and Engineering, Xinjiang University, Urumqi, Xinjiang 830046, China; Xinjiang Laboratory of Multi-Language Information Technology, Xinjiang University, Urumqi, Xinjiang 830046, China; The Base of Kazakh and Kirghiz Language of National Language Resource Monitoring and Research Center on Minority Languages, Xinjiang University, Urumqi, Xinjiang 830046, China)
Source
《中文信息学报》
CSCD
Peking University Core Journals (北大核心)
2018, Issue 1, pp. 83-88 (6 pages)
Journal of Chinese Information Processing
Funding
National Natural Science Foundation of China (61363062)
Other project (NMLR201601)