摘要
广义最大似然比检测(GLR)算法模型翻译识别结果存在数据点重合的情况,精确度无法得到有效保障;为了准确地识别短语,设计了基于改进GLR算法的短语智能识别算法,该算法构建标记规模约74万个英汉单词的短语语料库,使短语具备可搜索功能,通过短语中心点构建短语结构,可获得词性识别结果,依据解析线性表的句法功能校正词性识别结果中的英汉结构歧义,最终获得识别的内容;综合的测评结果上看,基于改进GLR算法识别精度95%以上,综合得分92.3分,该算法克服了GLR的弊端,相对统计算法和动态记忆算法提高了运算速度和处理性能,更加适合机器翻译任务,为在智能机器翻译领域提供了新的思路。
Generalized maximum likelihood ratio detection(GLR)algorithm model translation recognition results have data points that overlap,and accuracy cannot be effectively guaranteed.In order to accurately identify phrases,an optimized GLR algorithm based on intelligent recognition is designed.This algorithm constructs a corpus of phrases with a scale of about 740,000 EnglishChinese words,makes the phrases searchable,and constructs the phrase structure through the center of the phrase to obtain part-of-speech recognition As a result,the ambiguity between English and Chinese structures in the part-of-speech recognition results was corrected according to the syntactic function of the analytical linear table,and finally the content of recognition was obtained.In terms of comprehensive evaluation results,based on the improved GLR algorithm,the recognition accuracy is more than 95%,and the comprehensive score is 92.3,the algorithm overcomes the disadvantages of GLR,improves the operation speed and processing performance relative to the statistical algorithm and dynamic memory algorithm,and is more suitable for machine translation tasks,providing a new way of thinking in the field of intelligent machine translation.
作者
党莎莎
龚小涛
Dang Shasha;Gong Xiaotao(School of General Education,Xi'an Aviation Vocational and Technical College,Xi'an 710089,China)
出处
《计算机测量与控制》
2020年第4期161-164,共4页
Computer Measurement &Control
关键词
智能识别
改进GLR算法
机器翻译
intelligent recognition
optimized GLR algorithm
machine translation