摘要
为了快速获得音译结果,借鉴人工处理方式,开发了基于规则的多语种音译软件.该软件采用算法和规则独立设计的思想,能满足多种语言的音译需求.完整的音译过程包括单词预处理、字母识别与切分、字母重组与定位、规则表查询四个步骤.在字母重组阶段提出一种确定最佳音节划分的方法,有效解决了音节划分错误较多的难题,从而保证了最终音译结果的质量.通过对英语、罗马语和俄语进行分组实验,经人工检查,音译正确率达到95%以上.
In order to obtain the transliteration results quickly, a multi-lingual transliteration software based on rules is designed on the basis of manual processing in this study. The software can meet the transliteration needs of various lingual, because algorithm and rule are designed separately. The complete transliteration process includes four steps of word pre-treating, letter recognition and segmentation, letter recombination and localization, and rule table searching. In the letter recombination, this study proposes a method of determining the best syllable division, which can reduce the error rate of syllable division effectively and improve the quality of final transliteration results. The results of the experiments for English, Roman, and Russian in this study show that the transliteration accuracy can reach to 95% or more.
作者
梁建飞
王军元
刘敏
LIANG Jian-Fei;WANG Jun-Yuan;LIU Min(61243 Troops,Xinjiang 830006,China)
出处
《计算机系统应用》
2018年第9期268-272,共5页
Computer Systems & Applications
关键词
规则表
多语种
音译
音节划分
双语地图
rule table
multilingual
transliteration
syllable division
bilingual map