摘要
该文介绍了哈萨克文专用字母■、■、■、■的特殊书写习惯,以及哈萨克文编码字符处理现状。指出当前广泛使用的字母替换法不符合国际和国家相关标准,并且会导致哈萨克文排序错误,增加文字转换、语音合成等功能的实现难度。为解决上述不足,对字母替换法进行了三个改进,包括用专用字母与符号"■"结合表示它们自己;专用字母各种书写形式带符号■的字形中,仅将独立字符形式带符号"■"的字形包含在OpenType字体中;用字形替换规则<calt>识别专用字母与哈萨克文字母不相邻的上下文环境。为便于改进方法的应用,该文介绍了与改进方法一致的OpenType字体字形替换规则设置方法。
This paper describes the special writing rules of the Kazakh letters ■,■,■ and ■,pointing out the current substitution method does not comply with international or national standards and obstructs Kazakh processing in text sorting,script conversion and speech synthesis.This paper proposed three improvements,i.e.1)representing the four special letters with the combination of themselves and character ■;2)include only isolated forms ■ with ■ in OpenType font;and 3) identifying the contexts that are not adjacent to the Kazakh letter based on the glyph substitute rulecaltin OpenType font.To facilitate the application of the above suggestions,this paper describes the set of the glyph substitution rules in OpenType font which is consistent with the improved method.
出处
《中文信息学报》
CSCD
北大核心
2017年第4期94-99,共6页
Journal of Chinese Information Processing
基金
中科院西部之光项目(YG2012114)
中科院仪器设备功能开发技术创新项目(YBXM-2014-04)