摘要
目前汉字字形描述方法存在的主要问题是缺少能涵盖一切可能汉字的可计算的字形形式化描述体系,从而造成汉字处理应用中的一系列障碍。本文给出了一种汉字网格字形描述方法,实验表明,该方法具有描述一切可能汉字字形(包括错字)骨架的能力,支持不同颗粒度的构字元素、结构关系等字形特征的自动提取和计算,为字形特征的自动分析处理提供了一种有效的手段,从而也为基于字形计算的各种应用建立了可靠的基础。
The main problem existing in current Chinese character glyph discriptions is the lack of a formal description for Chinese character glyphs which is computable and can cover all possible Chinese characters at the same time. This paper proposes a grid description approach for Chinese characters. Experiment result indicates that it can not only describe all possible Chinese character skeletons, including typos, but also provide great support for automatic extraction and computation using Chinese character glyph features with different particle size, such as strokes, radicals and structure relations. Therefore, this method establishs a reliable basis for a variety of applications based on computing of Chinese character glyph.
出处
《中文信息学报》
CSCD
北大核心
2008年第3期115-123,共9页
Journal of Chinese Information Processing
基金
国家自然科学基金资助项目(60272055,60572159)
关键词
计算机应用
中文信息处理
汉字字形
形式化描述
网格字形
特征计算
computer application
Chinese information processing
Chinese character glyph
formal description
grid glyph
feature computing