摘要
汉字的数学表达式是一种全新的汉字表示方法 通过对汉字部件特征的深入分析 ,利用图像处理技术对汉字数学表达式的自动生成做了探讨 选取了大约 5 0 0个基本汉字部件 ,提取了各部件的连通数、亏格数、端点数、折点数、连接点数、交叉点数以及NMI,HNMI ,VNMI值作为汉字部件的基本特征 ;并通过汉字连通区域的分割与合并进行汉字部件的划分和识别 ;最后 ,通过汉字结构的识别得到了汉字的数学表达式 实验中 ,汉字表达式自动生成的正确率为 92 % 这将在排版印刷、广告及包装设计。
The mathematical expression of Chinese characters is a novel method to describe Chinese characters Based on the analysis of components of Chinese characters,an algorithm of automatic generation of mathematical expression is presented in this paper Firstly, about 500 Chinese character components are chosen, and nine basic properties are selected for each of them: connectivity number, genus number, end number, turn number, joint number, cross number and NMI, HNMI, and VNMI; then by segmenting or combining connective regions, a Chinese character is divided into Chinese character components; finally, by recognizing the components and the Chinese character structure, the mathematical expressions are obtained Satisfying experimental result shows that the correct ratio of mathematical expressions is 92% Automatic generation of mathematical expressions will facilitate the management and transmission of Chinese information in the fields such as typesetting printing, advertising and package design, network transmission, Chinese mobile communication, and so on
出处
《计算机研究与发展》
EI
CSCD
北大核心
2004年第5期848-852,共5页
Journal of Computer Research and Development
基金
教育部科研重点基金项目 ( 0 2A0 5 6)
国家"九七三"重点基础研究发展规划基金项目 (TG19980 3 0 60 2 )
关键词
汉字
数学表达式
特征提取
部件识别
Chinese character
mathematical expression
feature extraction
component recognition