摘要
提出了一种用于字符预分类的模糊逻辑分析法 .对文本字符作印刷结构分析 ,给出了一个带有容差分析的文本行字符基线精确测定算法 ,其他有效参考线则是通过聚类分析而获得 .模糊逻辑用于确定各字符类的隶属值以保证字符的正确预分类 .实验结果表明 ,这种模糊印刷字符预分类法在 SUN 4 / 4 90工作站上每秒可有效地处理 10 4 以上字符 ,并对不同大小的字符和不同字体的处理结果令人满意 .
In this paper, a new fuzzy-logic approach is presented for character preclassif ication which gives a precise calculation method for the baseline detection algo rithm with tolerance analysis through analyzing the typographical structure of t extual blocks. Other virtual reference lines are extracted with clustering techn ique. In order to ensure correct character preclassification, a fuzzy-logic app roach is used to assign a membership to each typographical category for ambiguou s classes. The results prove that the proposed fuzzy typographical analysis for character preclassification is able to process to more than 10000 characters per second on a SUN 4/490 workstation and the method has been tested for different font sizes and different types with satisfaction.
出处
《软件学报》
EI
CSCD
北大核心
2000年第10期1397-1404,共8页
Journal of Software
基金
国家自然科学基金!(No.7870 0 12 )
江苏省教委留学回国人员科研基金!(No.1997- 15 - 5 1)
关键词
字符预分类
印刷字符
模糊逻辑
模糊分类
Character preclassification, typographical categorization, baseline detection, f uzzy logic, fuzzy classification.