Layout Image Analysis for Ancient Chinese Books Based on Local Outlier Factor and Wave Threshold
Abstract: Layout images of ancient Chinese books have a complex structure, which makes them harder to analyze automatically than ordinary printed layouts. Analyzing them effectively and accurately is a prerequisite for recognizing and retrieving ancient Chinese characters. This paper studies the key problems of layout analysis for ancient Chinese books. Based on an analysis and summary of their layout characteristics, it proposes a layout analysis method based on the local outlier factor (LOF) and a wave threshold. First, an LOF-based classification algorithm classifies the regions obtained by projection segmentation of the layout image and identifies candidate mixed regions that have segmentation problems. Second, the wave threshold is used to segment the parts of the candidate mixed regions where text adheres to frame lines. Finally, the text regions of the layout are determined and output. Experimental results show that the algorithm effectively separates text regions from frame-line regions in ancient book layouts, with layout classification and segmentation accuracies of 87.02% and 78.69%, respectively.
Authors: JIA Yun; TIAN Xue-dong; ZUO Li-na (School of Cyber Security and Computer, Hebei University, Baoding 071002, China)
Source: Science Technology and Engineering (《科学技术与工程》), Peking University Core Journal, 2020, No. 29, pp. 12021-12027 (7 pages)
Funding: Key Project of Science and Technology Research in Higher Education Institutions of Hebei Province, Hebei Provincial Department of Education (ZD2017208).
Keywords: ancient Chinese books; layout images; layout analysis; local outlier factor; wave threshold
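The abstract's first step, flagging candidate mixed regions by their local outlier factor, can be illustrated with a minimal pure-Python sketch. It implements the standard LOF definition only. The (width, height) region features, the neighbor count `k=3`, the cutoff `2.0`, and the names `lof_scores` and `regions` are all assumptions made for illustration; the abstract does not state which features or thresholds the authors use, and the wave-threshold step is not sketched because the abstract does not define it.

```python
import math


def lof_scores(points, k=3):
    """Return the local outlier factor (LOF) of every point.

    Standard LOF definition; ties among neighbors are broken by index.
    """
    n = len(points)
    # For each point, keep its k nearest neighbors as (distance, index).
    neigh = []
    for i in range(n):
        dists = sorted(
            (math.dist(points[i], points[j]), j) for j in range(n) if j != i
        )
        neigh.append(dists[:k])
    # k-distance: distance to the k-th nearest neighbor.
    k_dist = [nb[-1][0] for nb in neigh]
    # Local reachability density: inverse of the mean reachability distance.
    lrd = []
    for i in range(n):
        reach = [max(k_dist[j], d) for d, j in neigh[i]]
        mean_reach = max(sum(reach) / len(reach), 1e-12)  # guard duplicates
        lrd.append(1.0 / mean_reach)
    # LOF: mean neighbor density divided by the point's own density.
    return [
        sum(lrd[j] for _, j in neigh[i]) / (len(neigh[i]) * lrd[i])
        for i in range(n)
    ]


# Hypothetical (width, height) features of regions from projection
# segmentation; the last one mimics text merged with a frame line.
regions = [(40, 42), (41, 40), (39, 43), (42, 41), (40, 39), (38, 41), (41, 118)]
scores = lof_scores(regions, k=3)

# Assumed cutoff: regions scoring well above 1 become candidates.
candidates = [i for i, s in enumerate(scores) if s > 2.0]
print(candidates)
```

Scores near 1 mean a region is about as densely surrounded as its neighbors, while scores well above 1 mark regions that sit apart from the rest. In this toy data only the tall last region exceeds the assumed cutoff.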