期刊文献+

基于轮廓投影方法的文本图象偏斜纠正

Deflect Correction for Document Image Based on Its Schema Histogram
下载PDF
导出
摘要 印刷文献信息采集处理是文本信息处理应用 ,特别是数字化图书馆建设中十分繁重而又必须从事的工作 .由于目前广泛使用的字符光学识别系统 (OCR)无法对具有偏斜角度的扫描文本图象进行自动加工处理 ,所以需要大量的人工介入 ,即以手工方法纠正图象偏斜 .因为无法有效地进行扫描文本集的批量处理 ,所以难以提高处理效率 .针对这一问题 ,在讨论文本图象轮廓投影性质的基础上 ,利用其相关系数与文本偏斜角的统计依赖关系 ,构造了一种用于文本图象的自动偏斜纠正方法 . Using OCR tools to transform scanned document images into editable text files is a important way in printed documents processing, such as those in text retrieving applications and digital library projects. Nevertheless, the OCR systems that we generally employed can not work correctly and efficiently with document images having deflections. Trying to manipulate this deflection correction procedure automatically, We study the properties of the image's schema histogram and it's correlation series. The result shows that under a small angle of deflection (less than 8°),the horizontal correlation series varies negative exponentially with the angle of deflection. For this we construct a scheme that can adjust the deflection automatically depend on the image's histogram pattern.To do this, we first choose a non-deflected sample image from the image set to find its correlation series which is in turn used to construct the negative exponential function. This experiential function can be used to determine the deflection angles of the whole set of document image. Practically, this method has shown very good performance in automatic deflection correction.
作者 李存华
出处 《中国图象图形学报(A辑)》 CSCD 北大核心 2001年第10期984-987,共4页 Journal of Image and Graphics
关键词 文本图象 轮廓投影 行相关系数 偏斜纠正 印刷文献 文本信息处理 数字化图书馆 Document image, Histogram, Correlation series, Deflection rectify correction
  • 相关文献

参考文献3

  • 1[1]Witten Lan H., Moffat A, Bell T C. Managing gigabytes. (Second Edition)[M]. Morgan Kaufmann Pub. Inc. San Francisco,USA.1999:240~299.
  • 2[2]Govindaraju V S, Lam Niyogi W D. Newspaper image understanding[M]. New Delhi,India; Narosa Publishing House, 1990:375~84.
  • 3[3]Srihari S N. Document image understanding[A]. In: Proc. IEEE Computer Society Fall Joint Computer Conf[C]., Dallas, TX, 1992:87~96.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部