文档处理中背景字符的去除

Removing of Preprinted Characters in Document Image Processing

下载PDF

导出

摘要识别域图像的提取是文档自动处理系统中一个重要的预处理过程。在实际应用中,用户填写的信息常常与版面中的框线和背景字符存在交叠现象,严重影响了系统的性能。本文提出了基于点边距离分析的背景字符去除算法。首先通过灰度图像匹配的方法精定位背景字符子图像;然后利用形态学方法结合笔画的宽度信息对背景字符子图像进行二值化;最后分析像素点到边界距离的变化确定需要填充的像素位置,并通过形态学方法计算像素的填充值。实验采用了真实票据图像中的日期域,实验结果表明本文的方法获得了基本令人满意的效果,背景字符像素被成功去除。 Extraction of recognition item is an important preprocess procedure in a Document image analysis system. In reality, user fill-in data usually cross or touch the preprinted lines and characters, creating tremendous problems for the recognition engines. In this paper, we proposed a practical preprinted character removing method. Image matching algorithm is applied to locate the position of the preprinted character, and then the character image is binarized by mathematical morphology method combing with stroke width information. Last, the preprinted character is removed based on the varying of stroke contours. Experiment results on real-life check images demonstrate the efficient of the proposed method.

作者张重阳杨静宇李伟孙明明

机构地区南京理工大学计算机科学与技术系

出处《计算机科学》 CSCD 北大核心 2006年第8期229-231,共3页 Computer Science

基金电子信息产业发展基金(信部运[2003]446号)

关键词图像处理文档图像分析图像匹配二值化数学形态学 Image processing, Document image analysis, Image matching, Binarization, Mathematical morphology

分类号 TP317.2 [自动化与计算机技术—计算机软件与理论] TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献9

1Bin Yu, Jain A K. A generic system for form dropout. IEEE Trans. on Pattern Analysis and Machine Intelligence, 1996, 18(11): 1127-1134
2Tseng Yi-Hong, Lee Hsi-Jian. Interfered-character recognition by removing interfering-lines and adjusting feature weights. In:Proc.Fourteenth Int. Conf. on Pattern Recognition, Brisbane, Qld.Australia, 1998. 1865-1867
3张重阳,陈强,娄震,杨静宇.基于灰度图像的表格框线去除算法[J].计算机研究与发展,2005,42(4):635-639. 被引量：9
4Liang S, Ahmadi M, Shridhar M. Segmentation of handwritten interference marks using multiple directional stroke planes and reformalized morphological approach. IEEE Trans Image Process,1997, 6(8):1195-1202
5Ye Xiangyun,Cheriet M,Suen C Y. A generic method of cleaning and enhancing handwritten data from business forms. Document Analysis and Recognition, 2001, 4(2): 84-96
6罗钟铉,刘成明.灰度图像匹配的快速算法[J].计算机辅助设计与图形学学报,2005,17(5):966-970. 被引量：72
7孙远,周刚慧,赵立初,施鹏飞.灰度图像匹配的快速算法[J].上海交通大学学报,2000,34(5):702-704. 被引量：45
8Otsu N. A threshold selection method from grey-level histograms. IEEE Trans. Sys. , Man, Cybern, 1978, 8:62-66
9崔屹．图像处理与分析-数学形态学芳法及应用．北京：科学出版社，2000，4

二级参考文献21

1Fuh Chioushann，Image Vision Computing J，1998年，16卷，9-10期，677页
2Jia Xiaoguang，IEEE Trans Pattern Anal Machine Intell，1995年，17卷，12期，1167页
3Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen, et al.Extraction of bankcheck items by mathematical morphology. J.Doc. Anal. Recognit., 1999, 2(2): 53～66.
4Xiangyun Ye, Mohamed Cheriet, Ching Y. Suen. A generic method of cleaning and enhancing handwritten data from business forms. J. Doc. Anal. Recognit., 2001, 4(2): 84～96.
5Bin Yu, Jain, A. K. A generic system for form dropout. IEEE Trans. Pattern Analysis and Machine Intelligence. 1996, 18(11): 1127～1134.
6Jain-Shiue Chen, Din-Chang Tseng. Overlapped-character separation and reconstruction for table-form documents. Int'1 Conf. Image Processing, Lausanne, Switzerland, 1996.
7J.M. Gloger. Use of the hough transform to separate merged text/graphics in forms. Int'l Conf. 11th IAPR, Hague,Netherlands, 1992.
8S. Naoi, Y. Hotta, M. Yabuki, et al. Global interpolation in the segmentation of handwritten characters overlapping a border. The 1st Int'l Conf. Image Processing, Austin, TX, USA, 1994.
9Yi-Hong Tseng, Hsi-Jian Lee. Interfered-character recognition by removing interfering-lines and adjusting feature weights.Fourteenth Int' l Conf. Pattern Recognition, Brisbane, Qld,Austrialia, 1998.
10Jin-Yong Yoo, Min-Ki Kim, Sang Yong Ban, et al. Line removal and restoration of handwritten characters on the form documents.The 4th Int'l Conf. Document Analysis and Recognition, Ulm,Germany, 1997.

共引文献118

1赵宏伟,刘宇琦,程禹,刘君玲.基于相位相关的图像匹配算法[J].吉林大学学报（工学版）,2011,41(S1):183-188. 被引量：2
2刘兵全,何继善,李振伟,涂蓉.医学图像后处理研究进展[J].国外医学（生物医学工程分册）,2004,27(4):248-252. 被引量：9
3郭显久,李莉.基于细分小波与局部投影熵相结合的图像匹配算法[J].辽宁师范大学学报（自然科学版）,2005,28(1):59-63.
4罗钟铉,刘成明.灰度图像匹配的快速算法[J].计算机辅助设计与图形学学报,2005,17(5):966-970. 被引量：72
5张国华,梁中华.一种基于模板匹配的人民币纸币面额识别方法[J].沈阳工业大学学报,2005,27(4):439-442. 被引量：15
6平洁,殷润民.一种全景图快速生成算法及其实现[J].微计算机应用,2006,27(1):59-62. 被引量：5
7蔡晓东,叶培建.基于特征点集的匹配算法应用于卫星姿态确定[J].北京航空航天大学学报,2006,32(2):171-175. 被引量：4
8单宝明,徐启蕾.基于投影与KMP简约算法的一维快速模板匹配算法[J].青岛科技大学学报（自然科学版）,2006,27(2):176-178. 被引量：2
9岳永娟,苗立刚,彭思龙.大规模显微图像拼接算法[J].计算机应用,2006,26(5):1012-1014. 被引量：2
10余莉,王润生,韩方剑.多分辨率形态学目标检测[J].计算机辅助设计与图形学学报,2006,18(6):849-853. 被引量：5

1张丘,马利庄,高岩,陈志华.基于方向投影的票据图像倾斜检测方法[J].计算机应用,2004,24(9):50-51. 被引量：7
2张重阳,娄震,杨静宇.银行支票中小写金额图像的提取[J].中文信息学报,2003,17(2):42-47. 被引量：2
3张新红,张帆,张军亮.一种改进的二值图像质量评价方法[J].计算机工程与科学,2010,32(6):52-54. 被引量：3
4罗钟铉,刘成明.灰度图像匹配的快速算法[J].计算机辅助设计与图形学学报,2005,17(5):966-970. 被引量：72
5孙远,周刚慧,赵立初,施鹏飞.灰度图像匹配的快速算法[J].上海交通大学学报,2000,34(5):702-704. 被引量：45
6张重阳,杨静宇,张艳.支票大写金额图像分割策略[J].计算机工程,2006,32(24):4-5. 被引量：1
7李建江,张磊,李兴钢,陈翔,黄义双.CUDA架构下的灰度图像匹配并行算法[J].电子科技大学学报,2012,41(1):110-113. 被引量：15
8钟侠.基于Hough变换的票据图像倾斜校正[J].常州大学学报（自然科学版）,2012,24(2):69-72. 被引量：2
9邓淑玲.基于SPSS距离分析的财务管理应用探究[J].财会通讯（理财版）,2008(9):37-38. 被引量：2
10陈凯,吕文阁,谢庆华,张湘伟.基于竞选算法的灰度图像匹配研究[J].四川理工学院学报（自然科学版）,2014,27(5):68-71. 被引量：3

计算机科学

2006年第8期

浏览历史

内容加载中请稍等...

文档处理中背景字符的去除

参考文献9

二级参考文献21

共引文献118

相关作者

相关机构

相关主题

浏览历史