图像中多语种文本提取的高斯混合建模方法 (A Gaussian Mixture Modeling Method for Multilingual Text Extraction in Images). Cited by: 2

Gaussian Mixture Modeling of Neighbor Characters for Multilingual Text Extraction in Images

Abstract: A Gaussian mixture model of neighboring character regions is built to distinguish characters from non-characters, and on this basis a method is proposed for extracting multilingual text from images. In the training phase, the Gaussian mixture model of three neighbor characters is trained from examples. Text in an input image is then extracted in the following steps. First, the image is binarized using the edge-pixel clustering method, and a morphological closing operation is applied to the binary image so that each character becomes a single connected component. Second, the neighborhood relation between connected components is established from the Voronoi regions of their centroids; three connected components that neighbor each other constitute a neighbor set. For each neighbor set, an a posteriori pseudo-probability is computed within a Bayesian framework from the Gaussian mixture model of three neighbor characters and is used to decide whether the neighbor set is a case of three neighbor characters. Finally, each connected component is labeled as a character or a non-character by the following rule: if a connected component is included in at least one neighbor set classified as three neighbor characters, it is labeled as a character; otherwise it is labeled as a non-character. The proposed method is tested on Chinese and English text extraction. In the experiments, the expectation-maximization algorithm is employed to train the Gaussian mixture model of three neighbor characters. Extraction experiments on complex Chinese and English text achieve an accuracy above 97% and a recall above 80%, confirming the effectiveness of the method.
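The final labeling rule in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the neighbor sets (triples of connected-component indices) and one feature vector per triple have already been computed, uses a diagonal-covariance mixture, and thresholds the raw mixture density as a stand-in for the a posteriori pseudo-probability, whose exact form the abstract does not give. The names `diag_gaussian_pdf`, `gmm_density`, `label_components` and the `threshold` parameter are hypothetical.

```python
import math


def diag_gaussian_pdf(x, mean, var):
    """Density of a Gaussian with diagonal covariance at feature vector x."""
    log_p = 0.0
    for xi, mi, vi in zip(x, mean, var):
        log_p += -0.5 * (math.log(2.0 * math.pi * vi) + (xi - mi) ** 2 / vi)
    return math.exp(log_p)


def gmm_density(x, weights, means, variances):
    """Mixture density: weighted sum of the component Gaussian densities."""
    return sum(
        w * diag_gaussian_pdf(x, m, v)
        for w, m, v in zip(weights, means, variances)
    )


def label_components(num_components, neighbor_sets, features, gmm, threshold):
    """Apply the labeling rule from the abstract.

    A component is labeled a character (True) if it belongs to at least
    one neighbor set accepted as "three neighbor characters".
    Assumption: a neighbor set is accepted when the mixture density of
    its feature vector reaches `threshold` (a stand-in score).
    """
    weights, means, variances = gmm
    is_character = [False] * num_components
    for triple, x in zip(neighbor_sets, features):
        if gmm_density(x, weights, means, variances) >= threshold:
            for idx in triple:
                is_character[idx] = True
    return is_character


# Toy data: six components, three neighbor sets, 2-D features per set.
gmm = ([0.6, 0.4], [[0.0, 0.0], [1.0, 1.0]], [[0.1, 0.1], [0.2, 0.2]])
neighbor_sets = [(0, 1, 2), (1, 2, 3), (3, 4, 5)]
features = [[0.05, -0.02], [0.9, 1.1], [5.0, 5.0]]
labels = label_components(6, neighbor_sets, features, gmm, threshold=0.05)
print(labels)
```

In this toy run the first two neighbor sets lie near a mixture component and are accepted, so components 0 to 3 are labeled as characters. Component 3 also appears in the rejected third set, but one accepted set is enough under the rule. In practice the mixture parameters would come from EM training, as the abstract states.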

Authors: Fu Hui (付慧), Liu Xiabi (刘峡壁), Jia Yunde (贾云得)

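Affiliations: School of Information, Beijing Forestry University; School of Computer Science and Technology, Beijing Institute of Technology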

Source: Journal of Computer Research and Development (《计算机研究与发展》), indexed in EI, CSCD, and the Peking University Core Journals list; 2007, No. 11, pp. 1920-1926 (7 pages)

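Funding: National Natural Science Foundation of China (60473049); National "973" Key Basic Research Development Program (2006CB303105); Beijing Institute of Technology Excellent Young Teachers Funding Program (2006Y1202)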

Keywords: Gaussian mixture model; text extraction; binary image; multilingual; modeling method; Voronoi region; character region; connected component; document analysis; optical character recognition (OCR); image retrieval; Gaussian mixture modeling (GMM)

Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]


