期刊文献+

图像中多语种文本提取的高斯混合建模方法 被引量:2

Gaussian Mixture Modeling of Neighbor Characters for Multilingual Text Extraction in Images
下载PDF
导出
摘要 建立了相邻字符区域的高斯混合模型,用于区分字符与非字符.在此基础上,提出了一种从图像中提取多语种文本的方法.首先对输入图像进行二值化,并执行形态学闭运算,使二值图像中每个字符成为一个单独的连通成分.然后根据各连通成分重心的Voronoi区域,形成连通成分之间的邻接关系;最后在贝叶斯框架下,基于相邻字符区域的高斯混合模型计算相应的伪概率,以此为判据将每个连通成分标注为字符或非字符.利用所提出的文本提取方法,进行了复杂中英文文本的提取实验,获得大于97%的准确率和大于80%的召回率,证实了方法的有效性. A new method based on the Gaussian mixture modeling of neighbor characters is proposed to extract multilingual texts in images. In the training phase, the Gaussian mixture model of three neighbor characters is trained from the examples. Then the texts in an input image are extracted in the following steps. Firstly, the image is binarized using the edge-pixel clustering method and the morphological closing operation is performed on the binary image, in order that each character in it can be treated as a connected component. Secondly, the neighborhood of connected components is established according to the Voronoi partition of the image. Three connected components neighboring with each other constitute a neighbor set. For each neighbor set, a posteriori pseudo-probability is computed based on the Gaussian mixture model of three neighbor characters and used to classify the neighbor set as the case of three neighbor characters. Finally, the text extraction is completed by labeling the connected components as characters or non- characters with the following rule: if a connected component is included in at least one neighbor set classified as the case of three neighbor characters, then the connected component is labeled as a character, or else as a non-character. The proposed method are tested in the applications of Chinese and English text extraction. In the experiments, the expectation-maximization algorithm is employed to train the Gaussian mixture model of three neighbor characters. The experimental results of text extraction show the effectiveness of the method.
出处 《计算机研究与发展》 EI CSCD 北大核心 2007年第11期1920-1926,共7页 Journal of Computer Research and Development
基金 国家自然科学基金项目(60473049) 国家"九七三"重点基础研究发展规划基金项目(2006CB303105) 北京理工大学优秀青年教师资助计划基金项目(2006Y1202)~~
关键词 高斯混合模型 文本提取 二值图像 多语种 建模方法 Voronoi区域 字符区域 连通成分 document analysis optical character recognition (OCR) text extraction image retrieval Gaussian mixture modeling (GMM)
  • 相关文献

参考文献12

  • 1Keechul Jung,Kwang In Kim,Anil K Jain.Text information extraction in images and video:A survey[J].Pattern Recognition,2004,37(5):977-997
  • 2密聪杰,刘洋,薛向阳.基于多帧图像的视频文字跟踪和分割算法[J].计算机研究与发展,2006,43(9):1523-1529. 被引量:11
  • 3H Hase,T Shinokawa,M Yoneda,et al.Character string extrsction from color document[J].Pattern Recognition,2001,34(7):1349-1365
  • 4V Wu,R Manmatha,E M Riseman.TextFinder:An automatic system to detect and recognize text in images[J].IEEE Trans on Pattern Analysis Machine Intelligence,1999,21(11):1224-1229
  • 5Dong-Qing Zhang,Shih-Fu Chang.Learning to detect scene text using a higher-order MRF with belief propagation[C].In:Proc of the IEEE Conf on Computer Vision and Pattern Recognition Workshops (CVPRW' 04).Los Alamitos:IEEE Computer Society Press,2004.101-108
  • 6付慧,刘峡壁,贾云得.用于文本区域提取的边缘像素聚类方法[J].计算机辅助设计与图形学学报,2006,18(5):729-734. 被引量:6
  • 7Perry Moerland.A comparison of mixture models for density estimation[C].In:Proc of the 9th Int'l Conf on Artificial Neural Networks (ICANN'99).London:lEE Press,1999.25-30
  • 8J E Beasley,F Goffinet.A Delaunay triangulation-based heuristic for the Euclidean Steiner problem[J].Networks,1994,24(14):215-224
  • 9S M Lucas,A Panaretos,L Sosa,et al.Icdar 2003 robust reading competitions[C].In:Proc of the 7th Int'l Conf on Document Analysis and Recognition.Berlin:Springer-Verlag,2003.682-687
  • 10J A Bilmes.A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models[R].U C Berkeley,Tech Rep:97-021,1998

二级参考文献23

  • 1Jung Keechul,Kim Kwang In,Jain Anil K.Text information extraction in images and video:a survey[J].Pattern Recognition,2004,37(5):977-997
  • 2Jain A K,Yu B.Automatic text location in images and video frames[J].Pattern Recognition,1998,31(12):2055-2076
  • 3Sato T,Kanade T,Hughes E K,et al.Video OCR for digital news archive[C] //Proceedings of IEEE Workshop on Content based Access of Image and Video Databases,Bombay,India,1998:52-60
  • 4Wu V,Manmatha R,Riseman E M.TextFinder:an automatic system to detect and recognize text in images[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1999,21(11):1224-1229
  • 5Sin B,Kim S,Cho B.Locating characters in scene images using frequency features[C] //Proceedings of International Conference on Pattern Recognition,Quebec,2002,3:489-492
  • 6Mao W,Chung F,Lanm K,Siu W.Hybrid Chinese/English text detection in images and video frames[C] //Proceedings of International Conference on Pattern Recognition,Quebec,2002,3:1015-1018
  • 7Cheng Zhiguo,Liu Yuncai.Caption location and extraction in digital video based on SVM[C] //Proceedings of the 3rd International Conference on Machine Learning and Cybernetics,Shanghai,2004:3515-3519
  • 8Wang Rongrong,Jin Wanjun,Wu Lide.A novel video caption detection approach using multi-frame integration[C]//Proceedings of the 17th International Conference on Pattern Recognition,Cambridge,United Kingdom,2004:449-452
  • 9Tang Yuan Y,Lee Seong-Whan,Suen Ching Y.Automatic document processing:a survey[J].Pattern Recognition,1996,29(12):1931-1952
  • 10H Li, D Doermann. Automatic identification of text in digital video key frames [C]. In: Proc of the 14th Int'l Conf on Pattern Recognition. Los Alamitos, CA: IEEE Computer Society Press, 1998. 129-132

共引文献15

同被引文献13

引证文献2

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部