期刊文献+

最大稳定极值区域与笔画宽度变换的自然场景文本提取方法 被引量:18

A Natural Scene Text Extraction Method Based on the Maximum Stable Extremal Region and Stroke Width Transform
下载PDF
导出
摘要 针对从背景复杂、视角多变、语言形式多样的场景图像中难以准确提取文本信息的问题,提出了一种基于最大稳定极值区域(MSER)和笔画宽度变换(SWT)场景文本提取方法。该方法结合MSER、SWT算法的优点,采用MSER算法的准确检测文字区域,建立文本候选区域,利用SWT算法计算文本候选区域笔画宽度得到候选文本区域的笔画宽度;根据笔画宽度图,利用连通域标记建立笔画宽度连通图,然后根据笔画宽度连通图,建立笔画连通图的启发性规则,删除非文本候选区域,并根据文本的几何特征分析及局部自适应窗口最大类间方差(Otsu)分割,有效提取出自然场景图像中的文本,文本提取的准确率、召回率及综合性能分别为0.74、0.64及0.68。仿真实验结果表明,在文本视角多变,字符大小、尺寸、字体各异的复杂条件下,所提方法具有较好的鲁棒性,适用于多语言和多字体混合的场景文本提取。 To extract text information effectively from natural scene image with complex background,multi-orientation perspective and multilingual languages,a scenario text extraction method based on maximum stable extremal region(MSER)and stroke width transform(SWT)is presented.The method combines the merits of MSER and SWT algorithms.It establishes text candidate regions by utilizing MSER algorithm to detect text regions,and SWT algorithm is used to calculate the text stroke width of candidate region to get its stroke width.According to the stroke width graph,the stroke-connected graph is established by using connected component labeling.Then the heuristic rules of stroke-connected graph are established to remove non-text candidate regions according to the stroke-connected graph.By using the geometrical feature analysis and the local adaptive window Otsu segmentation,the text in natural scene images can be extracted effectively.The text extraction accuracy,recall rate and comprehensive performance of this method are 0.74,0.64 and 0.68,respectively.Simulation experiment shows that the method can achieve good robustness for complex background with multi-orientation perspective,variouscharacters and font sizes,and it is suitable for variety of languages and fonts.
作者 张国和 黄凯 张斌 符欢欢 赵季中 ZHANG Guohe HUANG Kai ZHANG Bin FU Huanhuan ZHAO Jizhong(School of Electronics and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China)
出处 《西安交通大学学报》 EI CAS CSCD 北大核心 2017年第1期135-140,共6页 Journal of Xi'an Jiaotong University
基金 国家自然科学基金资助项目(61306111)
关键词 自然场景图像 文本提取 最大极值稳定区域 笔画宽度变换 natural scene image text extraction maximum stable extremal region stroke width transform
  • 相关文献

参考文献1

二级参考文献12

  • 1Zhang Yi, Tan Kok Kiong. Text Extraction from Images Captured via Mobile and Digital Devices[J]. International Journal of Computational Vision and Robotics, 2009, 1(1): 34-58.
  • 2Suen H M, Wang Jhing Fa. Segmentation of Uniform Colored Text from Color Graphics Background[J]. IEE Proceedings on Vision, Image and Signal Processing, 1997, 144(6): 317-322.
  • 3Sauvola J, Pietiktiinen M. Adaptive Document Image Binariza- tion[J]. Pattern Recognition, 2000, 33(2): 225-236.
  • 4Kim K I, Jung K, Kim J H. Texture Based Approach for Text Detection in Images Using Support Vector Machines and Continuously Adaptive Mean Shift Algorithm[J]. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2003, 25(12): 1631- 1639.
  • 5Thillou C M, Gosselin B. Color Text Extraction with Selective Metric Based Clustering[J]. Computer Vision and Image Understanding, 2007, 107(1/2): 9%107.
  • 6Angadi S A. A Texture Based Methodology for Text Region Extraction from Low Resolution Natural Scene Images[/]. International Journal of Image Processing, 2009, 3(5): 184-251.
  • 7Angadi S A, Kodabagi M M. Image Decomposition Combining Staircase Reduction and Texture Extraction[J]. Journal of Visual Communication and Image Representation, 2007, 18(6): 464-486.
  • 8Lee S H, Seok J H, Min K M. Scene Text Extraction Using Image Intensity and Color Information[C]//Proc. of CCPR'09. Nanjing,China: [s. n.], 2009.
  • 9Phan T Q, Shivakumara E A Skeleton-based Method for Multi-oriented Video Text Detection[C]//Proc. of the 9th IAPR International Workshop on Document Analysis Systems. Boston, USA: [s. n.], 2010.
  • 10Maragos P, Schafer R. Morphological Skeleton Representation and Coding of Binary Images[J]. 1EEE Transactions on Acoustics, Speech and Signal Processing, 2003, 34(5): 1228-1244.

共引文献3

同被引文献111

引证文献18

二级引证文献48

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部