
Natural scene text detection based on multi-level MSER

Cited by: 10
Abstract A novel natural scene text detection method based on multi-level maximally stable extremal regions (MSER) is proposed. It consists of two stages: candidate region extraction and text detection. In the candidate region extraction stage, a multi-level MSER extraction technique is used: the original image is converted into multiple color spaces and rescaled to multiple scales, MSER detection is run on each transformed image with multiple thresholds, and all detected regions are kept as candidate character regions for text detection. In the text detection stage, hand-designed low-level features and convolutional neural network (CNN) based deep features are first extracted from each candidate region, and a trained random forest regressor then classifies the features to obtain character regions. The character regions are merged into candidate word regions, which go through a similar feature extraction and classification process to produce the final text detection result. The method was evaluated on two standard benchmark datasets, ICDAR2011 and ICDAR2013, and achieved an F-measure of 0.79 on each, which demonstrates the effectiveness of the proposed natural scene text detection method.
Source Journal of Zhejiang University: Engineering Science (indexed in EI, CAS, CSCD, and the Peking University Core list), 2016, No. 6, pp. 1134-1140, 7 pages
Funding National Natural Science Foundation of China (61073125, 61350004); Fundamental Research Funds for the Central Universities (HIT.NSRIF.2013091, HIT.HSS.201407)
Keywords scene text detection; multi-level maximally stable extremal regions (MSER); convolutional neural network (CNN); random forest regressor
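
The multi-level candidate extraction described in the abstract pools MSER detections over every combination of color space, image scale, and detection threshold. The sketch below shows only that pooling structure, under stated assumptions: the level values in `COLOR_SPACES`, `SCALES`, and `DELTAS` are hypothetical (the abstract does not list the ones the authors used), and `detect` is an assumed callback that stands in for the color conversion, resizing, and MSER detection steps. It is an illustration, not the authors' implementation.

```python
from itertools import product
from typing import Callable, Iterable, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height) in pixels

# Hypothetical levels: the abstract does not state the actual color spaces,
# scales, or MSER thresholds.
COLOR_SPACES = ("gray", "hsv", "lab")
SCALES = (0.5, 1.0, 2.0)
DELTAS = (2, 5, 8)


def multi_level_candidates(
    image,
    detect: Callable[[object, str, float, int], Iterable[Box]],
    color_spaces=COLOR_SPACES,
    scales=SCALES,
    deltas=DELTAS,
) -> List[Box]:
    """Pool candidate boxes from every (color space, scale, threshold) level.

    `detect(image, color_space, scale, delta)` is an assumed callback that
    converts and resizes the image, runs an MSER detector on it, and returns
    boxes in the coordinates of the resized image.
    """
    seen = set()
    pooled: List[Box] = []
    for color_space, scale, delta in product(color_spaces, scales, deltas):
        for x, y, w, h in detect(image, color_space, scale, delta):
            # Map the box back to the original image's coordinate frame.
            box = (round(x / scale), round(y / scale),
                   round(w / scale), round(h / scale))
            if box not in seen:  # drop exact duplicates across levels
                seen.add(box)
                pooled.append(box)
    return pooled
```

Passing the detector in as a callback keeps the sketch independent of any particular image library. Every pooled box would then go to the character classification stage; the exact-duplicate filter is a convenience of this sketch, not a step mentioned in the abstract.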

References (25 in total; first 10 listed)

  • [1] SHAHAB A, SHAFAIT F, DENGEL A. ICDAR 2011 robust reading competition challenge 2: reading text in scene images [C] // Proceeding of International Conference on Document Analysis and Recognition. Beijing: IEEE, 2011: 1491-1496.
  • [2] KARATZAS D, SHAFAIT F, UCHIDA S, et al. ICDAR 2013 robust reading competition [C] // Proceeding of International Conference on Document Analysis and Recognition. Washington: IEEE, 2013: 1484-1493.
  • [3] YE Q, DOERMANN D. Text detection and recognition in imagery: a survey [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2014, 37(7): 1480-1500.
  • [4] CHEN X, YUILLE A. Detecting and reading text in natural scenes [C] // Proceeding of IEEE Conference on Computer Vision and Pattern Recognition. Washington: IEEE, 2004: 366-373.
  • [5] WANG K, BABENKO B, BELONGIE S. End-to-end scene text recognition [C] // Proceeding of International Conference on Computer Vision. Barcelona: IEEE, 2011: 1457-1464.
  • [6] MISHRA A, ALAHARI K, JAWAHAR C. Top-down and bottom-up cues for scene text recognition [C] // Proceeding of IEEE Conference on Computer Vision and Pattern Recognition. Providence: IEEE, 2012: 2687-2694.
  • [7] JADERBERG M, VEDALDI A, ZISSERMAN A. Deep features for text spotting [C] // Proceeding of European Conference on Computer Vision. Zurich: Springer, 2014: 512-528.
  • [8] EPSHTEIN B, OFEK E, WEXLER Y. Detecting text in natural scenes with stroke width transform [C] // Proceeding of IEEE Conference on Computer Vision and Pattern Recognition. San Francisco: IEEE, 2010: 2963-2970.
  • [9] MATAS J, CHUM O, URBAN M, et al. Robust wide baseline stereo from maximally stable extremal regions [C] // Proceeding of British Machine Vision Conference. Cardiff: Elsevier, 2002: 761-767.
  • [10] HUANG W, LIN Z, YANG J, et al. Text localization in natural images using stroke feature transform and text covariance descriptors [C] // Proceeding of International Conference on Computer Vision. Sydney: IEEE, 2013: 1241-1248.
