期刊文献+

基于组件树和霍夫森林的场景文字检测识别

Text detection and recognition in natural scenes based on component tree and Hough forest
下载PDF
导出
摘要 自然场景中的文字检测与识别是图像理解中的重要部分,在大部分的系统设计中,检测和识别被看成是孤立的两部分进行处理,本文提出使用多类霍夫森林建立一个统一的检测识别框架。同时为了解决霍夫森林在类别增多时识别率下降,以及在尺度多变的情况下定位偏移的问题,文中提出利用组件树提取出具有层级的连通域,同时针对文字本身的特征建立分类器。通过级联该分类器,提取出文本的候选位置并确定目标的尺度大小,为后级精细的定位和识别奠定基础。实验结果显示该方案在检测和识别方面均与目前最优的方案具有竞争性。 Text detection and recognition in natural scenes play an important role in image understanding. In most of current system design, detection and recognition are isolated and processed separately. A unified framework for detection and recognition based on multi-class Hough forest is proposed. In order to improve the performance when the quantity of classes increases, as well as improve accuracy with uncertain scale, component tree is used for extracting connected component with hierarchy, while a set of features based on text characteristics is extracted and feed to a classifier. With the help of the classifier, the scale of the target is determined and all candidate texts are located, which build the foundation of subsequent stage for fine positioning and recognition. Experiments show that the scheme is competitive with current optimal solutions in both detection and recognition.
出处 《电子设计工程》 2016年第20期178-181,185,共5页 Electronic Design Engineering
关键词 组件树 霍夫森林 图像理解 文字检测 文字识别 component tree Hough forest image understanding text detection text recognition
  • 相关文献

参考文献15

  • 1Duda R O, Hart P E. Use of the Hough transformation to detect lines and curves in pictures[J]. Conmlunications of tile ACM,1972,15(I): 11-15.
  • 2Ballard D H. Generalizing the Hough transform to detect ar- bitrary shapes[J]. Pattern recognilion, 1981,13(2): 111-122.
  • 3Gall J, Lempitsky V. Class-specific hough forests fir object detection [M}//Computer Vision and Pattern Recognition (CVPR) ,2009: 1022-1029.
  • 4Gall J, Yao A, Razavi N, et al. Hough forests for object de- tection, tracking, and action recognition [J]. IEEE Transac- tions on Pattern Analysis and Machine Intelligence,2011, 33(11 ): 2188-2202.
  • 5Koo H I, Kim D H. Scene text detection via connected com- ponent elustering and nontextfihering [J].IEEE Transactions on Image Processing, 2013,22 (6): 2296-2305.
  • 6Matas J, Chum O, Urban M, et al. Robust wide-baseline stereo from maximally stable extrenml regions [J]. Image and vision computing,2004,22( 10): 761-767.
  • 7Chen H, Tsai S S,Schrith G,et al. Robust text detection in natural images with edge-enhanced maximally stable ex- tremal regions [C]//lmage Processing (ICIP), 2011: 2609- 2612.
  • 8Neumann L, Matsa J. A method for text localization and recognition in real-world images [C]//Asian Conference of Computer Vision (ACCV),2010: 770-783.
  • 9NistO.r D, Stewnius H. Linear time maximally stable ex-tremal regions [C]//Computer Vision-ECCV, 2008: 183-196.
  • 10Freund Y, Schapire R E. A desicion-theoretic generalization of on-line learning and an application to boosting[C]//Compu- tational learning theory, 1995: 23-37.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部