基于组件树和霍夫森林的场景文字检测识别

Text detection and recognition in natural scenes based on component tree and Hough forest

下载PDF

导出

摘要自然场景中的文字检测与识别是图像理解中的重要部分,在大部分的系统设计中,检测和识别被看成是孤立的两部分进行处理,本文提出使用多类霍夫森林建立一个统一的检测识别框架。同时为了解决霍夫森林在类别增多时识别率下降,以及在尺度多变的情况下定位偏移的问题,文中提出利用组件树提取出具有层级的连通域,同时针对文字本身的特征建立分类器。通过级联该分类器,提取出文本的候选位置并确定目标的尺度大小,为后级精细的定位和识别奠定基础。实验结果显示该方案在检测和识别方面均与目前最优的方案具有竞争性。 Text detection and recognition in natural scenes play an important role in image understanding. In most of current system design, detection and recognition are isolated and processed separately. A unified framework for detection and recognition based on multi-class Hough forest is proposed. In order to improve the performance when the quantity of classes increases, as well as improve accuracy with uncertain scale, component tree is used for extracting connected component with hierarchy, while a set of features based on text characteristics is extracted and feed to a classifier. With the help of the classifier, the scale of the target is determined and all candidate texts are located, which build the foundation of subsequent stage for fine positioning and recognition. Experiments show that the scheme is competitive with current optimal solutions in both detection and recognition.

作者苏江房涛王晓明仵媛媛高博

机构地区国网陕西省电力公司信息通信公司

出处《电子设计工程》 2016年第20期178-181,185,共5页 Electronic Design Engineering

关键词组件树霍夫森林图像理解文字检测文字识别 component tree Hough forest image understanding text detection text recognition

分类号 TN99 [电子电信—信号与信息处理]

引文网络
相关文献

参考文献15

1Duda R O, Hart P E. Use of the Hough transformation to detect lines and curves in pictures[J]. Conmlunications of tile ACM,1972,15(I): 11-15.
2Ballard D H. Generalizing the Hough transform to detect ar- bitrary shapes[J]. Pattern recognilion, 1981,13(2): 111-122.
3Gall J, Lempitsky V. Class-specific hough forests fir object detection [M}//Computer Vision and Pattern Recognition (CVPR) ,2009: 1022-1029.
4Gall J, Yao A, Razavi N, et al. Hough forests for object de- tection, tracking, and action recognition [J]. IEEE Transac- tions on Pattern Analysis and Machine Intelligence,2011, 33(11 ): 2188-2202.
5Koo H I, Kim D H. Scene text detection via connected com- ponent elustering and nontextfihering [J].IEEE Transactions on Image Processing, 2013,22 (6): 2296-2305.
6Matas J, Chum O, Urban M, et al. Robust wide-baseline stereo from maximally stable extrenml regions [J]. Image and vision computing,2004,22( 10): 761-767.
7Chen H, Tsai S S,Schrith G,et al. Robust text detection in natural images with edge-enhanced maximally stable ex- tremal regions [C]//lmage Processing (ICIP), 2011: 2609- 2612.
8Neumann L, Matsa J. A method for text localization and recognition in real-world images [C]//Asian Conference of Computer Vision (ACCV),2010: 770-783.
9NistO.r D, Stewnius H. Linear time maximally stable ex-tremal regions [C]//Computer Vision-ECCV, 2008: 183-196.
10Freund Y, Schapire R E. A desicion-theoretic generalization of on-line learning and an application to boosting[C]//Compu- tational learning theory, 1995: 23-37.

1杨飞.自然场景图像中的文字检测综述[J].电子设计工程,2016,24(24):165-168. 被引量：12
2褚晶辉,董越,吕卫.基于小波变换的文字检测与提取方法[J].电视技术,2014,38(3):182-185.
3李敏花,柏猛.基于数学形态学的复杂背景图像文字检测方法[J].计算机工程,2012,38(4):165-167. 被引量：3
4彭艳兵,关韵竹.基于区域特征与支持向量机的场景文字定位算法[J].计算机与现代化,2016(12):87-91. 被引量：1
5胡正仪,王延平.用小波锥分解提取旋转和尺度不变特征[J].武汉大学学报（自然科学版）,1994,40(6):45-50.
6人教版物理九年级第一阶段目标检测（B）[J].新课程（中考全程检测）（人教版物理）,2009(1):8-10.
7李楠.一种小波变换与维纳滤波结合的语音抗噪研究[J].电声技术,2007,31(5):46-48. 被引量：4
8秦晓伟,郭建中.K-SVD算法的超声图像加性噪声去噪研究[J].陕西师范大学学报（自然科学版）,2012,40(6):42-46. 被引量：2
9孟哲.基于小波变换的多尺度多阈值语音增强方法[J].武汉理工大学学报（交通科学与工程版）,2001,25(2):209-212. 被引量：4
10朱家兵,陶亮,江有名,洪一.一种新的高分辨率SAR图像相干斑噪声抑制算法[J].现代雷达,2005,27(11):54-57. 被引量：4

电子设计工程

2016年第20期

浏览历史

内容加载中请稍等...

基于组件树和霍夫森林的场景文字检测识别

参考文献15

相关作者

相关机构

相关主题

浏览历史