
Visual Saliency-Based Detection of Text Region in Natural Scene Images (cited by: 3)
Abstract: Extracting text from images captured in natural scenes aids the content analysis of those images. Because text in an image is usually salient within its local region, this paper proposes a visual saliency model based on multi-scale bounding boxes and uses it to design a candidate text detection method that fuses edge and texture information. First, an image homogeneity measure based on edge and texture information is constructed in the Lab color space and used to map the image into the homogeneity space. Next, the multi-scale bounding box saliency model is applied to compute the homogeneity mean image in the Lab color space. Finally, the weighted Euclidean distance between the homogeneity-mapped image and the homogeneity mean image is computed and taken as the saliency measure for extracting text regions. Experiments on natural scene images show that, compared with text detection methods that rely only on edge detection or only on homogeneity mapping, the proposed method suppresses background interference more effectively, which helps to further separate text regions from the background and to locate text more accurately.
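The last two steps of the abstract (a local mean image at several scales, then a weighted Euclidean distance used as the saliency measure) can be illustrated with a minimal sketch. The abstract gives no formulas, so everything here is an assumption: the "bounding boxes" are taken to be square windows of several radii, the distances are averaged over scales, and the names `box_mean`, `box_saliency`, `radii`, and `weights` are hypothetical. The input `feature_map` stands in for the homogeneity-mapped image; how the paper builds it from edge and texture information is not reproduced.

```python
import numpy as np


def box_mean(channel, radius):
    """Mean of `channel` over a (2*radius+1)^2 window, clipped at the borders."""
    h, w = channel.shape
    # Integral image with a zero row and column prepended.
    ii = np.zeros((h + 1, w + 1), dtype=np.float64)
    ii[1:, 1:] = channel.cumsum(axis=0).cumsum(axis=1)
    ys, xs = np.arange(h), np.arange(w)
    y0 = np.clip(ys - radius, 0, h)
    y1 = np.clip(ys + radius + 1, 0, h)
    x0 = np.clip(xs - radius, 0, w)
    x1 = np.clip(xs + radius + 1, 0, w)
    # Window sums from four integral-image lookups per pixel.
    total = ii[y1][:, x1] - ii[y0][:, x1] - ii[y1][:, x0] + ii[y0][:, x0]
    area = (y1 - y0)[:, None] * (x1 - x0)[None, :]
    return total / area


def box_saliency(feature_map, radii=(2, 4, 8), weights=(1.0, 1.0, 1.0)):
    """Average, over several window sizes, of the weighted Euclidean distance
    between each pixel's feature vector and its local mean (assumed form)."""
    feature_map = np.asarray(feature_map, dtype=np.float64)
    w = np.asarray(weights, dtype=np.float64)
    saliency = np.zeros(feature_map.shape[:2])
    for r in radii:
        mean = np.stack(
            [box_mean(feature_map[..., c], r) for c in range(feature_map.shape[2])],
            axis=-1,
        )
        saliency += np.sqrt((w * (feature_map - mean) ** 2).sum(axis=-1))
    return saliency / len(radii)
```

In this sketch a pixel scores high when its feature vector differs from the mean of its surroundings at several window sizes, which matches the abstract's premise that text is locally salient. A uniform region scores zero at every scale.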
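Source: Journal of South China University of Technology (Natural Science Edition), 2012, Issue 8, pp. 39-45 (7 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (61005061, 60873078); Natural Science Foundation of Guangdong Province (9251064101000010); Guangdong Province Science and Technology Research Project (2010B050400006, 2010B010600016); Fundamental Research Funds for the Central Universities, South China University of Technology (2012ZZ0067).
Keywords: text detection; visual saliency; homogeneity; image segmentation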

