Abstract
Natural scene images contain a wealth of visual and textual information, and the text in them carries important semantic information. Automatically detecting and recognizing text in natural scenes is an important research topic in pattern recognition and text information processing. This paper proposes an effective method for locating text in scene images. First, edge detection is used to coarsely locate candidate text regions. Next, gray-level detection is applied to the located regions to determine the positions of the characters within them. Finally, the detected regions are filtered to remove noise regions, yielding the target text regions. Experimental results show that the method is robust to variations in font size, style, color, and text orientation, and that it can accurately locate and extract text from natural scenes.
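The three stages named in the abstract (coarse localization by edge detection, a gray-level check inside each candidate, and filtering of noise regions) can be illustrated with a minimal sketch. This is not the authors' implementation: the central-difference edge operator, the connected-component grouping, the max-minus-min contrast test, the function names, and all threshold values are assumptions chosen only to make the pipeline concrete.

```python
from collections import deque

def edge_map(img, thresh):
    """Mark pixels whose horizontal or vertical intensity jump exceeds thresh."""
    h, w = len(img), len(img[0])
    edges = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Central differences with clamping at the image border.
            gx = abs(img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)])
            gy = abs(img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x])
            edges[y][x] = max(gx, gy) > thresh
    return edges

def candidate_boxes(edges):
    """Group 8-connected edge pixels; return bounding boxes (x0, y0, x1, y1)."""
    h, w = len(edges), len(edges[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if not edges[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(x, y)])
            x0 = x1 = x
            y0 = y1 = y
            while queue:
                cx, cy = queue.popleft()
                x0, x1 = min(x0, cx), max(x1, cx)
                y0, y1 = min(y0, cy), max(y1, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < w and 0 <= ny < h
                                and edges[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((nx, ny))
            boxes.append((x0, y0, x1, y1))
    return boxes

def gray_contrast(img, box):
    """Gray-level check (assumed form): intensity range inside the box."""
    x0, y0, x1, y1 = box
    values = [img[y][x] for y in range(y0, y1 + 1) for x in range(x0, x1 + 1)]
    return max(values) - min(values)

def locate_text(img, edge_thresh=60, min_contrast=80, min_area=6):
    """Coarse edge-based localization, then gray-level and size filtering."""
    kept = []
    for box in candidate_boxes(edge_map(img, edge_thresh)):
        x0, y0, x1, y1 = box
        area = (x1 - x0 + 1) * (y1 - y0 + 1)
        # Drop tiny or low-contrast candidates as noise (thresholds are assumptions).
        if area >= min_area and gray_contrast(img, box) >= min_contrast:
            kept.append(box)
    return kept

# Toy 8x12 image: bright background, one dark "character" block, one faint speckle.
demo = [[200] * 12 for _ in range(8)]
for row in range(2, 5):
    for col in range(3, 8):
        demo[row][col] = 20
demo[6][10] = 130
print(locate_text(demo))  # [(2, 1, 8, 5)]
```

In the toy image the faint speckle still produces edge pixels, so it survives the coarse stage, but its low gray-level contrast removes it in the filtering stage. A real system would use a stronger edge operator and text-specific criteria for the gray-level and filtering steps.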
Source
《中国图象图形学报》
CSCD
Peking University Core Journals (北大核心)
2013, No. 12, pp. 1601-1609 (9 pages)
Journal of Image and Graphics
Funding
Natural Science Foundation of Inner Mongolia (2012MS0902)
Open Project of the State Key Laboratory of ASIC and System, Fudan University (11KF005)
Keywords
natural scene image
text localization
edge detection
gray-based detection