Abstract
Text in color images carries important semantic information for image indexing and retrieval. This paper proposes a connected-component-based algorithm for automatic text localization in color images. First, the color image is decomposed into multiple channels, yielding one binary image for each of a small number of representative colors. Connected component labeling is then performed on each binary image, and the properties of each connected component are computed. Next, prior knowledge about these properties is used to eliminate non-text connected components; the remaining components are kept as candidate text regions. Finally, text lines are extracted from the candidate regions according to the alignment regularities of text lines. Experimental results show that the proposed algorithm localizes text regions quickly and with high accuracy compared with the other methods tested.
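The labeling, filtering, and alignment steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the 4-connectivity choice, every threshold (`min_h`, `max_h`, `max_aspect`, `min_fill`, `min_count`), and the vertical-overlap grouping rule are assumptions made for illustration. The color decomposition step is omitted, so the input is assumed to be one already-binarized channel given as a list of rows of 0/1 values.

```python
from collections import deque


def label_components(binary):
    """Label 4-connected foreground regions; return bounding box and area of each."""
    h = len(binary)
    w = len(binary[0]) if h else 0
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if not binary[y][x] or seen[y][x]:
                continue
            # Breadth-first flood fill starting from an unvisited foreground pixel.
            seen[y][x] = True
            queue = deque([(y, x)])
            top, bottom, left, right, area = y, y, x, x, 0
            while queue:
                cy, cx = queue.popleft()
                area += 1
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < h and 0 <= nx < w and binary[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            comps.append({"top": top, "bottom": bottom,
                          "left": left, "right": right, "area": area})
    return comps


def is_text_candidate(c, min_h=3, max_h=40, max_aspect=5.0, min_fill=0.2):
    """Apply hypothetical prior-knowledge limits on size, aspect ratio, and fill."""
    height = c["bottom"] - c["top"] + 1
    width = c["right"] - c["left"] + 1
    fill = c["area"] / (height * width)          # foreground share of the bounding box
    aspect = max(width / height, height / width)  # elongation in either direction
    return min_h <= height <= max_h and aspect <= max_aspect and fill >= min_fill


def group_into_lines(cands, min_count=3):
    """Chain candidates left to right when their vertical extents overlap."""
    lines = []
    for c in sorted(cands, key=lambda c: c["left"]):
        for line in lines:
            ref = line[-1]
            overlap = min(c["bottom"], ref["bottom"]) - max(c["top"], ref["top"]) + 1
            if overlap > 0:
                line.append(c)
                break
        else:
            lines.append([c])
    # Keep only groups with enough members to plausibly form a text line.
    return [ln for ln in lines if len(ln) >= min_count]
```

In a full pipeline, these three functions would be run once per binary image produced by the color decomposition, and the resulting text lines merged across channels.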
Source
《无线电通信技术》
2008, No. 5, pp. 46-48 (3 pages)
Radio Communications Technology
Keywords
connected component
text localization
alignment analysis
color reduction