期刊文献+

基于边缘和颜色的视频文本图像分割方法 被引量:5

Video Text Image Segmentation Method Based on Edge and Color Features
下载PDF
导出
摘要 视频中的文本如果直接送入OCR软件,识别率较低,因为文本往往叠加在复杂背景中,所以需要先将文本从背景中分割出来。背景像素可能具有和文本像素相似的颜色,并且由于解压缩的影响,文本像素颜色分布可能具有渐变性,给分割带来一定的困难。针对这些问题,提出一种基于文本边缘和颜色特征的文本分割方法,该方法首先利用文本边缘的高频特性沿文本轮廓对图像的颜色分布进行采样;其次使用K-均值空间聚类方法从采样点集合得到图像分割的种子点和分割半径,从而分割文本图像得到不同的分割结果;最后,利用文本笔画的连通域特征挑选出正确的分割结果。实验表明,该方法较好的解决了视频文本和背景的分离问题,分割结果具有较高的OCR识别率。 Before video text images are input to OCR software, text should be separated from background, because video texts often are embedded in complex background. There are two problems: one is that the color of background pixels may be similar with the color of text pixels, the other one is that the color of text pixels has variance caused by video compression and decompression. To solve the two problems, a new text image segmentation algorithm was introduced based on text edge and color features. First, the sample pixels set was got according to high frequency edge information of text. Second, K-means clustering method was applied to get segmentation seed pixels and radius, then segment text image into several text candidate images. Last, false text candidate images were excluded according to connected component property of text strokes Experimental result shows that this method can separate text from background easily, and gets good OCR result.
出处 《系统仿真学报》 EI CAS CSCD 北大核心 2008年第23期6498-6501,共4页 Journal of System Simulation
基金 航空支撑科技基金(05E551010)
关键词 视频检索 文本检测 文本分割 K-均值聚类 video indexing video text detection text segmentation K-means clustering
  • 相关文献

参考文献9

  • 1蔡波,周洞汝,胡宏斌.数字视频中字幕检测及提取的研究和实现[J].计算机辅助设计与图形学学报,2003,15(7):898-903. 被引量:16
  • 2X L Chen, J Yang, J. Zhang, A Waibel. Automatically text detection and recognition in natural scene images [C]// IEEE Trans. Image Processing, 2004. USA: IEEE, 2004, V13:87-99.
  • 3Li Hui-ping, Doermann D. Text enhancement in digital video using multiple frame integration [C]// Proceedings of ACM Multimedia 1999, Orlando FL, USA. USA: ACM, 1999: 19-22.
  • 4V Wu, R Manmatha, E M Riseman. Textfinder: an automatic system to detect and recognize text in images [C]//IEEE Trans. PAMI, 1999. USA: IEEE, 1999, V20: 1224-1229.
  • 5C M Tsai, H J Lee. Binarization of color document images via luminance and saturation color features [C]//IEEE Trans. on Image Processing, 2002, 11(4): 434-451.
  • 6R Lienhart, A Wernicke. Localizing and segmenting text in images and videos [C]// IEEE Trans. Circuits and Systems for Video Technology, 2002, 12: 256-268.
  • 7叶齐祥.图像和视频检测技术研究[D].北京:中国科学院研究生院,2005.
  • 8章毓晋.图像工程-图像处理[M].北京:清华大学出版社,2006.
  • 9张炘中.汉字识别技术[M].北京:清华大学出版社,1992..

二级参考文献5

  • 1Ohya J, Shio A, Akamatsu S. Recognizing characters in scene images [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1994, 16(7): 214~224
  • 2Lopresti D, Zhou J. Document analysis and the world wide web[A]. In: Proceedings of International Workshop on Document Analysis Systems, Malvern, PA, 1996. 651~669
  • 3Yeo B L, Liu B. Visual content highlighting via automatic extraction of embedded captions on MPEG compressed video [A]. In: Proceedings of SPIE Digital Video Compression: Algorithms and Technology, San Jose, CA, USA, 1996. 2668: 38~47
  • 4Lienhart R, Stuber F. Automatic text recognition in digital videos[R]. Mannheim Germany: University of Mannheim, TR-95-036, 1995
  • 5Smith M A, Kanade T. Video skimming and characterization through the combination of image and language understanding technique[A]. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, San Juan, Puerto Rico, 1997. 775~781

共引文献25

同被引文献40

引证文献5

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部