
Spatio-temporal Video Caption Detection and Localization Algorithm (cited by: 2)

Abstract: This paper proposes an algorithm for detecting and localizing overlay captions in digital video. Exploiting the temporal redundancy of video captions, the algorithm takes the shot as its basic processing unit and uses a surveillance-tracking model, the extended QSDD (PQSDD) measure, and a binary search to locate the first and last frames that contain the same caption within a shot. From these two frames it determines a starting and an ending caption transition frame pair, each of which yields a difference image. Captions are then localized in each difference image from its edge map, and an adaptive threshold selection algorithm based on background complexity is proposed to binarize the edge images. Finally, the caption regions found in the two difference images are combined with a logical AND, and mathematical morphology and connected-component analysis remove false caption regions to give the final caption regions. Experimental results show that the algorithm achieves high detection speed and localization precision.
Source: Journal of Chinese Computer Systems (《小型微型计算机系统》), indexed in CSCD and the Peking University Core Journals list (北大核心), 2009, No. 10, pp. 2054-2058 (5 pages)
Keywords: spatio-temporal characteristics; caption surveillance-tracking model; caption transition frame pair; extended QSDD (PQSDD); dynamic threshold selection
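
Two steps named in the abstract can be illustrated with a minimal sketch: the binary search for the last frame that still shows the same caption, and the final fusion of the two caption masks by logical AND followed by connected-component analysis. This is not the authors' implementation. The function names, the list-of-lists mask representation, the 4-connectivity, the `min_area` filter, and the `same_caption` predicate are all assumptions made for illustration. The abstract does not define the PQSDD measure or the adaptive threshold rule, so the predicate stands in for the former and the masks are taken as already binarized.

```python
from collections import deque


def last_frame_with_caption(first, last, same_caption):
    """Binary search for the last index in [first, last] where same_caption(i)
    is True. Assumes same_caption(first) is True and that the predicate never
    becomes True again once it has turned False (hypothetical stand-in for a
    PQSDD-based comparison against the caption's first frame)."""
    lo, hi = first, last
    while lo < hi:
        mid = (lo + hi + 1) // 2  # round up so the loop always makes progress
        if same_caption(mid):
            lo = mid
        else:
            hi = mid - 1
    return lo


def and_masks(mask_a, mask_b):
    """Pixel-wise logical AND of two equally sized binary masks (rows of 0/1)."""
    return [[a & b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(mask_a, mask_b)]


def connected_regions(mask, min_area=1):
    """Bounding boxes (top, left, bottom, right) of 4-connected regions of
    1-pixels that contain at least min_area pixels."""
    height = len(mask)
    width = len(mask[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r in range(height):
        for c in range(width):
            if not mask[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right, area = r, c, r, c, 0
            while queue:
                y, x = queue.popleft()
                area += 1
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if area >= min_area:
                boxes.append((top, left, bottom, right))
    return boxes


# Toy masks standing in for the regions found in the two difference images.
start_mask = [[0, 1, 1, 1, 0],
              [0, 1, 1, 1, 0],
              [0, 0, 0, 0, 1],
              [0, 0, 0, 0, 0]]
end_mask = [[0, 1, 1, 1, 0],
            [0, 1, 1, 0, 0],
            [1, 0, 0, 0, 0],
            [0, 0, 0, 0, 0]]
print(connected_regions(and_masks(start_mask, end_mask), min_area=2))
# prints [(0, 1, 1, 3)]
```

In the toy example, the isolated pixels that appear in only one mask disappear under the AND, which is the intuition behind fusing the two difference images: only regions supported by both the starting and the ending transition survive.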


