期刊文献+

基于多尺度图像融合的新闻视频文字区域检测定位算法

Automatic Text Detection in News Video Based on Multi-Resolution Image Fusion
下载PDF
导出
摘要 针对新闻视频中的文字区域检测定位问题,研究了如何快速有效地检测新闻视频中的文字区域,用以实现自动视频检索。为克服视频中文字大小不一的难题,根据视频图像中文字区域特征有别于背景区域的特点,提出了一种基于多尺度图像融合的新闻视频文字区域检测定位算法。算法主要采用训练和学习两个步骤,首先对人为收集的含字符样本与不含字符样本进行小波特征和局部二值模式等特征提取,并完成SVM分类器训练,获取分类器;然后对测试视频帧进行多尺度的遍历检测,并融合检测结果,获取每帧的文字区域。实验结果表明,与前人提出的基于边缘检测的方法相比,算法具有明显优越性,在定位准确度上有较大提高,同时还能克服视频帧之间的快速变换,具有一定的实用意义。 As to the problem of automatic text detection in news video, an efficient algorithm was proposed for text location and video searching. In order to overcome the challenge of the different size of text in news video frames, an algorithm which was based on multi-resolution The method includes two steps : firstly, the wavelet feature image fusion and text block feature was presented. and LBP feature of positive samples and negative sam- ples were extracted which can be trained by support vector machine (SVM). And then, the test video for text detection should be ergoticly detected by multi-resolution method. Finally, the result image of text detection can be gained by multi-resolution image fusion. The experimental results show that this method has the superiority of accuracy rating compared with the traditional method based on edge detection, so that the video frames are trans- formed quickly
作者 章慧 赵丽娟
出处 《贵州大学学报(自然科学版)》 2012年第6期86-90,共5页 Journal of Guizhou University:Natural Sciences
基金 国家自然科学基金(No.60973113) 淮安市工业科技支撑项目(HAG2010069)
关键词 小波特征提取 局部二值模式特征 文字定位 多尺度融合 wavelet feature eLBP text detection multi-resolution fusion
  • 相关文献

参考文献5

二级参考文献28

  • 1王勇,郑辉,胡德文.图像和视频中的文字获取技术[J].中国图象图形学报(A辑),2004,9(5):532-538. 被引量:13
  • 2谢毓湘,栾悉道,吴玲达,老松杨.新闻视频帧中的字幕探测[J].计算机工程,2004,30(20):167-168. 被引量:15
  • 3刘洋,薛向阳,路红,郭跃飞.一种基于边缘检测和线条特征的视频字符检测算法[J].计算机学报,2005,28(3):427-432. 被引量:20
  • 4Xiaoqing Liu, Jagath Samarabandu. An Edge - based Text Region Extraction Algorithm for Indoor Mobile Robot Navigation[J]. IEEE International Conference on Mechatronics & Automation Niagara Falls, Canada July 200.5. 701 - 706.
  • 5Xiaoqing Liu and Jagath Samarabandu. Multiscal Edge - based Text Extraction From Complex Images [J].IEEE ICME 2006. 1721 - 1723.
  • 6[美] Rafael C Gonzalez, Richard E Woods, Steven L Eddins 著,阮秋琦,等译.数字图像处理[M].北京:电子工业出版社,2005.
  • 7[1]Y Wang, Z Liu, J Huang. Multimedia content analysis using audio and visual information[J]. IEEE Signal Processing Magazine, 2000, 17(6):12~36
  • 8[2]R Lienhart, F Stuber. Automatic text recognition in digital videos[A]. In: Proceedings of ACM Multimedia, Boston, 1996.11~20
  • 9[3]Zhong Yu, Zhang Hongjiang, Jain Anil K. Automatic caption localization in compressed video[J]. Pattern Analysis and Machine Intelligence, 2000, 22(4):385~392
  • 10[4]V Vapnik. The Nature of Statistical Learning Theory[M]. New York: Springer, 1995

共引文献37

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部