期刊文献+

基于深度学习法的视频文本区域定位与识别 被引量:11

Area Location and Recognition of Video Text Based on Depth Learning Method
下载PDF
导出
摘要 通过对视频图像进行快速、准确的文本定位与识别,有利于提高视频信息处理的效率与准确率.采用Gabor滤波器实现在横、竖、撇、捺四个方向上的视频图像的纹理特征的提取,再通过RBM逐层增量深度学习算法构建深度置信网络,实现对提取的纹理特征图像中文本区域的定位.论文同时研究了利用形态学处理方法和OCR字符库实现对视频图像文本识别的可行性,并分析了识别效果.测试结果表明,本文提出的深度学习算法与形态学字符识别方法相结合,不但能够实现对视频图像文本区域的准确定位,还有利于提高字符识别的效率和准确率. It is advantageous to improve the efficiency and accuracy of video information processing through fast and accurate text area location and recognition of video images.The Gabor filter has been used to extract the texture features of video images in the four directions of horizontal,vertical,left-failing and right-falling.Then,by RBM layer increment depth learning algorithm,a depth belief network has been structured,and at the same time,the text region location for the texture feature images has been realized.The paper also studied the feasibility and recognition effect about using morphological process and OCR character database to realize the video image text recognition.The test results showed that the proposed optimized depth learning algorithm combining with morphology character recognition method can not only realize the accurate location of the text region for video images,but also improve the efficiency and accuracy of the character recognition.
出处 《哈尔滨理工大学学报》 CAS 北大核心 2016年第6期61-66,共6页 Journal of Harbin University of Science and Technology
基金 国家自然科学基金(61401126)
关键词 深度学习算法 视频图像 文本区域定位 形态学去噪 字符识别 depth learning algorithm video image text area location morphological denoising character recognition
  • 相关文献

参考文献5

二级参考文献43

  • 1刘洋,薛向阳,路红,郭跃飞.一种基于边缘检测和线条特征的视频字符检测算法[J].计算机学报,2005,28(3):427-432. 被引量:20
  • 2孙慧平,刘党辉,沈兰荪.基于DCT压缩域的快速字符定位算法研究[J].电子学报,2006,34(4):751-754. 被引量:4
  • 3黄剑华,吴锐,刘家锋,唐降龙.一种基于同质映射的视频图像中文本检测方法[J].高技术通讯,2007,17(3):249-254. 被引量:1
  • 4LIU H,WU Q,ZHANG H B.Skew detection for complex document images using robust borderlines in both text and non-text regions[J].Pattern Recognition Letters,2008,29(13):1893-1900.
  • 5VIET C D,SEONG S C,SEUNGWOOK C.An efficient method for text detection in video based on stroke width similarity[C].Proceeding of the 8th Asian conference on computer vision,2007(1):200-209.
  • 6QIAN X M,LIU G ZH,WANG H,et al.Text detection,localization,and tracking in compressed video[J].Signal Processing:Image Communication,2007,22(9):752-768.
  • 7LI SH T,SHEN Q H,SUN J.Skew detection using wavelet decomposition and projection profile analysis[J].Pattern Recognition letters,2007,28(5):555-562.
  • 8LIU X B,FU H,JIA Y D.Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in images[J].Pattern Recognition,2008,41(2):484-493.
  • 9MANJUNATH BS, MA M Y. Texture Feature for Image Retrieval[M]. John Wiley & Sons Inc., 2002.
  • 10MANJUNATH BS, MA M Y. Texture Feature for Browsing and Retrieval of Image Data[J]. IEEE-PAMI, 2000, 18 (8):837- 842.

共引文献42

同被引文献96

引证文献11

二级引证文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部