期刊文献+

基于笔画相关加权的视频图像文字识别 被引量:4

Video image character recognition based on stroke-related weight
下载PDF
导出
摘要 为了提取影视视频图像中的字幕信息,提出一套鲁棒的方法:首先采用图像的边缘特征对字幕信息进行区域定位,并给出结合边缘信息的方法对图像文字进行二值化;其次,采用投影法和区域生成方法定位单个文字;最后,充分考虑到文字笔画的拓扑结构,进行相邻子网格笔画结构相关性的判定,并采用笔画模糊隶属度完成弹性网格特征的提取。该方法在复杂的背景图像中能够有效得到文字的二值图像,并保证了提取特征的稳定性、健壮性,对二值化后的影视字幕的识别率达到92.1%,实验结果表明了方法的有效性。 In order to extract the subtitle in the video image, a robust method was proposed. First, the image edge feature was adopted in caption location step, and the binarization method of text images with the edge information was given. Then, the method combined with projection and regional generation was used to locate a character. Finally, taking fully account of the topology of the text strokes, the stroke correlation among the adjacent sub-grids was determined and the stroke fuzzy membership was used to complete the elastic grid feature extraction. This method can effectively get the binary image of characters from a complex background image, ensure the stability and robustness in feature extraction. The experimental results show the method is effective, and its recognition rate has been up to 92.1%.
出处 《计算机应用》 CSCD 北大核心 2012年第8期2305-2308,2312,共5页 journal of Computer Applications
基金 重庆市教委科学技术研究项目(KJ110504) 重庆市科委自然科学基金资助项目(2009BB2081) 教育部留学回国人员科研启动基金资助项目(教外司留[2010]1174)
关键词 视频图像 文字识别 文本定位 二值化 子网格特征 笔画相关性 video image character recognition text location binarization sub-grid feature stroke correlation
  • 相关文献

参考文献12

二级参考文献69

  • 1杜世宏,王桥,杨一鹏.一种定性细节方向关系的表达模型[J].中国图象图形学报(A辑),2004,9(12):1496-1503. 被引量:16
  • 2冯志伟.用上下文无关语法来描述汉字结构[J].语言科学,2006,5(3):14-23. 被引量:9
  • 3王开铸,王英伟.汉字字形的关系稳定原理[J].中文信息学报,1996,10(4):24-31. 被引量:3
  • 4R Lienhart, A Wemicke. Localizing and segmenting text in images, videos [ J ]. IEEE Transactions on Circuits Syst Video Technol, 2002,12(4) :256 - 268.
  • 5Agnihotri L, Dimitrova N. Text detection for video analysis [ A]. IEEE Workshop on Content-Based Access of Image and Video Libraries [C ]. Fort Collins, CO, USA: IEEE Press, 1999.109 - 113.
  • 6K Jain, B Yu. Automatic text location in images and video frames[ J]. Pattern recognition, 1998,31(12) :2055 - 2076.
  • 7Wenge Mao,Fu-lai Chung,Lam, K K M, Wan-chi Sun.Hybrid Chinese/English text detection in images and video frames [ A]. Proceedings of 16th International Conference on Pattern Recognition, 2002 [C ]. Washington, DC, USA: IEEE Computer Society,Volume (3) ,Aug 2002. 1015 - 1018.
  • 8J Gllavata, R Ewerth, B Freisleben. A text detection, localization and segmentation system for OCR in images[A]. Proceedings of the 1EEE Sixth International Symposium on Multimedia Software Engineering[ C]. Washington, DC, USA :IEEE Computer Society,2004.310 - 317.
  • 9Michael R Lyu, Jiqiang Song, Min Cal. A comprehensive method for multilingual video text detection, localization, and extraction[J ]. IEEE Transaction on circuits and systems for video technology, 2005,15(2) :243 - 255.
  • 10D Chen,K Shearer,H Bourlard. Text enhancement with asymmelric filter for vdeo OCR[A]. In Proceedings of 11 th International Conference Image Analysis Processing [ C ]. Palermo, I taly: IEEE Press,2001,192 - 197.

共引文献66

同被引文献59

  • 1李对红,王裴岩 ,张桂平,张少阳.基于字簇的多模型中文分词方法研究[J].计算机应用研究,2020,37(2):355-359. 被引量:2
  • 2曹阳,高志远,杨胜春,姚建国,梁云,孙云枫.云计算模式在电力调度系统中的应用[J].中国电力,2012,45(6):14-17. 被引量:37
  • 3王水平,唐振民,陈北京,蒋晔.复杂环境下语音增强的复平面谱减法[J].南京理工大学学报,2013,37(6):857-862. 被引量:6
  • 4梁华刚,程加乐,茹锋.基于特征空间法的旋转多字体文字识别[J].微电子学与计算机,2015,32(4):82-85. 被引量:3
  • 5董振东,董强,郝长伶.知网的理论发现[J].中文信息学报,2007,21(4):3-9. 被引量:98
  • 6Hinton G E, Salakhutdinov R R.Reducing the dimension- ality of data with neural networks[J].Science, 2006,313 : 504-507.
  • 7WIDMER A. SCHAER R, MARKONIS D, et al. Facilitating medical information search using Google Glass connected to a content-based medical retrieval system [C]//36th Annual International Conference of the IEEE, Chicago, USA, 2014.
  • 8TEIXEIRAJM F, FERREIRAR D, SANTOSM P, et al. Tele-operation using Google glass and AR, Drone for structural inspection [C]//Virtual and Augmented Reality, Piata Salvador, 2014.
  • 9SILVAM L, FREITASD C, MARCEL P, et ol. Tele-operation using Google glass and AR, drone for structural inspection[C]// Virtual and Augmented Reality, Piata Salvador, 2014.
  • 10WILLEM W, SCHOLLPM M, WISCHNIEWSKIS A, et al. Comparing Google glass with tablet-PC as guidance system for assembling tasks [C]//Wearable and Implantable Body Sensor Networks Workshops, Zurich, 2014.

引证文献4

二级引证文献17

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部