一种新的视频文本定位方法

A New Approach For Video Text Localization

下载PDF

导出

摘要本文提出了一种基于投影分析与支持向量机学习相结合的文本定位方法。首先采用投影分析的方法将可能的文本区域提取出来,然后再采用基于支持向量机学习的方法将提取出来的文本区域中的虚假文本区域排除掉。采用投影分析的方法时先将图像的边缘提取出来,再使用一些形态学的操作使边缘聚集,最后采用多次投影定位出文本区域。在使用支持向量机进行文本分类时本文采用了小波,角点,扫描线和区域内边缘点的重心位置等特征。实验表明该方法比单纯的基于边缘的方法要好。 A novel approach to text localization is presented in this paper. We combine projection analysis of edge based method and learning of support vector machine based method in text localization. The text localization can be divided into two steps. In the first step, the potentially text area are extracted by the edge method. In the second step, we use support vector machine to classify the actual text areas and the false text areas. The false text areas are removed in this step. The edge of the image is extracted first in the edge method. Then the morphologic operation is used. The text areas are localized by multiply horizontal and vertical projection. The textures we used in the support vector machine are wavelet, comer, line and the center of gravity of the text areas. It shows a good result in the experiment compare to the simple edge based method.

作者钮燕

机构地区紫琅职业技术学院

出处《科技信息》 2011年第27期I0040-I0042,I0083,共4页 Science & Technology Information

关键词边缘文本定位支持向量机多次投影分析 Edge Text localization Support vector machine Projection analysis

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1史迎春,王韬,周献中.一种基于时空分布特征的新闻字幕检测新算法[J].系统仿真学报,2004,16(11):2483-2485. 被引量：5
2王勇,燕继坤,郑辉.一种自适应的视频帧中字幕检测定位方法[J].计算机应用,2004,24(1):134-135. 被引量：5
3马小勇,谢萍,张宪民.视频帧中提取文字区域的算法[J].计算机工程,2003,29(9):155-157. 被引量：3
4沈淑娟.基于时空域信息的视频字幕提取算法研究[D]西安电子科技大学,西安电子科技大学2004.
5H. Li,D. Doermann.A video text detection system based on automated training. Proceedings of IEEE International Conference on Pattern Recognition . 2000
6K.Jung,K.I.Kim,A.K.Jain.Text Information Extraction in Images and Video: A Survey. Pattern Recognition . 2004
7Wu V,Manmatha R,Riseman E M.TextFinder: an automatic system to detect and recognize text in images. IEEE Transactions on Pattern Analysis and Machine Intelligence . 1999
8Otsu N.A threshold selection method from gray-level histograms. IEEE Transactions on Systems Man and Cybernetics . 1979
9Edward K.Wong, Minya Chen.A new robust algorithm for video text extraction. Pattern Recognition . 2003
10B.L. Yeo and B.Liu.Visual Content Highlighting via Automatic Extraction of Embedded Captions on MPEG Compressed Video. . 1996

二级参考文献17

1Hua Xiansheng, Chen Xiangrong, Liu Wenyin, et al . Automatic Localion of Text in Video Frames. 3nd Intl Workshop on Multimedia Information Retrieval (MIR2001) Ottawa, Canada, 2001-10-05.
2Trajkovic M, Hedley M. Fast Comer Detection[J]. Image and Vision Computing, 1998, 16(I): 75-87.
3Smith S M, Brady J M. SUSAN -- A New Approach to Low Level Image Processing. Int.Jour.of Computer Vision.23(I),1977-05:45-78.
4Xi J, Hua Xiansheng, Chen Xiangrong, et al. A Video Text Detection and Recognition System. IEEE International Conference on Multimedia and Expo (ICME2001) , Waseda University, Tokyo, Japan,2001-08- 22.
5Wemicke A, Lienhart R. On the Segmentation of Text in, Videos. IEEE Int.Conf.on Multimedia and Expo (ICME2000). 2000-07:1511-1514.
6Jain A K, Yu B. Automatic Text Location in Images and Video Frames Pattern Recognition, 1998,31(12): 2055-2076.
7Shi Yingchun, et al. Multiplayer semantic modeling of video and its semantic extraction framework[A]. In: proceedings of RISSP 2003[C]. Changsha: 2003. 783-788.
8Lienhart R, Stuber F. Automatic text recognition in digital videos[A]. In: proceedings of ACM multimedia[C]. boston: 1996. 11-20
9Wernicke A, Lienhart R. On the segmentation of text in videos[Z]. IEEE standards office, New York ,2000:1511-1544
10Zhong Yu, Zhang H J et al. Automatic caption localization in compressed video[J]. pattern analysis and machine intelligence, 2000,22(4):385-392.

共引文献10

1闻京,张凌,袁华.一种复杂背景图像中文字区域提取算法[J].中山大学学报（自然科学版）,2008,47(z1):5-10. 被引量：1
2王勇,李建彬,胡德文,郑辉.一种基于学习的视频字幕验证方法[J].中国图象图形学报,2006,11(11):1645-1649. 被引量：1
3蓝照华,赵进创.新闻视频检索技术的研究[J].中国有线电视,2006(24):2414-2416. 被引量：1
4李治强,杨强.基于时空分布特征的新闻字幕检测改进算法[J].广播与电视技术,2007,34(2):103-105. 被引量：3
5罗洪刚,王士林,倪佑生,李生红.一种用于网络动画过滤的文字提取方法[J].信息安全与通信保密,2007,29(11):66-67. 被引量：5
6刘元春,凌坚,练益群.电视新闻节目中标题字幕的提取技术探索[J].广播与电视技术,2008,35(11):91-92. 被引量：1
7艾力.居麦,哈力旦.A,黄浩.视频图像中维吾尔文字的识别研究[J].计算机工程与应用,2011,47(36):190-192. 被引量：6
8陈燕升,任江涛,黄达峰.基于AFSA-LSSVM的视频字幕定位模型[J].电视技术,2014,38(5):42-45.
9文毅,龚飞,党静雅,邢更力.视频文字信息检查工具的设计与实现[J].计算机测量与控制,2015,23(5):1754-1757. 被引量：1
10王亚,褚晶辉,刘子玉,吕卫.支持多种文字的视频字幕叠加工具设计[J].信息技术,2015,39(9):118-120.

1符强,任风华,蒋昌茂,纪元法,赵岭忠.基于小波分解的PCNN遥感图像融合算法[J].华中师范大学学报（自然科学版）,2012,46(1):117-120. 被引量：1
2石跃祥,朱健,刘海涛.基于提升小波变换的图像融合新算法[J].计算机工程与应用,2012,48(10):167-170. 被引量：3
3李桐.图像融合中角点检测技术研究[J].北京印刷学院学报,2010,18(2):39-41.
4钮燕.基于多分辨率的广告视频文本定位[J].科技信息,2011(29).
5王奎,李卫华,李小春.非采样轮廓波变换下的红外与可见光图像融合[J].空军工程大学学报（自然科学版）,2015,16(6):55-59. 被引量：6
6李皓,唐朝京,张权.快速图像序列人脸提取算法[J].现代电子技术,2007,30(8):103-105.
7陈乐,吕文阁,丁少华.角点检测技术研究进展[J].自动化技术与应用,2005,24(5):1-4. 被引量：45
8范晓,尹宝才,孙艳丰.基于嘴部Gabor小波特征和线性判别分析的疲劳检测[J].北京工业大学学报,2009,35(3):409-413. 被引量：4
9初秀琴,胡乐,王飞,彭剑峰.复杂背景下的人脸检测新算法[J].红外技术,2010,32(8):467-470. 被引量：1
10晁锐,张科,李言俊.一种基于小波变换的图像融合算法[J].电子学报,2004,32(5):750-753. 被引量：153

科技信息

2011年第27期

浏览历史

内容加载中请稍等...

一种新的视频文本定位方法

参考文献10

二级参考文献17

共引文献10

相关作者

相关机构

相关主题

浏览历史