News Image Caption Line Segmentation Algorithm Based on Template Matching

Original title: 基于模板匹配的新闻图像字幕行切分算法

Cited by: 2


Abstract: This paper addresses character segmentation of horizontal caption lines in news images. To overcome the character splitting caused by existing segmentation methods that operate on single characters, a response function is constructed from the distribution pattern of characters within a caption line. This turns character segmentation into the problem of finding the optimum of that response function, and characters are then segmented according to the optimization result. The algorithm has two main parts. First, the rough width of a single character is determined from the vertical projection histogram, and a variable-length template is constructed from that width. Second, a template response function is constructed, and the left and right boundaries of each character are determined from the optimal response values of templates of different lengths. The segmentation result is then output. Experimental results show that the algorithm performs well on both touching (adhesion) and non-touching character images.
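The two steps named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's actual width estimate or response function, so everything below is an assumption made for illustration: the rough width is taken as the median length of inked column runs, the response of a template is the amount of ink at its right boundary (lower is better, ties broken by closeness to the rough width), and the names `vertical_projection`, `rough_char_width`, `segment_caption_line`, and the `slack` parameter are hypothetical.

```python
from statistics import median


def vertical_projection(binary):
    """Count foreground (non-zero) pixels in each column of a binary image (list of rows)."""
    return [sum(1 for px in col if px) for col in zip(*binary)]


def rough_char_width(proj):
    """Rough character width: median length of consecutive inked columns.

    Assumption: a stand-in heuristic, not the paper's estimator.
    """
    runs, run = [], 0
    for value in proj:
        if value > 0:
            run += 1
        elif run:
            runs.append(run)
            run = 0
    if run:
        runs.append(run)
    return int(median(runs)) if runs else 0


def segment_caption_line(binary, slack=0.3):
    """Greedy left-to-right segmentation with a variable-length template.

    For each character start, template widths within +/- slack of the rough
    width are tried; the width whose right boundary carries the least ink wins.
    Assumption: this response is a simplified placeholder for the paper's.
    """
    proj = vertical_projection(binary)
    n = len(proj)
    width = rough_char_width(proj)
    padded = proj + [0]  # lets a right boundary fall just past the last column
    lo = max(1, round(width * (1 - slack)))
    hi = max(lo, round(width * (1 + slack)))
    cuts, left = [], 0
    while left < n:
        if padded[left] == 0:  # skip blank columns between characters
            left += 1
            continue
        candidates = []
        for w in range(lo, hi + 1):
            right = min(left + w, n)
            candidates.append((padded[right], abs(w - width), right))
        _, _, right = min(candidates)  # least boundary ink, then closest width
        cuts.append((left, right))
        left = right
    return cuts


# Three 8-column characters; the middle one has an internal one-column gap.
row = [1] * 8 + [0] * 2 + [1] * 3 + [0] + [1] * 4 + [0] * 2 + [1] * 8
image = [row[:] for _ in range(10)]
print(segment_caption_line(image))  # [(0, 8), (10, 18), (20, 28)]
```

In this synthetic line, a purely gap-based split would cut the middle character at its internal gap (column 13). Because the template only considers widths near the rough estimate, that gap is never a candidate boundary and the character stays whole. Real caption images would first need binarization, and touching characters would yield non-zero boundary ink, which this sketch handles only by picking the minimum.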

Authors: WANG Zhi-heng (王志衡), GUO Chao (郭超), LIU Hong-min (刘红敏) (College of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, Henan 454000, China)

Affiliation: College of Computer Science and Technology, Henan Polytechnic University

Source: Journal of Beijing University of Posts and Telecommunications (《北京邮电大学学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2016, No. 3, pp. 49-53 (5 pages)

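Funding: National Natural Science Foundation of China (61572173, 61472119, 61472373, 61272394); Henan Province University Innovative Science and Technology Talent Program (13HASTIT039); Henan Polytechnic University Innovative Research Team Program (T2014-3); Henan Polytechnic University Outstanding Youth Fund (J2013-2)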

Keywords: news images; captions; template matching; character segmentation

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]







