过渡映射耦合改进的阈值分割的文本提取方法

Text extraction method based on transition mapping coupled improved threshold segmentation

下载PDF

导出

摘要针对文本提取技术难以准确定位文本区域的问题,提出一种场景文本检测与提取方法。根据文本与其相邻背景之间的瞬态颜色差异,基于像素强度的变化,构建过渡映射,生成一个过渡图;通过计算过渡像素与其周围纹理的一致性,确定候选文本区域;利用LBP算子计算过渡像素附近的强度变化,得到文本区域,利用像素投影优化文本区域,精确定位文本区域的边界;在过渡像素中添加一个约束,利用改进的阈值分割方法,从文本区域中准确提取文本字符串。实验结果表明,与当前场景文本提取技术相比,在复杂视频场景中,所提算法具有更高的文本提取精度与鲁棒性。 To solve the defects of inaccurate localization of text regions in current video text extraction technology,a method of scene text detection and extraction was proposed.According to the transient color difference between the text and its adjacent background,a transition map was constructed based on the change of pixel intensity to generate a transition map.The candidate text region was determined by calculating the consistency between the transition pixel and its surrounding texture.The LBP was used to calculate the intensity change near the transition pixels to obtain text region,and the pixel region was optimized using pixel projection in the transition map so that the boundaries of the text region were accurately located.The text strings were extracted from the text region based on the improved threshold segmentation method and adding a constraint to the transition pixels.Experimental results show that the proposed algorithm has higher text extraction accuracy and robustness in complex video scene compared with other video text extraction algorithms.

作者贾彦茹张连堂周丽宴 JIA Yan-ru1 , ZHANG Lian-tang2 , ZHOU Li-yan3(1.School of Mathematics and information, Xinyang University, Xinyang 464000, China; 2. School of Computer and information Engineering, Henan University, Kaifeng 475001, China; 3. School of information Engineering, Zhengzhou University, Zhengzhou 450001, Chin)

机构地区信阳学院数学与信息学院河南大学计算机与信息工程学院郑州大学信息工程学院

出处《计算机工程与设计》北大核心 2018年第8期2603-2609,共7页 Computer Engineering and Design

基金国家自然科学基金项目(61172086) 河南省科技发展计划基金项目(132300410474)

关键词文本提取过渡映射像素投影文本区域阈值分割文本边界 video text extraction transition mapping pixel projection text region threshold segmentation text boundary

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献5

1胡胜红,谭生龙,桂超,孙宝林.基于记分牌时间和新闻文本提取足球视频精彩事件[J].济南大学学报（自然科学版）,2016,30(5):321-327. 被引量：2
2胡峰松,朱浩.基于HSI颜色空间和行扫描的车牌定位算法[J].计算机工程与设计,2015,36(4):977-982. 被引量：18
3刘舒萍,汤宏颖.基于MCA与判别字典学习的场景图文字检测方法[J].传感器与微系统,2017,36(7):45-49. 被引量：2
4张国和,黄凯,张斌,符欢欢,赵季中.最大稳定极值区域与笔画宽度变换的自然场景文本提取方法[J].西安交通大学学报,2017,51(1):135-140. 被引量：18
5吴晓雨,何彦,杨磊,张宜春.基于改进形状上下文特征的二值图像检索[J].光学精密工程,2015,23(1):302-309. 被引量：31

二级参考文献33

1GAO Wen HUANG Qing-ming JIANG Shu-qiang ZHANG Peng.Sports video summarization and adaptation for application in mobile communication[J].Journal of Zhejiang University-Science A(Applied Physics & Engineering),2006,7(5):819-829. 被引量：1
2赵丕锡,胡滨,王秀坤,李国辉.足球视频的结构分析与概要[J].计算机工程与应用,2005,41(30):166-168. 被引量：6
3ZHAO Q,CAO J,HU Y.Image retrieval based on color-spatial distributing feature[J].Multimedia and Signal Processing Communications in Computer and Information Science,2012,346:79-86.
4KEKRE H B,THEPADE S D.Image retrieval using color-texture features extracted from walshlet pyramid[J].ICGST International Journal on Graphics,Vision and Image Processing (GVIP),2010,10:9-18.
5LEDWICH L,WILLIAMS S.Reduced SIFT features for image retrieval and indoor localization[C].Australian Conference on Robotics and Automation,2004,322:3.
6BELONGIE S,MALIK J,PUZICHA J.Shape context:A new descriptor for shape matching and object recognition[C].NIPS,2000,2:3.
7LING H,JACOBSA D W.Shape classification using the inner-distance[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2007,29(2):286-299.
8XIE J,HENG P A,SHAH M.Shape matching and modeling using skeletal context[J].Pattern Recognition,2008,41(5):1756-1767.
9ROMAN-RANGEL E,PALLAN C,ODOBEZ J M,et al..Analyzing ancient Maya glyph collections with contextual shape descriptors[J].International Journal of Computer Vision,2011,94(1):101-117.
10BELONGIE S,MALIK J,PUZICHA J.Shape matching and object recognition using shape contexts[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24 (4):509-522.

共引文献65

1G.Peters,W.Sll,崔运花,李毓陵.纱线中杂质、灰尘量的测定[J].国际纺织导报,2000,28(1):30-34.
2孙娜.自然语言文本中否定性信息智能抽取仿真[J].计算机仿真,2018,35(12):276-279. 被引量：2
3王灿进,孙涛,李正炜.基于快速轮廓转动力矩特征的激光主动成像目标识别[J].中国光学,2015,8(5):775-784. 被引量：5
4李玲,宋莹玮,杨秀华,陈逸杰.应用图学习算法的跨媒体相关模型图像语义标注[J].光学精密工程,2016,24(1):229-235. 被引量：3
5李黎,戴军,杨旭.改进的形状上下文在舰船图像匹配中的应用[J].数字技术与应用,2016,34(3):76-78. 被引量：2
6张瑞萌,张重阳.多特征级联筛选的高鲁棒车牌检测[J].电视技术,2016,40(4):109-114.
7施隆照,王凯.基于连通区域的复杂车牌的字符分割算法[J].计算机工程与设计,2016,37(8):2138-2142. 被引量：13
8唐瑞尹,卢博超,李博文.彩色图像车牌定位方法的研究现状及展望[J].工业控制计算机,2016,29(9):113-114.
9范莹,白瑞林,王秀平,李新.改进型形状上下文的工件立体匹配方法[J].激光技术,2016,40(6):814-819. 被引量：4
10卫丽华,朱鹏程,管致锦.基于方向局部极值模式的图像检索算法[J].计算机工程与设计,2016,37(12):3334-3339. 被引量：3

1王志军.按大小快速实现数字重排[J].电脑知识与技术（经验技巧）,2018,0(3):46-46.
2杨艳,刘洲峰,李春雷.基于机器视觉检测算法的织物疵点检测系统研究[J].中原工学院学报,2017,28(4):36-39. 被引量：1
3熊海朋,陈洋洋,陈春玮.基于卷积神经网络的场景图像文本定位研究[J].电子科技,2018,31(1):50-53. 被引量：11
4胡强,马文广,石志儒.HEVC兼容的全景视频运动补偿预测算法[J].中兴通讯技术,2017,23(6):28-31.
5王林,张晓锋.卷积深度置信网络的场景文本检测[J].计算机系统应用,2018,27(6):231-235. 被引量：2
6杨帆,赵增鹏,张磊.基于高斯混合模型的遥感影像云检测技术[J].南京林业大学学报（自然科学版）,2018,42(4):134-140. 被引量：5
7胡晓.基于云计算平台的人脸识别[J].电信快报（网络与通信）,2018(4):39-41. 被引量：1
8谢宗彦,黎巎,周纯洁.基于CNN和SOM的评论主题发现[J].情报科学,2018,36(6):30-34. 被引量：3
9方承志,黄梅玲.自然场景中多方向文本的检测[J].计算机工程与设计,2018,39(5):1377-1381. 被引量：2
10程申前,游林.基于局部宏观结构和微观特征融合的手指静脉识别算法[J].通信技术,2018,51(7):1585-1593.

计算机工程与设计

2018年第8期

浏览历史

内容加载中请稍等...

过渡映射耦合改进的阈值分割的文本提取方法

参考文献5

二级参考文献33

共引文献65

相关作者

相关机构

相关主题

浏览历史