Text detection and application in natural scenes based on segmentation (基于分割的自然场景下文本检测方法与应用)

Abstract: Scene text detection and recognition are widely used in smart devices, and the first step of text recognition is to locate the text precisely. This paper proposes improvements for two main problems in the existing pixel-segmentation method PixelLink: detected regions for curved text include too much background, and the post-processing of detection results is insufficient. First, a feature-channel attention mechanism is introduced into the original network. By attending to the weight relationships among the feature channels of the generated feature map, it raises the weights of effective channels and suppresses those of ineffective or invalid channels, which improves the robustness of the detector. Second, the annotation format of the public datasets is changed so that the coordinate points are expressed as a directed sequence, which lets an LSTM model learn and draw polygon boxes that adapt to the text line. Then, the located region is rotated according to the angle between a pair of vertices of the polygon box and passed to a text recognition interface to obtain the final characters. Finally, text detection is tested on public datasets and a self-built dataset. The experiments show that the improved method outperforms the original method on each dataset, reaches accuracy close to that of current leading methods, and can detect text in a variety of environments.
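The annotation change and the rotation step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state the ordering rule, the starting vertex, or which pair of vertices defines the angle. The clockwise ordering, the top-left start, and the function names `to_directed_sequence` and `rotation_angle` are therefore assumptions made for illustration, not the authors' implementation.

```python
import math


def signed_area(points):
    """Shoelace formula for a polygon given in boundary order.

    In image coordinates (y grows downward) a positive value means the
    points run clockwise as seen on screen.
    """
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return total / 2.0


def to_directed_sequence(points):
    """Return the polygon as a clockwise sequence starting near the top-left.

    Assumption: the input points already follow the polygon boundary,
    in either direction. Only the direction and the start are normalized.
    """
    pts = [(float(x), float(y)) for x, y in points]
    if signed_area(pts) < 0:
        pts.reverse()  # enforce one consistent direction
    # Assumed start rule: smallest x + y, ties broken by smaller x.
    start = min(range(len(pts)), key=lambda i: (pts[i][0] + pts[i][1], pts[i][0]))
    return pts[start:] + pts[:start]


def rotation_angle(p, q):
    """Angle in degrees of the segment p -> q relative to the x axis."""
    return math.degrees(math.atan2(q[1] - p[1], q[0] - p[0]))


if __name__ == "__main__":
    box = [(0, 5), (10, 5), (10, 0), (0, 0)]  # counterclockwise on screen
    seq = to_directed_sequence(box)
    print(seq)
    print(rotation_angle(seq[0], seq[1]))
```

A sequence normalized this way gives a sequence model a fixed starting point and direction for every polygon. The angle between the first two vertices is one possible choice for deskewing the cropped region before recognition.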

Authors: Chen Xiaoshun (陈小顺); Wang Liangjun (王良君) (School of Computer Science and Telecommunication Engineering, Jiangsu University, Zhenjiang 212013, China)

Affiliation: School of Computer Science and Telecommunication Engineering, Jiangsu University

Source: Application of Electronic Technique (《电子技术应用》), 2021, No. 2, pp. 54-57 (4 pages)

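Funding: National Natural Science Foundation of China (61601202); Natural Science Foundation of Jiangsu Province (BK20140571); Jiangsu University Research Start-up Fund for Senior Professional Talents (14JDG038).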

Keywords: pixel segmentation; attention mechanism; LSTM; natural scene text detection

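Classification: TN911.73 [Electronics and Telecommunications: Communication and Information Systems]; TP391.4 [Automation and Computer Technology: Computer Application Technology]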
