期刊文献+

基于CRNN模型的中文场景文字识别 被引量:4

ChineseScene Text Recognition based on CRNN model
下载PDF
导出
摘要 中文场景文字识别(STR)是光学字符识别(OCR)技术的重要研究方向,在拍照翻译、无人驾驶等领域广泛应用。但是,中文场景下的文字面临着字体和字符种类多、文字背景复杂等问题。本文着眼于“中国街景”图像,基于CRNN模型提出了一种免分割、端到端的中文场景文字识别方法。首先CNN提取图像卷积特征,然后RNN进行序列特征预测,其中Bi-GRU有效抑制梯度消失或梯度爆炸,Dropout可以防止过拟合,最后引入CTC作为损失函数解决训练时字符无法对齐的问题。本文用Python实现了算法,以较好的效果完成了实验。 Chinese scene character recognition(STR)is an important research direction of optical character recognition(OCR)technology,which is widely used in the fields of photo translation and unmanned driving.However,the characters in Chinese scene are faced with many problems,such as many types of fonts and characters,complex text background and so on.This paper focuses on the"Chinese street view"image,and proposes a segmentation free,end-to-end Chinese scene text recognition method based on crnn model.Firstly,CNN extracts image convolution features,and then RNN performs sequence feature prediction.Bi Gru can effectively suppress gradient disappearance or gradient explosion,dropout can prevent over fitting.Finally,CTC is introduced as a loss function to solve the problem that characters cannot be aligned during training.In this paper,Python is used to implement the algorithm,and the experiment is completed with good effect.
作者 辜双佳 栗智 Gu Shuangjia;Li Zhi(School of Computer Science and Engineering,Chongqing University of Technology,Chongqing 400000;School of Computer Science,Chongqing University,Chongqing 400000)
出处 《科技风》 2021年第17期108-110,共3页
关键词 中文OCR CRNN 免分割 端到端 中国街景 Chinese OCR CRNN No split End-to-End Chinese street scene
  • 相关文献

参考文献5

二级参考文献13

共引文献53

同被引文献35

引证文献4

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部