期刊文献+

基于CNN和LSTM的自然场景文本检测应用 被引量:1

Application of the Natural Scene Text Detection Based on CNN and LSTM
下载PDF
导出
摘要 针对自然场景的文本检测,构建了一种基于卷积神经网络(CNN)和长短时记忆网络(LSTM)的自然场景文本识别框架,运用CNN网络对图像中的静态特征进行提取,LSTM提取上下文特征信息。在解码上,提出了一种混合的CTC-Attention机制对输出层的编码进行解码。 Aiming at the text detection in natural scenes,a text recognition framework based on CNN and LSTM network is constructed.The static features in images are extracted by CNN network,and the context features are extracted by LSTM.In decoding,a hybrid CTC-Attention mechanism is proposed to decode the encoding at the output layer.
作者 王雪娇 张超敏 WANG Xuejiao;ZHANG Chaomin(Jiangsu United Vocational and Technical College,Wuxi Mechanical and Electrical Branch,Wuxi 214400,China)
出处 《仪表技术》 2020年第9期17-23,45,共8页 Instrumentation Technology
关键词 文本检测 卷积神经网络 循环神经网络(RNN) text detection convolutional neural network recurrent neural network(RNN)
  • 相关文献

参考文献4

二级参考文献20

  • 1欧文武,朱军民,刘昌平.自然场景文本定位[J].中文信息学报,2004,18(5):42-47. 被引量:17
  • 2谢毓湘,栾悉道,吴玲达,老松杨.新闻视频帧中的字幕探测[J].计算机工程,2004,30(20):167-168. 被引量:15
  • 3JUNG K,KIM K I,JAIN A K.Text Information Extraction in Images and Video:A Survey[J].Pattern Recognition.2004,37:977-997
  • 4LI H,DOERMANN D,KIA O.Automatic text detection and tracking in digital video.IEEE Trans.Image Process,2000,9 (1):147-156.
  • 5ZHONG Yu,ZHANG Hongjiang,JIAN A K.Automatic Caption Localization in Compressed Video[J].Ieee Transactions on pattern Analysis and Achine intelligence,2000,22(4):385-392.
  • 6LIENHART R,WERNICKE A.Localizing and Segmenting Text in Images and Videos[J].IEEE Transactions on Circuits and Systems for Video Technology,2000,12(4):256 -268.
  • 7JEONG K Y,JUNG K,KIM E Y,et al.Neural Network-Based Text Location for News Video Indexing[J].IEEE Transactions on Information Theory,1998,44(5):319 -323.
  • 8Lienhart R. , Effelsberg W. , Automatic text segmentation and text recognition for video indexing [J]. Multimedia System, 2000,8(1):69-81.
  • 9X.-C. Yin,X.-W. Yin,K.-Z. Huang,H.-W. Hao."Robust text detection in natural scene images,". IEEE Transactions on Pattern Analysis and Machine Intelligence . 2013
  • 10Huang W,Lin Z,Yang J. et al.Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors. Proceedings of the 14th IEEE International Conference on Computer Vision . 2013

共引文献19

同被引文献9

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部