期刊文献+

融合字符结构特征的多任务老挝语文字识别研究

A Multi-task Approach to Lao Character Recognition with Structural Features
下载PDF
导出
摘要 老挝语属于资源稀缺型语言,直接从互联网中获取老挝语文本语料较为困难,老挝语文字识别研究可在有限的图片文本资源中获取更多的老挝语文本语料。在开展老挝文字光学字符识别的研究工作中,针对老挝单字符误切分、上/下位元音以及音调识别位置存在偏差和相似老挝字符的识别问题,该文研究了老挝字符书写等级和下位辅音,提出一种有效融合老挝字符结构特征的多任务字符识别方法。首先,利用深度残差网络提取字符图片中的老挝字符结构特征,通过边框回归矫正单字符包围框;其次,将已矫正切分结果和提取的字符特征作为联合输入,通过双向长短时记忆网络预测老挝字符序列,利用连接主义时间分类对预测结果进行序列对齐;最后,根据老挝字符固定组合优化模型预测结果。实验结果表明:该方法可以精确识别已切分的老挝字符序列,字符错误率指标低至13.06%。 Focused on the Optical Character Recognition of Lao script,this paper investigates the problems of Lao characters mis-segmentation,the misperception of hypernym/hyponym vowels and tone,and the confusion of simi-lar Lao characters.According to the writing scheme and the hypo consonant of Lao characters,this paper proposes a multitasking character recognition to effectively integrate the structural features of Lao characters.The model ex-tracts the structural features of Lao characters from character pictures via Deep Residual Network,and corrects the single character bounding box through Bounding Box Regression.Then,the corrected segmentation results and ex-tracted character features are input jointly into Bi-directional Long-Short Term Memory network to identify the Lao character sequence,and the sequence alignment is completed by the Connectionist Temporal Classification.Finally,the result is predicted by the fixed combinatorial optimization model of Lao characters.The experimental result shows the method can reduce the Character Error Rate to 13.06%.
作者 陈琢 周兰江 郝永彬 张建安 CHEN Zhuo;ZHOU Lanjiang;HAO Yongbin;ZHANG Jian'an(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming,Yunnan 650500,China;Kunming Brarch,No.3 College,PLA Information Engineering University,Kunming,Yunnan 650500,China)
出处 《中文信息学报》 CSCD 北大核心 2023年第4期34-44,共11页 Journal of Chinese Information Processing
基金 国家自然科学基金(61662040)。
关键词 老挝印刷字符识别 老挝字符结构特征 多任务识别 端到端模型 Lao printed characters'recognition Lao characters structural features multi-task recognition end-to-end model
  • 相关文献

参考文献2

二级参考文献9

  • 1刘群,张华平,俞鸿魁,程学旗.基于层叠隐马模型的汉语词法分析[J].计算机研究与发展,2004,41(8):1421-1429. 被引量:197
  • 2Shaopint Ma, Yijiang Jin, Zhe Jiang, et al. A Method of Printing Tibetan Character Recognition [C].Proceedings of the 4th World Congress on Intelligent Control and Automation, Shanghai, China,2002,3: 2304-2307.
  • 3Kato N,Suzuki M,(hnachi S, et al. A Handwritten Character Recognition System using Directional Element Feature and Asymmetric Mahalanobis Distance [J]. IEEE Tram on PAMI, 1999, 21(3):258 -262.
  • 4K. Fukunaga, Introduction to Statistical Pattern Recognition (2nd Edition) [M]. Academic Press,New York, 1990: 460 - 465.
  • 5F. Kimum, K. Takashina, S. Tsuruoka, Modified quadratic Diseriminant Functions and the Application to Chinese Character Recognition [ J ]. IEEE Trans on PAMI, 1987, 9 (1) : 149 - 153.
  • 6X. Lin, Xiaoqing Ding, Ming Chen, et al. Adaptive Confidence Transform Based Classifier Combination for Chinese Character Recognition [J]. Pattern Recognition Letters, 1998, 19 (10): 975-988.
  • 7马少平,夏莹,朱小燕.基于模糊方向线素特征的手写体汉字识别[J].清华大学学报(自然科学版),1997,37(3):42-45. 被引量:37
  • 8王维兰.藏文基本字符识别算法研究[J].西北民族学院学报(自然科学版),1999,20(3):20-23. 被引量:16
  • 9张洪刚,李焕.基于双向长短时记忆模型的中文分词方法[J].华南理工大学学报(自然科学版),2017,45(3):61-67. 被引量:12

共引文献32

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部