期刊文献+

基于Faster-RCNN的水书古籍手写文字的检测与识别 被引量:3

Detection and recognition of handwritten characters in Shuishu ancient books based on Faster-RCNN
下载PDF
导出
摘要 中国非物质文化遗产水书文化面临失传威胁,近年大量深度学习的方法用于手写古籍文字的识别.但水书古籍文字识别面临数据集建立和标注困难、样本不平衡等问题,研究进展不大,且鲜少进行水书古籍页面级的文字检测与识别.首先建立了一个较大规模的水书手写文字数据集,通过几种数据扩增方式,获得包含80个文字类别,共110610个带标签的字符样本.将Faster-RCNN(faster-region based convolutional neural network)算法应用到水书古籍文字识别研究上,以不同组合的数据集作为输入进行实验,在全部80个目标类别上获得了91.95%的平均识别率,实现了页面级的端到端的水书古籍文字的准确定位与识别.实验结果表明,Faster-RCNN模型在目前的数据集上能很好地实现水书手写文字的检测与识别,文中采用的数据扩增方式能明显提升水书手写文字的识别率,为水书文化的保护和传承提供了新思路,对于解决实际应用场景中的水书文字识别问题具有重要意义. The Shuishu culture of China's intangible cultural heritage is facing the threat of loss.In recent years,a large number of deep learning methods have been used to recognize characters in handwritten ancient books.However,the character recognition of Shuishu ancient books faces problems such as dataset establishment and labeling,sample imbalance,etc.,and there is not much research progress.In this paper,a large scale Shuishu handwritten character dataset is firstly established,discussed several data augmentation methods,and obtained 110610 labeled character samples containing 80 categories.Faster-RCNN(fast region-based convolutional neural network)was applied to the recognition of Shuishu handwritten characters.Experiments were carried out with different combinations of datasets as input,and an average recognition rate of 91.95%was obtained on all 80 target classes,achieving end-to-end accurate localisation and recognition of Shuishu handwritten characters at the page level.The experimental results show that the Faster-RCNN can achieve good detection and recognition of Shuishu handwritten characters on the current dataset,and the data augmentation method used in this paper can significantly improve the recognition rate of Shuishu handwritten characters.This paper provides new ideas for the preservation and transmission of Shuishu culture,and is important for solving the problem of Shuishu character recognition in practical application scenarios.
作者 汤敏丽 谢少敏 刘向荣 TANG Minli;XIE Shaomin;LIU Xiangrong(School of Informatics,Xiamen University,Xiamen 361005,China;School of Big Data Engineering,Kaili University,Kaili 556011,China;Key Laboratory of Digital Protection and Intelligent Processing of Intangible Cultural Heritage of Fujian and Taiwan,Ministry of Culture and Tourism,Xiamen University,Xiamen 361005,China)
出处 《厦门大学学报(自然科学版)》 CAS CSCD 北大核心 2022年第2期272-277,共6页 Journal of Xiamen University:Natural Science
关键词 水书 手写中文文字识别 Faster-RCNN 页面级文字识别 数据扩增 Shuishu handwritten Chinese character recognition Faster-RCNN page level character recognition data augmentation
  • 相关文献

参考文献7

二级参考文献30

共引文献119

同被引文献23

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部