Calligraphy Character Recognition Method Driven by Stacked Model

Original title: 叠层模型驱动的书法文字识别方法研究 (cited by: 1)

Abstract: Calligraphy character recognition based on two-dimensional images uses computer vision to recognize images of single calligraphy characters, and has important applications in ancient book research and cultural dissemination. The field has made considerable progress but still faces many challenges: complex and highly variable glyph shapes cause recognition errors, Chinese contains many similar-looking characters, and the number of Chinese character classes is larger than in other writing systems. As a result, calligraphy character images generally exhibit large intra-class differences and small inter-class differences. To address these problems, we propose a stacked-model driven character recognition method (SDCR), which improves on existing single classification models through data preprocessing, a node separation strategy, and a stacked model. Characters that belong to the same class but are written in different font styles are subdivided a second time according to font type. To address the small inter-class differences, similar-looking characters are divided into subsets according to the recognition confidence of the calligraphy training-set images, nested models receive enhanced training on these subsets, and in the testing stage the stacked model performs a secondary recognition of similar-looking characters to improve their recognition accuracy. To verify the robustness of the method, we train and test on the self-generated SCUT_Calligraphy dataset and on the public CASIA-HWDB 1.1 and CASIA-AHCDB datasets. The experimental results show that the method considerably improves recognition accuracy on all three datasets: test accuracy reaches 96.33%, 99.51%, and 99.90% on CASIA-HWDB 1.1, CASIA-AHCDB, and SCUT_Calligraphy, respectively, which demonstrates the effectiveness of the method.
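The abstract does not give the exact rule used to form the similar-character subsets or to route test samples, so the following Python sketch is only one plausible reading of the two ideas it names: grouping classes by training-set recognition confidence, then re-recognizing with a nested model at test time. The function names, the 0.9 confidence threshold, and the union-find grouping are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def build_confusion_subsets(probs, labels, threshold=0.9):
    """Group classes that the primary model confuses on the training set.

    probs:  (n_samples, n_classes) class probabilities from the primary model.
    labels: (n_samples,) true class ids.
    threshold is a hypothetical confidence cut-off, not a value from the paper.
    Returns a list of tuples of class ids, one tuple per confusable subset.
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    n_classes = probs.shape[1]
    parent = list(range(n_classes))  # union-find forest over class ids

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i in range(len(labels)):
        y = int(labels[i])
        order = np.argsort(probs[i])[::-1]  # classes by descending confidence
        if order[0] != y:
            rival = int(order[0])   # misclassified: link to the predicted class
        elif probs[i, y] < threshold:
            rival = int(order[1])   # correct but unsure: link to the runner-up
        else:
            continue                # confident and correct: nothing to link
        parent[find(y)] = find(rival)

    groups = {}
    for c in range(n_classes):
        groups.setdefault(find(c), []).append(c)
    return [tuple(g) for g in groups.values() if len(g) > 1]


def stacked_predict(x, primary, nested, subsets):
    """Two-stage prediction: primary model first, nested model on confusable classes.

    primary: callable, x -> probability vector over all classes.
    nested:  dict mapping a subset tuple -> callable, x -> probability vector
             over the classes of that subset (in tuple order).
    """
    first = int(np.argmax(primary(x)))
    for subset in subsets:
        if first in subset:
            second = int(np.argmax(nested[subset](x)))
            return subset[second]   # map the local index back to a global class id
    return first                    # class is not in any confusable subset
```

In this sketch a test sample is only passed to a nested model when the primary prediction falls inside a confusable subset, so characters that the primary model already separates well incur no extra cost.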

Authors: MA Si-Liang (麻斯亮); XU Yong (许勇) (School of Computer Science and Engineering, South China University of Technology, Guangzhou 510006; Pengcheng Laboratory, Shenzhen 518000)

Affiliations: School of Computer Science and Engineering, South China University of Technology; Pengcheng Laboratory

Source: Acta Automatica Sinica (《自动化学报》), 2024, No. 5, pp. 947-957 (11 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

Funding: Supported by the National Natural Science Foundation of China (62072188).

Keywords: calligraphy character recognition; model driven; node separation; stacked model; precision learning

Classification number: TP391.41 [Automation and Computer Technology: Computer Application Technology]

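Citation network

Co-cited references (4)

1. Li Wenying, Cao Bin, Cao Chunshui, Huang Yongzhen. 一种基于深度学习的青铜器铭文识别方法 [A deep-learning-based method for recognizing bronze inscriptions] [J]. Acta Automatica Sinica, 2018, 44(11): 2023-2030. Cited by: 24
2. Tai-Ling Yuan, Zhe Zhu, Kun Xu, Cheng-Jun Li, Tai-Jiang Mu, Shi-Min Hu. A Large Chinese Text Dataset in the Wild [J]. Journal of Computer Science & Technology, 2019, 34(3): 509-521. Cited by: 10
3. Zhang Yikang, Zhang Heng, Liu Yongge, Liu Chenglin. 基于跨模态深度度量学习的甲骨文字识别 [Oracle bone character recognition based on cross-modal deep metric learning] [J]. Acta Automatica Sinica, 2021, 47(4): 791-800. Cited by: 9
4. Yang Chun, Liu Chang, Fang Zhiyu, Han Zheng, Liu Chenglin, Yin Xucheng. 开放集文字识别技术 [Open-set text recognition technology] [J]. Journal of Image and Graphics, 2023, 28(6): 1767-1791. Cited by: 3

Citing references (1)

1. Liu Chang, Yang Chun, Yin Xucheng. 基于文字局部结构相似度量的开放集文字识别方法 [An open-set text recognition method based on a similarity measure of local character structure] [J]. Acta Automatica Sinica, 2024, 50(10): 1977-1987.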