期刊文献+

基于OCR技术的航天器材料及器件试验数据识别系统 被引量:1

Test Data Identification System for Spacecraft Material and Device Based on OCR Technology
下载PDF
导出
摘要 航天器材料及器件数据库需要海量国内外试验报告数据的支撑,其中表格作为最普遍的数据存储形式含有的数据量最为庞大,然而面对人工识别提取表格数据工作繁琐且易出错的难点,以PDF文档的表格为研究对象,提出基于OCR技术的航天器材料及器件试验数据识别系统;采用了B/S架构,基于EXT、JAVA、Python等技术语言进行开发,系统具备PDF文档转换、表格识别、数据提取、数据编辑等功能;依据系统设计采用版面分析和PDFPlumber表格检测的关键技术和方法以达导准确有效识别PDF文档表格的目的,采用EXT表格控件形式展现提取的数据经试验测试实现了对PDF文档内规整表格的批量识别和数据提取;验证了设计方案的可行性,满足了试验数据试别系统的高识别准确率、快速识别等特点。 The database of spacecraft materials and devices needs the support of massive test reports at home and abroad. As the most common form of data storage, the table contains the largest amount of data. However, faced with the tedious and error-prone work of manual identification and extraction of table data, the table of PDF document is taken as the research object. A data identification system of spacecraft material and device test based on OCR technology is proposed. Using B/S architecture, based on the developments of EXT, JAVA, Python and other technical languages, the system has the functions such as PDF document conversion, form recognition, data extraction, data editing;According to the system design, the key technologies and methods for the layout analysis and PDFPlumber form inspection are used to identify PDF document forms accurately and effectively. The extracted data are displayed in the EXT form control. The batch identification and data extraction of regular forms in PDF documents are realized through the test. The feasibility of the design scheme is verified to meet the high accuracy and fast recognition of the system.
作者 陆俊杰 魏亚东 李晓峰 王成 李洪普 李锋 LU Junjie;WEI Yadong;Li Xiaofeng;WANG Cheng;LI Hongpu;LI Feng(Wuxi Orient Software Technology Co.,Ltd.,Wuxi 214000,China;School of Materials Science and Engineering,Harbin Institute of Technology,Harbing 150001,China;China Ship Scientific Research Center,Wuxi 214000,China)
出处 《计算机测量与控制》 2023年第1期282-288,293,共8页 Computer Measurement &Control
关键词 航天器材料与器件 数据识别系统 OCR PDF文档 表格识别 spacecraft materials and devices data identification system OCR PDF form recognition
  • 相关文献

参考文献10

二级参考文献47

共引文献57

同被引文献15

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部