Abstract
To address the high computational cost of existing table structure detection methods and their low recognition accuracy, an improved table structure recognition method is proposed. The method optimizes the structure and location alignment network. Residual connections are added to the deeper layers of PPLCNet, a lightweight CPU-oriented convolutional neural network, to strengthen the network's learning capacity. A convolutional block attention module (CBAM) is introduced between feature extraction and feature fusion, improving the model's ability to localize target objects along both the channel and spatial dimensions. In the head, convolutional layers replace fully connected layers so that weights are shared, reducing the model's computational cost. In addition, a Smooth L1 loss function is adopted to regress the four vertex coordinates of each table, mitigating the effect of image distortion on model performance. To evaluate the algorithm, experiments are conducted on the PubTabNet dataset. The results show that the proposed method reaches an accuracy (Acc) of 71.58% and a tree-edit-distance-based similarity (TEDS) of 94.47%. Compared with the model before improvement, accuracy is increased by 2.76% and TEDS by 0.79%, giving better overall performance.
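The vertex-coordinate regression described in the abstract relies on the Smooth L1 loss, which is quadratic for small errors and linear for large ones, so a few badly distorted vertices do not dominate the gradient. A minimal sketch follows; the function name, the `beta` threshold, and the sample coordinates are illustrative assumptions, not taken from the paper:

```python
import numpy as np

def smooth_l1(pred, target, beta=1.0):
    """Element-wise Smooth L1 loss.

    Quadratic (0.5 * d^2 / beta) when |d| < beta, linear (|d| - 0.5 * beta)
    otherwise, which makes vertex regression robust to outlier errors.
    """
    diff = np.abs(pred - target)
    return np.where(diff < beta, 0.5 * diff ** 2 / beta, diff - 0.5 * beta)

# Hypothetical predicted vs. ground-truth (x, y) pairs for a table's
# four corner vertices, flattened to an 8-element vector.
pred = np.array([10.0, 12.0, 200.0, 11.0, 201.0, 150.0, 9.0, 149.0])
gt   = np.array([10.0, 10.0, 200.0, 10.0, 200.0, 150.0, 10.0, 150.0])
loss = smooth_l1(pred, gt).mean()
```

Averaging over the eight coordinates yields a single scalar loss for the vertex-regression branch.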
Authors
Chen Yu; Jiang Sanxin (College of Electronics and Information Engineering, Shanghai University of Electric Power, Shanghai 201306, China)
Source
Foreign Electronic Measurement Technology (《国外电子测量技术》)
Peking University Core Journal (北大核心)
2023, No. 12, pp. 57-62 (6 pages)
Keywords
deep learning
table structure recognition
attention mechanisms
residual network