具有自校正与注意力机制相结合的场景文本检测被引量：1

Scene text detection with self-calibration and attention mechanism

导出

摘要在日常生活中,存在着丰富的文本信息,对这些信息的提取,能够极大地提高人们的生活品质。但自然场景中文本信息表达形式丰富多样,文本形状各异,在检测过程中存在误检、文本区域定位不准问题。针对以上不足,本文提出了一种具有自校正与注意力机制相结合的文本检测方法。首先,在ResNet50骨干网络中嵌入自校正卷积(self-calibrated convolution, SConv)及高效通道注意力(efficient channel attention, ECA),使网络能够校正全局无关信息的干扰,并集中关注于文本区域,提取更加丰富的语义信息;其次,在特征融合后加入协调注意力(coordinate attention, CA),纠正不同尺度的特征图在融合过程中产生的位置偏差。最后,通过修正后的特征图预测得到多个不同尺度的文本实例,采用渐进尺度扩展算法,求出最终检测到的文本实例。实验结果表明,在任意方向数据集ICDAR2015以及弯曲文本数据集Total-Text、SCUT-CTW1500上,相比于改进前的ResNet50综合指标F值分别提升了1.0%、5.2%、5.4%,证明了本方法具有良好的检测能力。 In daily life, there are rich text information, the extraction of such information can greatly improve people′s quality of life.However, there are various forms of text information expression and different text shapes in natural scenes, which result in false detection and inaccurate location of text regions.In order to solve these problems, this paper proposes a text detection method with self-calibration and attention mechanism.Firstly, the self-calibrated convolution(SConv) and efficient channel attention(ECA) are embedded in the backbone of ResNet50 to correct the interference of irrelevant global information and concentrate on the text area to extract more abundant semantic information Secondly, coordinated attention(CA) is added after feature fusion to correct the position deviation of feature map in different scale.Finally, several text instances of different scales are predicted by the modified feature map, and the final detected text instances are obtained by using the progressive scale expansion algorithm.The experimental results show that the comprehensive index F-measure is increased by 1.0%,5.2% and 5.4% respectively compared with the unmodified ResNet50 on the arbitrary direction data set ICDAR2015 and the curved text data set Total-Text and SCUT-CTW1500.It is proved that this method has good detection ability.

作者孙鹏刘粤强观臣熊炜付尧李利荣 SUN Peng;LIU Yue;QIANG Guanchen;XIONG Wei;FU Yao;LI Lirong(School of Electrical and Electronic Engineering,Hubei University of Technology,Wuhan,Hubei 430068,China;Xiangyang Industrial Research Institute,Hubei University of Technology,Xiangyang,Hubei 441003,China;Department of Computer Science and Engineering,University of South Carolina,Columbia,SC 29201,USA)

机构地区湖北工业大学电气与电子工程学院襄阳湖北工业大学产业研究院美国南卡罗来纳大学计算机科学与工程系

出处《光电子．激光》 CAS CSCD 北大核心 2022年第12期1287-1295,共9页 Journal of Optoelectronics·Laser

基金国家自然科学基金(61571182,61601177) 国家留学基金(201808420418) 湖北省自然科学基金(2019CFB530) 湖北省科技厅重大专项(2019ZYYD020) 襄阳湖北工业大学产业研究院科研项目(XYYJ2022C05)资助项目。

关键词自校正卷积(SConv) 高效通道注意力(ECA) 协调注意力(CA) 渐进尺度扩展算法 self-calibrated convolutions(SConv) efficient channel attention(ECA) coordinate attention(CA) progressive scale expansion algorithm

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

同被引文献4

1陈玮,周晴,曹桂涛.基于SVM和阈值分析法的摔倒检测系统[J].计算机应用与软件,2017,34(7):182-187. 被引量：11
2卫少洁,周永霞.一种结合Alphapose和LSTM的人体摔倒检测模型[J].小型微型计算机系统,2019,40(9):1886-1890. 被引量：14
3郝骏,郑紫微,孙滋昂,杨洋.基于Yolo与残差网络算法改进的运动目标检测[J].光电子．激光,2020,31(1):81-88. 被引量：7
4许伟,熊卫华,姚杰,沈云青.基于改进YOLOv3算法在垃圾检测上的应用[J].光电子．激光,2020,31(9):928-938. 被引量：22

引证文献1

1李尧,李金哲,黄刚,周箩鱼.融合注意力机制的轻量级摔倒检测[J].光电子．激光,2024,35(3):283-292.

1娜衣拉·艾沙江,娜迪拉·迪力夏提.心脏彩超在诊断多病因慢性心力衰竭时的临床应用价值研究[J].中文科技期刊数据库（全文版）医药卫生,2022(4):104-107.
2杜伟雄.室内环境空气质量检测研究——以TVOC浓度检测为例[J].中文科技期刊数据库（文摘版）工程技术,2022(8):188-190.
3曲宗希,江志欣,牛春华.面向公众的风险概率信息沟通策略研究[J].图书与情报,2022(2):80-93.

光电子．激光

2022年第12期

浏览历史

内容加载中请稍等...

具有自校正与注意力机制相结合的场景文本检测被引量：1

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史

具有自校正与注意力机制相结合的场景文本检测 被引量：1

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史

具有自校正与注意力机制相结合的场景文本检测被引量：1