期刊文献+

文本核重建与扩展实现任意形状文本检测

Text Kernel Reconstruction and Expansion for Arbitrary Shape Text Detection
下载PDF
导出
摘要 基于分割的方法对自然场景中的文本进行像素级预测,大幅度提升了对任意形状文本的检测效果,但是如何有效分离相邻文本仍然是检测中的难题。目前广泛采用的方法是通过缩小文本注释边界得到文本核来分离相邻文本。然而,网络预测文本核时舍弃了文本核外大部分信息,降低了基于分割的文本检测方法的性能。为了解决这个问题,提出了一种文本核重建算法,将文本核的生成放在后处理阶段,通过网络预测的方向场将文本实例向内收缩形成文本核。同时,提出了一种文本核扩展算法用于将文本核恢复为完整的文本实例。实验表明,所提方法在Total-Text(88.66%)、CTW-1500(87.28%)和MSRA-TD500(90.65%)三个数据集上取得了相似或最好的检测性能。 Segmentation-based methods approaches for pixel-level text prediction in natural scenes have demonstrated significant improvement in the detection of arbitrary shape text.However,the separation of adjacent text remains a chal�lenge in text detection.One common method for addressing this issue involves the use of text kernels,which are obtained by shrinking the annotation boundaries,to separate adjacent instances.While this approach is effective in certain scenarios,it discards a significant amount of information outside the text kernel,which can degrade the performance of segmentation�based text detection methods.To address this limitation,a text kernel reconstruction algorithm is proposed that postpones the generation of text kernels to the post-processing stage.The proposed approach utilizes the direction field predicted by the network to inwardly contract text instances,resulting in the formation of text kernels.Additionally,a text kernel expan�sion algorithm is proposed to restore full text instances from the resulting text kernels.Experiments on the Total-Text,CTW-1500,and MSRA-TD500 datasets show that the proposed method achieves similar or superior detection perfor�mance compared to the state-of-the-art(88.66%,87.28%,and 90.65%respectively).
作者 邓胜军 陈念年 DENG Shengjun;CHEN Niannian(School of Computer Science and Technology,Southwest University of Science and Technology,Mianyang,Sichuan 621010,China)
出处 《计算机工程与应用》 CSCD 北大核心 2024年第9期228-236,共9页 Computer Engineering and Applications
基金 四川省科技厅重点研发项目(2021YFG0031) 四川省省级科研院所科技成果转化项目(22YSZH0021)。
关键词 场景文本检测 任意形状 文本核 scene text detection arbitrary shape text kernel
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部