摘要
针对现有的超分辨率方法难以从模糊的低分辨率图像中重建出清晰的高分辨率图像的问题,提出了一种基于生成式对抗网络(GAN)的文本图像联合超分辨率与去模糊方法。首先,本方法聚焦于严重模糊的低分辨率文本图像,由上采样模块和去模糊模块两部分组成生成器网络;然后,通过上采样模块对输入图像上采样,生成模糊的超分辨率图像;进一步利用去模糊模块重建出清晰的超分辨率图像;最后,为了更好地恢复文本图像,引入了一个联合训练损失,包含超分辨率像素损失与去模糊像素损失、语义层的特征匹配损失以及对抗损失。在合成图像和真实图像上的大量实验结果表明,与现有的先进算法--单类GAN(SCGAN)相比,峰值信噪比(PSNR)、结构相似度(SSIM)和光学字符识别(OCR)精度分别提高了1.52 dB、0.0115和13.2个百分点。所提方法能更好地处理真实场景下的退化文本图像,同时计算成本较低。
Aiming at the difficulty to reconstruct clear high-resolution images from blurred low-resolution images by the existing super-resolution methods,a joint text image joint super-resolution and deblurring method based on Generative Adversarial Network(GAN)was proposed.Firstly,the low-resolution text images with severe blur were focused,and the down-sampling module and the deblurring module were used to generate the generator network.Secondly,the input images were down-sampled by the down-sampling module to generate blurred super-resolution images.Thirdly,the deblurring module was used to reconstruct the clear super-resolution images.Finally,in order to recover the text images better,a joint training loss including super-resolution pixel loss,deblurring pixel loss,semantic layer feature matching loss and adversarial loss was introduced.Extensive experiments on synthetic and real-world images demonstrate that compared with the existing advanced method SCGAN(Single-Class GAN),the proposed method has the Peak Signal-to-Noise Ratio(PSNR),Structural Similarity(SSIM)and OCR(Optical Character Recognition)accuracy improved by 1.52 dB,0.0115 and 13.2 percentage points respectively.The proposed method can better deal with degraded text images in real scenes with low computational cost.
作者
陈赛健
朱远平
CHEN Saijian;ZHU Yuanping(College of Computer and Information Engineering,Tianjin Normal University,Tianjin 300387,China)
出处
《计算机应用》
CSCD
北大核心
2020年第3期859-864,共6页
journal of Computer Applications
基金
国家自然科学基金资助项目(61602345,61703306)
天津自然科学基金资助项目(18JCYBJC85000,16JCQNJC00600)~~
关键词
超分辨率
去模糊
生成对抗网络
残差学习
文本图像
super-resolution
deblurring
Generative Adversarial Network(GAN)
residual learning
text image