期刊文献+

基于Transformer和多尺度CNN的图像去模糊 被引量:1

Image Deblurring Based on Transformer and Multi-scale CNN
下载PDF
导出
摘要 卷积神经网络(CNN)单独应用于图像去模糊时感受野受限,Transformer能有效缓解这一问题但计算复杂度随输入图像空间分辨率的增加呈2次方增长。为此,提出一种基于Transformer和多尺度CNN的图像去模糊网络(T-MIMO-UNet)。利用多尺度CNN提取空间特征,并嵌入Transformer全局特性捕获远程像素信息。设计局部增强Transformer模块、局部多头自注意力计算网络和增强前馈网络,采用窗口的方式进行局部逐块多头自注意力计算,通过增加深度可分离卷积层,加强不同窗口之间的信息交互。在GoPro测试数据集上的实验结果表明,T-MIMO-UNet的峰值信噪比相比于MIMO-UNet、DeepDeblur、DeblurGAN、SRN网络分别提升了0.39 dB、2.89 dB、3.42 dB、1.86 dB,参数量相比于MPRNet减少了1/2,能有效解决动态场景下的图像模糊问题。 Convolutional Neural Network(CNN)has limitations when applied solely to image deblurring tasks with restricted receptive fields.Transformer can effectively mitigate these limitations.However,the computational complexity increases quadratically as the spatial resolution of the input image increases.Therefore,this study proposes an image deblurring network based on Transformer and multi-scale CNN called T-MIMO-UNet.The multi-scale CNN is used to extract spatial features while the global feature of the Transformer is employed to capture remote pixel information.The local enhanced Transformer module,local Multi-Head Self-Attention(MHSA)computing network,and Enhanced Feed-Forward Network(EFFN)are designed.The block-by-block MHSA computation is performed using a windowing approach.The information interaction between different windows is enhanced by increasing the depth of the separable convolution layer.The results of the experiment conducted using the GoPro test dataset demonstrate that the Peak Signal-to-Noise Ratio(PSNR)of the T-MIMO-UNet increases by 0.39 dB,2.89 dB,3.42 dB,and 1.86 dB compared to the MIMO-UNet,DeepDeblur,DeblurGAN,and SRN networks,respectively.Additionally,the number of parameters is reduced by 1/2 compared to MPRNet.These findings prove that the T-MIMO-UNet effectively addresses the challenge of image blurring in dynamic scenes.
作者 李现国 李滨 LI Xianguo;LI Bin(School of Electronics and Information Engineering,Tiangong University,Tianjin 300387,China;Tianjin Key Laboratory of Photoelectric Detection Technology and System,Tianjin 300387,China)
出处 《计算机工程》 CAS CSCD 北大核心 2023年第9期226-233,245,共9页 Computer Engineering
基金 天津市重点研发计划科技支撑重点项目(18YFZCGX00930)。
关键词 图像去模糊 多尺度卷积神经网络 Transformer编码器 多头自注意力 增强前馈网络 image deblurring multi-scale Convolutional Neural Network(CNN) Transformer encoder Multi-Head Self-Attention(MHSA) Enhanced Feed-Forward Network(EFFN)
  • 相关文献

参考文献2

二级参考文献13

  • 1Fergus R, Singh B, Hertzmann A, et al. Removing Camera Shake from a Single Photograph[C]//Proc. of ACM SIGGRAPH'06. Boston, USA: ACM Press, 2006.
  • 2Shah Qi, Jia Jiaya, Agarwala A. High-quality Motion Deblurring l?om a Single lmage[C]//Proceedings of ACM SIGGRAPH'08. Los Angeles, USA: ACM Press, 2008.
  • 3Cai Jianfeng, Ji Hui, Liu Chaoqiang, et al. Blind Motion Deblurring from a Single Image Using Sparse Approxi- mation[C]//Proceedings of CVPR'09. Miami Beach, USA: IEEE Press, 2009.
  • 4Joshi N, Szeliski R, Kriegman D J. PSF Estimation Using Sharp Edge Prediction[C]//Proceedings of CVPR'08. Anchorage, USA:IEEE Press, 2008.
  • 5Cho S, Lee S. Fast Motion Deblurring[J]. ACM Transactions on Graphics, 2009, 28(5).
  • 6Joshi N. Enhancing Photographs Using Content-specific Image Priors[D]. San Diego, USA: University of California, 2008.
  • 7Xu Li, Jia Jiaya. Two-phase Kernel Estimation for Robust Motion Deblurring[C]//Proceedings of ECCV'10. Crete, Greece: [s. n.], 2010.
  • 8Yuan Lu, Sun Jian, Quan Long, et al. Progressive Inter-scale and lntra-scale Non-blind Image Deconvolution[J]. ACM Trans. On Graphics, 2008, 27(5).
  • 9Hong Hanyu, Park I K. Single-image Motion Deblurring Using Adaptive Anisotropic Regularization[J]. Optical Engineering, 2010, 49(9).
  • 10Tomasi C, Manduchi R. Bilateral Filtering for Gray and Color Images[C]//Proceedings of International Conference on Computer Vision. Bombay,. India: IEEE Press, 1998.

共引文献10

同被引文献2

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部