Abstract
Objective In most deep-learning algorithms for single image super-resolution reconstruction, networks use a single-scale convolution kernel (e.g., 3×3) to extract features, often ignoring the different receptive-field sizes produced by different kernel sizes. Receptive fields of different sizes make the network attend to features at different levels, so using only a single-scale kernel causes the network to overlook the macroscopic relations between feature maps. To address this problem, this paper proposes a multi-level perception residual convolutional network (MLP-Net) for single-image super-resolution reconstruction. Method A feature extraction module extracts low-frequency image features as the input, which is then processed by multiple densely connected multi-level perception modules. Each multi-level perception module is divided into shallow multi-level feature extraction and deep multi-level feature extraction, so that the network attends to both low-level and high-level image features while preserving the macroscopic relations between features. Result Using the objective metrics peak signal-to-noise ratio (PSNR) and structural similarity (SSIM), the proposed algorithm is compared with other super-resolution algorithms. On four benchmark test sets (Set5, Set14, Urban100, and BSD100 (Berkeley Segmentation Dataset)) at a scale factor of 2, the average PSNRs are 37.8511 dB, 33.9338 dB, 32.2191 dB, and 32.1489 dB, respectively, all higher than those of the other algorithms. Conclusion The proposed convolutional network uses multi-scale convolution to fully extract the different levels within hierarchical features, and exploits the structural information of the low-resolution image itself to complete the reconstruction, achieving good results.
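The abstract reports PSNR in decibels as its objective quality metric. As a minimal illustration (not the paper's code), PSNR can be computed from the mean squared error between a reference image and a reconstruction; the function name `psnr` and the toy 4×4 images are hypothetical:

```python
import numpy as np

def psnr(reference, reconstructed, max_val=255.0):
    """Peak signal-to-noise ratio in dB: 10*log10(MAX^2 / MSE)."""
    mse = np.mean((reference.astype(np.float64) -
                   reconstructed.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_val ** 2 / mse)

# Toy example: one pixel off by 10 in a 4x4 image -> MSE = 100/16 = 6.25
ref = np.full((4, 4), 128, dtype=np.uint8)
rec = ref.copy()
rec[0, 0] = 138
print(round(psnr(ref, rec), 2))  # → 40.17
```

Higher PSNR indicates a reconstruction closer to the ground-truth high-resolution image, which is why the benchmark results below are reported in dB.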
Objective Single image super-resolution reconstruction (SISR) is a classic problem in computer vision. SISR aims to reconstruct a high-resolution image from one or more low-resolution (LR) images. Currently, image super-resolution (SR) technology is widely used in medical imaging, satellite remote sensing, video surveillance, and other fields. However, SR is an inherently complex and ill-posed problem. Many SISR methods have been proposed to solve it, including interpolation-based and reconstruction-based methods, but at large amplification factors their performance drops sharply and the reconstructed results are poor. With the rise of deep learning, deep convolutional neural networks have also been applied to this problem, and researchers have proposed a series of models that achieved significant progress. As understanding of deep learning techniques has deepened, researchers have found that deeper networks bring better results than shallow ones, yet excessively deep networks can cause gradients to explode or vanish, making the model untrainable and preventing it from reaching its best results. In recent years, most deep-learning networks for single-image SR reconstruction have adopted single-scale convolution kernels, typically a 3×3 kernel for feature extraction. Although single-scale kernels can extract much detailed information, these algorithms usually ignore the different receptive-field sizes caused by different kernel sizes. Receptive fields of different sizes make the network pay attention to different features; therefore, using only a 3×3 kernel causes the network to ignore the macroscopic relations between feature maps. Considering these problems, this study proposes a multi-level perception network based on GoogLeNet, residual networks, and dense convolutional networks. Method
First, the feature extraction module, which consists of two 3×3 convolution layers, extracts low-frequency image features from the input; its output is fed to multiple densely connected multi-level perception modules. Each multi-level perception module is built from 3×3 and 5×5 convolution kernels: the 3×3 kernel is responsible for extracting detailed feature information, and the 5×5 kernel for extracting global feature information. Second, the multi-level perception module is divided into shallow multi-level feature extraction, deep multi-level feature extraction, and a tandem compression unit. The shallow multi-level feature extraction is composed of 3×3 chain convolution and 5×5 chain convolution; the former extracts fine local feature information in shallow features, whereas the latter extracts global features in shallow features. The deep multi-level feature extraction is likewise composed of 3×3 and 5×5 chain convolutions, which extract fine local feature information and global feature information in deep features, respectively. In the tandem compression unit, the global feature information in shallow features, the fine local feature information in deep features, the global information in deep features, and the initial input are concatenated and then compressed to the same dimension as the input image. In this way, both low-level and high-level image features are captured, and the macroscopic relations between features are preserved. Finally, the reconstruction module obtains the final output by combining the upscaled image with the residual image. This study adopts the DIV2K dataset, which consists of 800 high-definition images, each with roughly 2 million pixels. To make full use of these data, each image is randomly rotated by 90°, 180°, or 270° and horizontally flipped. Result The reconstructed results are evaluated with the peak signal-to-noise ratio (PSNR) and the structural similarity index and compared with several state-of-the-art SR reconstruction methods. At a scale factor of 2, the PSNRs of the proposed algorithm on four benchmark test sets (Set5, Set14, Berkeley Segmentation Dataset (BSD100), and Urban100) are 37.8511 dB, 33.9338 dB, 32.2191 dB, and 32.1489 dB, respectively, all higher than those of the other methods. Conclusion Compared with other algorithms, the proposed convolutional network model better accounts for the receptive field and fully extracts different levels of hierarchical features through multi-scale convolution. At the same time, the model uses the structural information of the LR image itself to complete the reconstruction, and good reconstructed results can be obtained with it.
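The data augmentation described in the Method section (random rotation by 90°, 180°, or 270° plus random horizontal flipping) can be sketched as follows. This is an illustrative implementation under stated assumptions, not the authors' training pipeline; the function name `augment` is hypothetical:

```python
import numpy as np

def augment(img, rng):
    """Randomly rotate an image by a multiple of 90 degrees and
    randomly flip it horizontally, as in the paper's augmentation."""
    k = rng.integers(0, 4)      # 0, 1, 2, or 3 quarter-turns
    out = np.rot90(img, k)
    if rng.integers(0, 2):      # horizontal flip with probability 0.5
        out = np.fliplr(out)
    return out

rng = np.random.default_rng(0)
patch = np.arange(16).reshape(4, 4)  # toy 4x4 "image patch"
aug = augment(patch, rng)
```

Because rotations and flips are lossless, each training image yields up to eight distinct views, effectively enlarging the 800-image DIV2K training set without introducing interpolation artifacts.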
Authors
He Lei
Cheng Jiahao
Zhan Zhiyu
Yang Wenbo
Liu Peiran
He Lei; Cheng Jiahao; Zhan Zhiyu; Yang Wenbo; Liu Peiran (Hefei University of Technology, Hefei 230601, China)
Source
《中国图象图形学报》
CSCD
Peking University Core Journals (北大核心)
2021, No. 4, pp. 776-786 (11 pages)
Journal of Image and Graphics
Funding
National Natural Science Foundation of China (61502141).