Multi-scale Image Blind Deblurring Network for Dynamic Scenes

Cited by: 3

Abstract: In recent years, single-image dynamic scene blind deblurring (SIDSBD) methods based on convolutional neural networks (CNNs) have made significant progress. Their success stems mainly from three design choices: the multi-scale or multi-patch model, the encoder-decoder architecture, and the residual block structure. Building on these, this paper proposes a novel multi-scale convolutional neural network (MSCNN) that further exploits the advantages of all three to achieve higher-quality dynamic scene blind deblurring. First, inspired by spatial pyramid pooling (SPP) and the multi-patch model, a hierarchical multi-patch channel attention (HMPCA) mechanism is proposed, which adaptively assigns channel-wise weights to feature maps using both their global and local feature statistics. Because it uses local information, HMPCA can be regarded as enlarging the receptive field in the channel direction, which in turn enhances the representational ability of the network. Second, unlike existing multi-scale models, a new multi-scale model is developed in which each scale consists of multiple encoders and multiple decoders. Because of HMPCA, the encoders and decoders within the same scale are not identical, so the proposed model can be regarded as increasing the depth of the encoder-decoder architecture; this improves the deblurring performance at each scale and ultimately yields higher-quality blind deblurring for dynamic scenes. Extensive experiments show that, compared with several successful SIDSBD methods of recent years, the proposed method restores higher-quality deblurred images, with significant improvements in both objective evaluation metrics and subjective visual quality.
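The abstract describes HMPCA only at a high level: channel-wise weights derived from global and local feature statistics, in the spirit of spatial pyramid pooling. The following NumPy sketch illustrates one way such a mechanism could look; it is not the authors' implementation. The pyramid grids (1, 2, 4), the use of patch means as the statistic, the two-layer bottleneck with a sigmoid gate, and the names `patch_means`, `hmpca_sketch`, `w1`, and `w2` are all assumptions made for illustration.

```python
import numpy as np


def patch_means(x, grid):
    """Average each channel over a grid x grid partition of the spatial plane.

    x has shape (C, H, W); H and W must be divisible by grid.
    Returns an array of shape (C, grid * grid).
    """
    c, h, w = x.shape
    if h % grid or w % grid:
        raise ValueError("H and W must be divisible by grid")
    ph, pw = h // grid, w // grid
    # Split H into (grid, ph) and W into (grid, pw), then average inside each patch.
    patches = x.reshape(c, grid, ph, grid, pw)
    return patches.mean(axis=(2, 4)).reshape(c, grid * grid)


def hmpca_sketch(x, w1, w2, grids=(1, 2, 4)):
    """Hypothetical hierarchical multi-patch channel attention.

    grid=1 yields the global statistic of each channel; finer grids yield
    local statistics. The fusion below (ReLU bottleneck + sigmoid gate) is an
    assumed design, not taken from the paper.
    """
    stats = np.concatenate([patch_means(x, g) for g in grids], axis=1)
    hidden = np.maximum(w1 @ stats.reshape(-1), 0.0)   # ReLU bottleneck
    weights = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))     # one weight in (0, 1) per channel
    return x * weights[:, None, None], weights


rng = np.random.default_rng(0)
C, H, W, HIDDEN = 8, 16, 16, 4
n_stats = C * (1 + 4 + 16)                 # statistics per channel across the three levels
x = rng.standard_normal((C, H, W))
w1 = rng.standard_normal((HIDDEN, n_stats)) * 0.1
w2 = rng.standard_normal((C, HIDDEN)) * 0.1
y, weights = hmpca_sketch(x, w1, w2)
print(y.shape, weights.shape)              # (8, 16, 16) (8,)
```

With `grids=(1,)` the sketch reduces to a purely global channel attention; the finer grids are what supply the local statistics that the abstract credits with enlarging the channel-direction receptive field.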

Authors: TANG Shu; WAN Sheng-Dao; XIE Xian-Zhong; YANG Shu-Li; HUANG Rong; GU Jia; ZHENG Wan-Peng (Chongqing Key Laboratory of Computer Network and Communications Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)

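Affiliation: Chongqing Key Laboratory of Computer Network and Communications Technology, Chongqing University of Posts and Telecommunications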

Source: Journal of Software (软件学报), indexed in EI, CSCD, and the Peking University Core Journals list; 2022, No. 9, pp. 3498-3511 (14 pages)

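Funding: National Natural Science Foundation of China (61601070, 61501074); Key Project of Science and Technology Research of the Chongqing Municipal Education Commission (KJZD-K201800603); Major Project of Science and Technology Research of the Chongqing Municipal Education Commission (KJZD-M201900602); Chongqing Basic Research and Frontier Exploration Project (cstc2018jcyjAX0432); General Project of the Chongqing Technology Innovation and Application Development Special Program (cstc2020jscx-msxmX0135).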

Keywords: convolutional neural network (CNN); blind deblurring for dynamic scenes; multi-scale model; channel attention; spatial pyramid pooling (SPP)

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

