基于卷积字典扩散模型的眼底图像增强算法

Fundus image enhancement algorithm based on convolutional dictionary diffusion model

导出

摘要目的视网膜眼底图像广泛用于临床筛查和诊断眼科疾病,但由于散焦、光线条件不佳等引起的眼底图像模糊,导致医生无法正确诊断,且现有图像增强方法恢复的图像仍存在模糊、高频信息缺失以及噪点增多问题。本文提出了一个卷积字典扩散模型,将卷积字典学习的去噪能力与条件扩散模型的灵活性相结合,从而解决了上述问题。方法算法主要包括两个过程:扩散过程和去噪过程。首先向输入图像中逐步添加随机噪声,得到趋于纯粹噪声的图像;然后训练一个神经网络逐渐将噪声从图像中移除,直到获得一幅清晰图像。本文利用卷积网络来实现卷积字典学习并获取图像稀疏表示,该算法充分利用图像的先验信息,有效避免重建图像高频信息缺失和噪点增多的问题。结果将本文模型在EyePACS数据集上进行训练,并分别在合成数据集DRIVE(dgital retinal images for vessel extraction)、CHASEDB1(child heart and health study in England)、ROC(retinopathy online challenge)和真实数据集RF(real fundus)、HRF(high-resolution fundus)上进行测试,验证了所提方法在图像增强任务上的性能及跨数据集的泛化能力,其评价指标峰值信噪比(peak signal-to-noise ratio,PSNR)和学习感知图像块相似度(learned perceptual image patch similarity,LPIPS)与原始扩散模型(learning enhancement from degradation,Led)相比平均分别提升了1.9929 dB和0.0289。此外,将本文方法用于真实眼科图像下游任务的前处理能够有效提升下游任务的表现,在含有分割标签的DRIVE数据集上进行的视网膜血管分割实验结果显示,相较于原始扩散模型,其分割指标对比其受试者工作特征曲线下面积(area under the curve,AUC),准确率(accuracy,Acc)和敏感性(sensitivity,Sen)平均分别提升0.0314,0.0030和0.0738。结论提出的方法能够在保留真实眼底特征的同时去除模糊、恢复更丰富的细节,从而有利于临床图像的分析和应用。 Objective Retinal fundus images have important clinical applications in ophthalmology.These images can be used to screen and diagnose various ophthalmic diseases,such as diabetic retinopathy,macular degeneration,and glaucoma.However,the acquisition of these images is often affected by various factors in real scenarios,including lens defo⁃cus,poor ambient light conditions,patient eye movements,and camera performance.These issues often lead to quality problems such as blurriness,unclear details,and inevitable noise in fundus images.Such poor-quality images pose a chal⁃lenge to ophthalmologists in their diagnostic work.For example,blurred images will lead to the absence of detailed informa⁃tion about the morphological structure of the retina,which causes difficulty for the physicians to accurately localize and identify abnormalities,lesions,exudations,and other conditions.Existing enhancement methods for fundus images have progressed in improving image quality.However,some problems still exist,such as image blurring,artifacts,missing high-frequency information,and increased noise.Therefore,in this study,we propose a convolutional dictionary diffusion model,which combines convolutional dictionary learning with conditional diffusion model.This algorithm aims to cope with the abovementioned problems of low-quality images to provide an effective tool for fundus image enhancement.Our approach can improve the quality of fundus images and enable physicians to increase diagnostic confidence,improve assessment accuracy,monitor treatment progress,and ensure better care for patients.This method will contribute to oph⁃thalmic research and provide more opportunities for prospective healthcare management and medical intervention,which positively impacts patients’ocular health and overall quality of life.Method The algorithm consists of two parts:simula⁃tion of diffusion process and inverse denoising process.First,random noise is gradually added to the input image to obtain a purely noisy image.Then,a neural network is trained to gradually remove the noise from the image until a clear image is finally obtained.This study takes the blurred fundus image as the conditional information to better preserve the fine-grained structure of the image.Collecting blurred-clear fundus image pairs is difficult.Thus,synthetic fundus dataset is widely used for training.Therefore,a Gaussian filtering algorithm is designed to simulate the defocus blur images.In the training process,the conditional information and the noisy image are first spliced and fed into the network,and the abstract features of the image are extracted by continuously reducing the image size through downsampling.This procedure can significantly reduce the time and space complexity of the sparse representation calculation.Then,the convolutional network is used to implement convolutional dictionary learning and obtain the sparse representation of the image.Given that the self-attention mechanism can capture non-local similarity and long-range dependency,this study adds self-attention to the convolutional dictionary learning module to improve the reconstruction quality.Finally,hierarchical feature extraction is achieved by fea⁃ture concatenation to realize information fusion between different levels and better use local features in the image.The downsampled feature is recovered to the original image size by an inverse convolutional layer.The model minimizes the negative log-likelihood loss,which represents the difference in probability distribution between the generated image and the original image.After the model is trained,a clear fundus image is generated by gradually removing the noise from a noisy picture with a blurred image as conditional input.Result The proposed method was evaluated on EyePACS dataset,and multiple experiments were performed on synthetic datasets DRIVE(digital retinal images for vessel extraction),CHASEDB1(child heart and health study in England),ROC(retinopathy online challenge),realistic datasets RF(real fundus)and HRF(high-resolution fundus)to demonstrate the generalizability of our model.Experimental results show that the evaluation metrics peak signal-to-noise ratio(PSNR)and learned perceptual image patch similarity(LPIPS)are improved on average by 1.9929 and 0.0289,respectively,compared with the original diffusion model(learning enhance⁃ment from degradation(Led)).Moreover,the proposed approach was used as a preprocessing module for downstream tasks.The experiment on retinal vessel segmentation is adopted to prove that our approach can benefit the downstream tasks in clinical application.The results of segmentation experiments on the DRIVE dataset show that all the segmentation metrics improve compared with the original diffusion model.Specifically,the area under the curve(AUC),accuracy(Acc),and sensitivity(Sen)are improved by 0.0314,0.0030,and 0.0738 on average,respectively.Conclusion The proposed method provides a practical tool for fundus image deblurring and a new perspective to improve the quality and accuracy of diagnostic.This approach has a positive impact on patients and ophthalmologists and is expected to promote fur⁃ther development in the interdisciplinary research of ophthalmology and computer science.

作者王珍霍光磊兰海胡建民魏宪 Wang Zhen;Huo Guanglei;Lan Hai;Hu Jianmin;Wei Xian(College of Mechanical and Electrical Engineering,Fujian Agriculture and Forestry University,Fuzhou 350108,China;Fujian Institute of Research on the Structure of Matter,Chinese Academy of Sciences,Fuzhou 350002,China;Quanzhou TongweiTechnology Co.,Ltd.,Quanzhou 362008,China;School of Medical Technology and Engineering,Fujian Medical University,Fuzhou 350122,China;Software Engineering Institute,East China Normal University,Shanghai 200062,China)

机构地区福建农林大学机电工程学院中国科学院福建物质结构研究所泉州通维科技有限责任公司福建医科大学医学技术与工程学院华东师范大学软件工程学院

出处《中国图象图形学报》 CSCD 北大核心 2024年第8期2426-2438,共13页 Journal of Image and Graphics

基金泉州市科技项目(2023C009R,2022C004L)。

关键词眼底图像增强卷积字典学习稀疏表示扩散模型条件扩散模型 fundus image enhancement convolutional dictionary learning sparse representation diffusion model condi⁃tional diffusion model

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1焦莉娟,王文剑,赵青杉,曹建芳.近邻局部OMP稀疏表示图像去噪[J].中国图象图形学报,2017,22(11):1486-1492. 被引量：7
2王丽芳,窦杰亮,秦品乐,蔺素珍,高媛,张程程.双重字典学习与自适应PCNN相结合的医学图像融合[J].中国图象图形学报,2019,24(9):1588-1603. 被引量：8

二级参考文献7

1焦李成,杨淑媛,刘芳,侯彪.压缩感知回顾与展望[J].电子学报,2011,39(7):1651-1662. 被引量：316
2练秋生,石保顺,陈书贞.字典学习模型、算法及其应用研究进展[J].自动化学报,2015,41(2):240-260. 被引量：121
3魏宁,杨元琴,董方敏.多模图像交叉双域滤波算法[J].中国图象图形学报,2016,21(6):691-697. 被引量：4
4焦莉娟,王文剑.一种快速的K-SVD图像去噪方法[J].小型微型计算机系统,2016,37(7):1608-1612. 被引量：10
5张晓,薛月菊,涂淑琴,胡月明,宁晓锋.基于结构组稀疏表示的遥感图像融合[J].中国图象图形学报,2016,21(8):1106-1118. 被引量：15
6董侠,王丽芳,秦品乐,高媛.改进耦合字典学习的脑部CT/MR图像融合方法[J].计算机应用,2017,37(6):1722-1727. 被引量：3
7楼建强,李俊峰,戴文战.非下采样剪切波变换的医学图像融合[J].中国图象图形学报,2017,22(11):1574-1583. 被引量：18

共引文献13

1李秀明,乜勇,刘丹青.局部自交干扰的无参模糊图像自适应去燥仿真[J].计算机仿真,2018,35(10):457-461. 被引量：2
2陈清江,石小涵,柴昱洲.一种基于信息保留网络的图像去噪算法[J].应用光学,2019,40(3):440-446. 被引量：4
3王慧,冯金顺,程正兴.基于局部路径特征信息神经网络的图像去噪[J].液晶与显示,2020,35(1):70-79. 被引量：3
4亓法国,张海洋,柳淳,赵长明,张子龙.一种基于双分支改良编解码器的图像去噪算法[J].应用光学,2020,41(5):956-964. 被引量：2
5南栋,王志田,郑少华,何林远.一种基于稀疏系数匹配学习的图像去雾算法[J].控制与决策,2020,35(11):2797-2802.
6陈子鎏,胡高鹏,王晓明,黄增喜,杜亚军.基于局部类内结构的鉴别性字典学习方法[J].计算机应用研究,2021,38(2):489-494. 被引量：3
7郭淑娟,高媛,秦品乐,王丽芳.基于多尺度边缘保持分解与PCNN的医学图像融合[J].计算机工程,2021,47(3):276-283. 被引量：8
8蔡郁青,孙忠贵.权值动态化约束的跨模态非局部均值滤波器[J].数字技术与应用,2021,39(7):110-113.
9殷喆,高媛,秦品乐,刘朋伟,王丽芳.基于麦克劳林展开与PCNN的医学图像融合[J].微电子学与计算机,2021,38(12):47-53.
10王杰,赵文义,潘细朋,杨辉华.基于像素校正的编解码多聚焦图像融合网络[J].计算机仿真,2021,38(12):424-429.

1赵春林,马广成,陶思翰,陈卓琳,王明月,方祎鸣,施炜.多模影像对糖尿病小鼠眼底特征的实验研究[J].中国现代医学杂志,2024,34(11):43-50. 被引量：1
2黄柯蒙,姜娜娜,赵文博,郑妍昕,刘文平,朱炬波.SAR图像稀疏表示模型的实证研究[J].中山大学学报（自然科学版）（中英文）,2024,63(4):107-114.
3姜英梅.血清生化指标及凝血酶原时间在妊娠肝病患者检测中的意义探讨[J].中文科技期刊数据库（文摘版）医药卫生,2024(9):0197-0200.
4马真.儿童支气管哮喘的诊断及药物治疗[J].人人健康,2024(20):35-35.
5郑宏亮,贾森清,郭宇朋,薛颖杰,韩晶,赵河明,石志刚.基于DnCNN 的侵彻过载时频去噪方法[J].装备环境工程,2024,21(8):17-24.
6Qian-Qian Wan,Jin-Qiong Zhou,Li-Jian Fang,Ya-Xing Wang,Ye-Nan Wang,Qian Wang,Yan-Ni Yan,Xuan Yang,Shou-Ling Wu,Shuo-Hua Chen,Jost B Jonas,Wen-Bin Wei.Retinal nerve fiber layer defects and chronic kidney disease:the Kailuan Eye Study[J].International Journal of Ophthalmology(English edition),2024,17(9):1696-1706.
7WENBIN LUO,YULING ZOU,HONGXI WU,ZHONGYI YANG,ZHIPENG YOU.Blueberry anthocyanins extract attenuates oxidative stress and angiogenesis on an in vitro high glucose-induced retinopathy model through the miR-33/GLCCI1 axis[J].BIOCELL,2024,48(8):1275-1284.
8陈荣川.心脏彩色多普勒超声联合心电图对高血压性心脏病的诊断效果分析[J].医学前沿,2024(6):33-34.
9周天凡,邵飞雪,万盛,周晨晨,周思锦,花晓琳.基于人工智能模型量化视网膜血管特征参数预测子痫前期的可行性研究[J].上海交通大学学报（医学版）,2024,44(5):552-559.
10杨丽,刘柯婷,杨百元,张丹,罗曦,徐严明.非典型帕金森综合征视功能障碍及视网膜病变的研究进展[J].临床医学进展,2024,14(8):533-540.

中国图象图形学报

2024年第8期

浏览历史

内容加载中请稍等...

基于卷积字典扩散模型的眼底图像增强算法

参考文献2

二级参考文献7

共引文献13

相关作者

相关机构

相关主题

浏览历史