Lightweight focus quality assessment network for pathological images with amplified receptive field


Abstract
Objective Poor focus quality in digital pathology images produced by slide scanners can seriously reduce the accuracy of tumor diagnosis, so automated algorithms for assessing the focus quality of these images are essential. Existing focus quality assessment relies mainly on deep learning, but conventional convolutional neural networks (CNNs) extract global information poorly and are computationally expensive. To address this, a lightweight focus quality assessment network for pathological images with an amplified receptive field is proposed.
Method The network introduces large convolution kernels to amplify its receptive field and capture more global information. A new dual-stream large kernel attention mechanism then strengthens the extraction of global information across space and channels. Finally, the network is optimized into three versions (large, medium, and small) with successively fewer parameters to make it lightweight.
Result The proposed large network outperforms comparable state-of-the-art methods. Compared with the large network, the optimized small network sacrifices little performance while markedly reducing the number of parameters, the computational cost, and the CPU inference time. Compared with the comparable lightweight network SDCNN (self-defined convolutional neural network), the small network improves the Spearman's rank correlation coefficient (SRCC), Pearson linear correlation coefficient (PLCC), and Kendall rank correlation coefficient (KRCC) by 0.0161, 0.0166, and 0.0299, respectively, while reducing the number of parameters, the computational cost, and the CPU inference time by 39.06%, 95.11%, and 51.91%, respectively.
Conclusion The proposed method effectively extracts global focus information from digital pathology images at a lower computational cost and is feasible in practice.

Extended abstract
Objective Histopathology is the gold standard for tumor diagnosis. With the development of digital pathology slide scanners, digital pathology has brought revolutionary changes to clinical pathological diagnosis. Pathologists examine tissue in digital images and make diagnoses based on the characteristics they observe. The same images are also fed into computer-aided diagnostic systems for automated diagnosis, which speeds up the process. However, focusing errors introduced during scanning can blur digital pathology images locally or globally. Blurred areas prevent pathologists from observing tissue and cellular structures accurately and can lead to misdiagnosis. Studying focus quality evaluation for pathological images is therefore crucial. Both traditional machine learning and deep learning methods have been applied to this problem. In machine learning-based methods, features are hand-crafted with the help of prior knowledge, such as optical or microscopic imaging principles, and fed into a classifier to obtain focus predictions automatically. These methods do not learn focus features from the pathological images themselves, which results in low evaluation accuracy. Deep learning-based methods learn complex features automatically and substantially improve evaluation accuracy. Current deep learning-based work enhances the processing of global focus information in pathological images by introducing attention mechanisms. However, the receptive field of these attention mechanisms is limited, so the global focus information they capture is inadequate. Moreover, the existing networks with better performance require more parameters and computation, which makes them harder to apply in practice. In this paper, a focus quality assessment network with amplified receptive field (ARF-FQANet) is proposed to address poor global information extraction and excessive computation.
Method In ARF-FQANet, a large convolution kernel is used to amplify the receptive field of the network, and a dual-stream large kernel attention (DsLKA) mechanism is then integrated. In DsLKA, large kernel channel attention and large kernel spatial attention are proposed to capture global focus information across channels and space, respectively. The proposed large kernel channel attention improves on classical channel attention: its large kernel retransmit squeeze (LKRS) method redistributes weights across space, which avoids the loss of salient weights that occurs in classical channel attention. As the input features are downsampled, local cellular semantic information gradually becomes salient, which may weaken the capability of the network to represent focus information. A local stable downsampling block (LSDSB) is designed to address this problem. Integrating LSDSB minimizes extraneous information during the upsampling and downsampling processes and thus ensures the local stability of the features. A short branch is introduced to create a residual attention block (RAB) based on the DsLKA block (DsLKAB) and LSDSB modules. In this short branch, noise is extracted using a minimum pooling operation, which effectively suppresses the learning of noisy information during backpropagation and improves the capability of the network to represent focus information. In addition, an initial feature enhancement block (IFEB) is introduced at the initial stage of the network to enhance the capability of the initial layer to represent focus information. The features obtained by IFEB provide highly comprehensive information for the subsequent layers. To obtain a lightweight network, a strategy for decomposing large convolutional kernels is introduced, which substantially reduces the number of parameters and the computational requirements. The network parameters are then reduced further to achieve additional compression. The network is finally optimized into three versions (large, medium, and small) with successively fewer parameters.
Result Comparative experiments are performed on a publicly available dataset for focus quality assessment of pathology images. The compared networks are categorized as small, medium, or large according to their number of parameters. Among the large networks, the proposed large network performs best, with an RMSE of 0.7658, an SRCC of 0.9578, a PLCC of 0.9562, and a KRCC of 0.8523. These results show that the predicted focus scores are highly consistent with the actual focus scores. Among the small and medium networks, the performance of the proposed small and medium networks is slightly degraded, but their parameters and computational complexity are notably reduced. Compared with the self-defined convolutional neural network (SDCNN), the number of parameters, the floating-point operations, and the CPU inference time (CPU-Time) of the small network (ARF-FQANet-S) are reduced by 39.06%, 95.11%, and 51.91%, respectively. The small network may not outperform the FocusLiteNN network in speed, but it still delivers performance comparable to that of larger networks. This paper also visualizes the receptive fields of several networks at different stages. The results indicate that the proposed ARF-FQANet obtains larger receptive fields, especially in the initial layer of the network. Additional global focus information is therefore obtained at the initial layer, which contributes to the stable performance of the small ARF-FQANet.
Conclusion Compared with similar methods, the proposed network efficiently extracts global focus information from pathological images. A large convolutional kernel expands the receptive field of the network, and DsLKA enhances the learning of global information across space and channels. This strategy ensures that the network maintains competitive performance even after notable parameter reductions. The small network (ARF-FQANet-S) offers a clear advantage in CPU inference time and is well suited to lightweight deployment on edge devices. Overall, the results provide a technical reference for designing lightweight models.
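The four evaluation metrics named in the abstract (RMSE, SRCC, PLCC, and KRCC) compare predicted focus scores with actual focus scores. The following is a minimal pure-Python sketch of how such metrics can be computed. It is not the authors' evaluation code: the function names and the toy score lists are hypothetical, and KRCC is computed here as Kendall's tau-b, which is an assumption because the abstract does not say which variant was used.

```python
import math

def rmse(pred, target):
    """Root mean squared error between two equal-length score lists."""
    return math.sqrt(sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred))

def plcc(x, y):
    """Pearson linear correlation coefficient."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def srcc(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return plcc(average_ranks(x), average_ranks(y))

def krcc(x, y):
    """Kendall rank correlation (tau-b variant, assumed), O(n^2) pair scan."""
    concordant = discordant = only_x_tied = only_y_tied = 0
    for i in range(len(x)):
        for j in range(i + 1, len(x)):
            dx, dy = x[i] - x[j], y[i] - y[j]
            if dx == 0 and dy == 0:
                continue  # tied in both lists: excluded from both factors
            if dx == 0:
                only_x_tied += 1
            elif dy == 0:
                only_y_tied += 1
            elif dx * dy > 0:
                concordant += 1
            else:
                discordant += 1
    untied = concordant + discordant
    denom = math.sqrt((untied + only_x_tied) * (untied + only_y_tied))
    return (concordant - discordant) / denom

# Hypothetical toy focus scores; one pair of predictions is in the wrong order.
predicted = [1.0, 3.0, 2.5, 4.5, 6.0]
actual = [1.2, 2.0, 3.5, 4.0, 6.5]

print(f"RMSE {rmse(predicted, actual):.4f}")
print(f"PLCC {plcc(predicted, actual):.4f}")
print(f"SRCC {srcc(predicted, actual):.4f}")  # 0.9000 for this toy data
print(f"KRCC {krcc(predicted, actual):.4f}")  # 0.8000 for this toy data
```

A lower RMSE means smaller absolute errors, while SRCC and KRCC depend only on how well the predicted ordering matches the actual ordering, which is why a single swapped pair lowers them even when the errors are small.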

Authors: Ding Weilong (丁维龙); Zhu Wei (朱伟); Liao Wanyin (廖婉茵); Liu Jinlong (刘津龙); Wang Chunnian (汪春年); Zhu Xingqin (祝行琴) (College of Computer Science and Technology, Zhejiang University of Technology, Hangzhou 310023, China; Ningbo Diagnostic Pathology Center, Ningbo 315021, China)

Affiliations: College of Computer Science and Technology, Zhejiang University of Technology; Ningbo Diagnostic Pathology Center

Source: Journal of Image and Graphics (《中国图象图形学报》), CSCD, Peking University Core Journal, 2024, No. 11, pp. 3447-3461 (15 pages)

Funding: National Natural Science Foundation of China (32271983); Zhejiang Provincial Basic Public Welfare Research Program (TGY24F020014, LTGY23F020005).

Keywords: digital pathological images; focus quality assessment; amplified receptive field; attention mechanism; lightweight

Classification code: TP391.4 [Automation and Computer Technology: Computer Application Technology]


