融合先验式图卷积与Transformer的儿童肺炎CT图像分割网络

Childhood Pneumonia CT Image Segmentation Network with Prior Graph Convolution and Transformer Fusion

导出

摘要提出了一种融合先验式图卷积与Transformer的U型网络(GTU-Net)。该网络通过将图卷积和Transformer有机结合,从像素到块逐级建立网络所需的局部-全局信息。在此过程中,为应对低特异性导致背景无关组织干扰性强的客观挑战,在图卷积前设计了一种新型的图邻接矩阵构造方法,即先验图学习。该方法通过监督学习的方式学习标签中潜藏的像素级类别及位置先验信息,从而更精确地刻画出不同特征的类间及类内关系,进一步增强图卷积网络的局部推理能力。实验结果表明,GTU-Net在私有儿童肺炎计算机断层扫描(CT)数据集上的Dice相似系数、杰卡德系数、敏感性、马修斯相关系数和平均表面距离指标相较性能最优的Transformer对比网络分别提升了1.82%、2.91%、1.33%、1.85%和0.0268 pixel,同时也在2个公有新型冠状病毒感染CT数据集上取得了较好的表现,这有效验证了其良好的泛化性。 Objective Pneumonia is one of the most common and fatal diseases during childhood.Accurate segmentation of lung CT images is crucial for early detection.However,manually outlining infected lung regions is labor-intensive for radiologists.Automatic segmentation technology holds significant promise in alleviating the strain on medical resources.In childhood pneumonia CT images,infected areas are often fragmented across different lung lobes.Therefore,precise global contextual information is essential for accurate segmentation.While purely transformer-based segmentation networks have demonstrated strong learning capabilities in this regard,they often struggle with producing high-quality local details due to limited patch size and insufficient local prior knowledge.Moreover,lung tissues such as the hilum and mediastinal areas closely resemble infected regions in childhood pneumonia,which demands robust network performance to minimize interference.To address these challenges,we propose a prior graph convolution and transformer fusion network based on U-Net(GTU-Net).Methods The core concept of GTU-Net involves integrating graph convolutional network(GCN)and Transformer to mutually enhance each other’s strengths.It utilizes GCN to establish pixel relationships within each patch,and then leverages the Transformer to capture global information between patches.In addition,a novel method called prior graph learning(PGL)is introduced within GCN to mitigate interference from irrelevant regions.GTU-Net comprises three main modules:PGL,graph convolution mixed transformer(GCT),and the encoder-decoder structure of U-Net,as illustrated in Fig.2.Upon receiving features extracted by the encoder,these are first processed using coordinate-aware projection(Fig.3)to form graph nodes and adjacency matrices.Subsequently,the adjacency matrices undergo further refinement through the PGL module(Fig.4),which uses a supervised approach to incorporate category priors and localization information from labels.The data is then divided into non-overlapping subgraphs.With PGL’s assistance,the local reasoning capabilities of GCN are significantly enhanced,enabling precise descriptions of intra-class and inter-class feature relationships.This design is referred to as a prior graph convolutional network(PriorGCN).Next,the divided graph data are fed into the GCT module,which consists of PriorGCN and Vision Transformer(ViT).GCT aims to sequentially establish intra-patch localization and inter-patch globalization,thereby addressing challenges posed by complex local structures and scattered infection regions in childhood pneumonia.Finally,the decoder performs upsampling to produce the final segmentation result.Results and Discussions One private childhood pneumonia dataset(Child-P)and two publicly available COVID-19 datasets(COVID and MosMed)are used to validate the proposed GTU-Net.The ablation results indicate that each proposed module noticeably boosts segmentation performance(Table 2).Specifically,PriorGCN contributes the most,with improvements of 4.44 percentage points in DSC,6.82 percentage points in JI,6.31 percentage points in SE,4.41 percentage points in MCC,and a reduction of 0.1615 pixel in ASD compared to the baseline.In comparative experiments,GTU-Net achieves the best performance across all metrics on the Child-P dataset(Table 4),particularly excelling in JI and MCC metrics with improvements of 2.91 percentage points and 1.85 percentage points,respectively,compared to the second-best network.Moreover,GTU-Net demonstrates superior sensitivity in segmenting fragmented and tiny lesions,resulting in more comprehensive segmentation outcomes in these regions compared to other networks(Fig.10).Similarly,GTU-Net shows the best performance on the COVID dataset(Table 5),particularly notable in the improvement of the SE metric,highlighting the excellent feature discrimination capability of the PGL module.GTU-Net also outperforms other networks in DSC,JI,and MCC metrics on the MosMed dataset,achieving improvements of 1.70 percentage points,1.77 percentage points,and 1.93 percentage points,respectively,compared to the second-best network(Table 4).Visualization results from the two COVID-19 datasets reveal that GTU-Net effectively addresses issues such as under-segmentation or over-segmentation(Fig.11).Additionally,GTU-Net exhibits superior local segmentation results,avoiding the checkerboard artifact often seen in transformer-based networks(Fig.12).Importantly,GTU-Net maintains its superior performance even when it is trained on small datasets without pre-training on larger datasets(Fig.13).Conclusions We select childhood pneumonia as our research focus,an area relatively underexplored in existing studies.We propose a novel GTU-Net to address the segmentation challenges presented by childhood pneumonia CT images,which are characterized by high noise interference,the presence of tiny lesions,and fragmented distribution.GTU-Net incorporates a GCT module to systematically capture local-global information.Additionally,a PGL module is introduced to construct a high-quality graph adjacency matrix for GCN,enhancing the network’s ability to discriminate between inter-class and intra-class features.Unlike most existing transformer-based segmentation networks,GTU-Net does not rely on pre-training,which strengthens its clinical applicability.Experimental results on a private childhood pneumonia CT dataset demonstrate that GTU-Net outperforms state-of-the-art transformer networks.Furthermore,it exhibits strong performance on two publicly available COVID-19 CT datasets,verifying its generalizability.

作者梁浩城吕佳于明楷陈欣 Liang Haocheng;LüJia;Yu Mingkai;Chen Xin(College of Computer and Information Sciences,Chongqing Normal University,Chongqing 401331,China;National Center for Applied Mathematics in Chongqing,Chongqing Normal University,Chongqing 401331,China;Ministry of Education Key Laboratory of Child Development and Disorders,National Clinical Research Center for Child Health and Disorders,Children’s Hospital of Chongqing Medical University,Chongqing 400014,China)

机构地区重庆师范大学计算机与信息科学学院重庆师范大学重庆国家应用数学中心重庆医科大学附属儿童医院国家儿童健康与疾病临床医学研究中心儿童发育与疾病教育部重点实验室

出处《光学学报》 EI CAS CSCD 北大核心 2024年第16期88-102,共15页 Acta Optica Sinica

基金国家自然科学基金重大项目(11991024) 国家儿童健康与疾病临床医学研究中心卫生项目(NCRCCHD-2022-HP-01) 重庆市教委重点项目(KJZD-K202200511) 重庆市2023年研究生科研创新项目(CYS23407) 重庆师范大学研究生科研创新项目(YKC23031)。

关键词图像处理儿童肺炎 CT图像分割 U-Net 图卷积网络 TRANSFORMER 先验图学习 image processing childhood pneumonia CT image segmentation U-Net graph convolutional network Transformer prior graph learning

分类号 TP361 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

1李亚琴,卢燕鸣.儿童肺炎,吃药好还是输液好[J].大众医学,2024(10):64-65.
2毕海霞,沈国民,周家俊,袁增权.头孢呋辛和阿莫西林克拉维酸钾分别联合布地奈德治疗儿童肺炎临床疗效观察[J].实用医院临床杂志,2024,21(5):123-126.
3方圆,廖永杏,陈思逸.纤维支气管镜肺泡灌洗诊治儿童肺炎支原体性大叶性肺炎临床研究[J].首都食品与医药,2024,31(18):11-14.
4王碧,黄育丹.儿童肺炎的临床疾病特征及支原体DNA的检测分析[J].中国妇幼保健,2024,39(17):3323-3326.
5刘一博.肺癌CT图像分割和三维重建技术研究[J].中文科技期刊数据库（全文版）医药卫生,2024(10):0157-0161.
6傅锦涛,张弓,张树忠.基于迁移学习的轴承故障诊断[J].福建理工大学学报,2024,22(4):393-400.
7张继源.3D打印金属垫块在髋臼骨缺损重建中的临床应用研究[J].中文科技期刊数据库（引文版）医药卫生,2024(9):0017-0020.
8金凡晴.给我选择,但别太多[J].求贤,2024(9):40-40.
9柳玉婷,昌杰,黄道斌,胡天寒.基于改进的U-Net的新冠肺炎病灶分割方法研究[J].西安文理学院学报（自然科学版）,2024,27(2):45-50.
10西安电子科技大学陈晓峰团队研究成果被USENIX Security收录[J].信息网络安全,2024(10):1611-1611.

光学学报

2024年第16期

浏览历史

内容加载中请稍等...

融合先验式图卷积与Transformer的儿童肺炎CT图像分割网络

相关作者

相关机构

相关主题

浏览历史