摘要
提出了一种融合先验式图卷积与Transformer的U型网络(GTU-Net)。该网络通过将图卷积和Transformer有机结合,从像素到块逐级建立网络所需的局部-全局信息。在此过程中,为应对低特异性导致背景无关组织干扰性强的客观挑战,在图卷积前设计了一种新型的图邻接矩阵构造方法,即先验图学习。该方法通过监督学习的方式学习标签中潜藏的像素级类别及位置先验信息,从而更精确地刻画出不同特征的类间及类内关系,进一步增强图卷积网络的局部推理能力。实验结果表明,GTU-Net在私有儿童肺炎计算机断层扫描(CT)数据集上的Dice相似系数、杰卡德系数、敏感性、马修斯相关系数和平均表面距离指标相较性能最优的Transformer对比网络分别提升了1.82%、2.91%、1.33%、1.85%和0.0268 pixel,同时也在2个公有新型冠状病毒感染CT数据集上取得了较好的表现,这有效验证了其良好的泛化性。
Objective Pneumonia is one of the most common and fatal diseases during childhood.Accurate segmentation of lung CT images is crucial for early detection.However,manually outlining infected lung regions is labor-intensive for radiologists.Automatic segmentation technology holds significant promise in alleviating the strain on medical resources.In childhood pneumonia CT images,infected areas are often fragmented across different lung lobes.Therefore,precise global contextual information is essential for accurate segmentation.While purely transformer-based segmentation networks have demonstrated strong learning capabilities in this regard,they often struggle with producing high-quality local details due to limited patch size and insufficient local prior knowledge.Moreover,lung tissues such as the hilum and mediastinal areas closely resemble infected regions in childhood pneumonia,which demands robust network performance to minimize interference.To address these challenges,we propose a prior graph convolution and transformer fusion network based on U-Net(GTU-Net).Methods The core concept of GTU-Net involves integrating graph convolutional network(GCN)and Transformer to mutually enhance each other’s strengths.It utilizes GCN to establish pixel relationships within each patch,and then leverages the Transformer to capture global information between patches.In addition,a novel method called prior graph learning(PGL)is introduced within GCN to mitigate interference from irrelevant regions.GTU-Net comprises three main modules:PGL,graph convolution mixed transformer(GCT),and the encoder-decoder structure of U-Net,as illustrated in Fig.2.Upon receiving features extracted by the encoder,these are first processed using coordinate-aware projection(Fig.3)to form graph nodes and adjacency matrices.Subsequently,the adjacency matrices undergo further refinement through the PGL module(Fig.4),which uses a supervised approach to incorporate category priors and localization information from labels.The data is then divided into non-overlapping subgraphs.With PGL’s assistance,the local reasoning capabilities of GCN are significantly enhanced,enabling precise descriptions of intra-class and inter-class feature relationships.This design is referred to as a prior graph convolutional network(PriorGCN).Next,the divided graph data are fed into the GCT module,which consists of PriorGCN and Vision Transformer(ViT).GCT aims to sequentially establish intra-patch localization and inter-patch globalization,thereby addressing challenges posed by complex local structures and scattered infection regions in childhood pneumonia.Finally,the decoder performs upsampling to produce the final segmentation result.Results and Discussions One private childhood pneumonia dataset(Child-P)and two publicly available COVID-19 datasets(COVID and MosMed)are used to validate the proposed GTU-Net.The ablation results indicate that each proposed module noticeably boosts segmentation performance(Table 2).Specifically,PriorGCN contributes the most,with improvements of 4.44 percentage points in DSC,6.82 percentage points in JI,6.31 percentage points in SE,4.41 percentage points in MCC,and a reduction of 0.1615 pixel in ASD compared to the baseline.In comparative experiments,GTU-Net achieves the best performance across all metrics on the Child-P dataset(Table 4),particularly excelling in JI and MCC metrics with improvements of 2.91 percentage points and 1.85 percentage points,respectively,compared to the second-best network.Moreover,GTU-Net demonstrates superior sensitivity in segmenting fragmented and tiny lesions,resulting in more comprehensive segmentation outcomes in these regions compared to other networks(Fig.10).Similarly,GTU-Net shows the best performance on the COVID dataset(Table 5),particularly notable in the improvement of the SE metric,highlighting the excellent feature discrimination capability of the PGL module.GTU-Net also outperforms other networks in DSC,JI,and MCC metrics on the MosMed dataset,achieving improvements of 1.70 percentage points,1.77 percentage points,and 1.93 percentage points,respectively,compared to the second-best network(Table 4).Visualization results from the two COVID-19 datasets reveal that GTU-Net effectively addresses issues such as under-segmentation or over-segmentation(Fig.11).Additionally,GTU-Net exhibits superior local segmentation results,avoiding the checkerboard artifact often seen in transformer-based networks(Fig.12).Importantly,GTU-Net maintains its superior performance even when it is trained on small datasets without pre-training on larger datasets(Fig.13).Conclusions We select childhood pneumonia as our research focus,an area relatively underexplored in existing studies.We propose a novel GTU-Net to address the segmentation challenges presented by childhood pneumonia CT images,which are characterized by high noise interference,the presence of tiny lesions,and fragmented distribution.GTU-Net incorporates a GCT module to systematically capture local-global information.Additionally,a PGL module is introduced to construct a high-quality graph adjacency matrix for GCN,enhancing the network’s ability to discriminate between inter-class and intra-class features.Unlike most existing transformer-based segmentation networks,GTU-Net does not rely on pre-training,which strengthens its clinical applicability.Experimental results on a private childhood pneumonia CT dataset demonstrate that GTU-Net outperforms state-of-the-art transformer networks.Furthermore,it exhibits strong performance on two publicly available COVID-19 CT datasets,verifying its generalizability.
作者
梁浩城
吕佳
于明楷
陈欣
Liang Haocheng;LüJia;Yu Mingkai;Chen Xin(College of Computer and Information Sciences,Chongqing Normal University,Chongqing 401331,China;National Center for Applied Mathematics in Chongqing,Chongqing Normal University,Chongqing 401331,China;Ministry of Education Key Laboratory of Child Development and Disorders,National Clinical Research Center for Child Health and Disorders,Children’s Hospital of Chongqing Medical University,Chongqing 400014,China)
出处
《光学学报》
EI
CAS
CSCD
北大核心
2024年第16期88-102,共15页
Acta Optica Sinica
基金
国家自然科学基金重大项目(11991024)
国家儿童健康与疾病临床医学研究中心卫生项目(NCRCCHD-2022-HP-01)
重庆市教委重点项目(KJZD-K202200511)
重庆市2023年研究生科研创新项目(CYS23407)
重庆师范大学研究生科研创新项目(YKC23031)。