期刊文献+

基于Transformer的肾实质新型分割网络

A Novel Renal Parenchyma Segmentation Network Based on Transformer
下载PDF
导出
摘要 肾在人体内是一个很重要的器官,而肾实质是很常见的一种肾病。目前关于肾实质病变的判断是临床医生通过标注,人工进行判断的。这样人工的方式需要大量的时间以及人工成本,也因此我们亟需一种自动化的标注分割方法,从而提升肾实质的分割效率与精度。本文针对小儿肾实质的分割问题,绘制了一套儿童的肾实质数据集,并且根据数据集的特点提出了一种基于transform的分割方法。Transform架构不同于传统卷积提取特征的架构,transform更加关注语义的上下文信息,我们用它作为编码器的一部分来提取语义信息,从序列到序列学习的角度,为图像分割提供了一个全新的视角。这样做不仅改善了分辨率降低导致感受野下降的问题,同时也改善了跳跃连接会带来语义间隙的问题,得到最终的分割结果图,极大地减少了人工标注的代价。本文的代码是基于pytorch框架进行的编程,在所提出的肾图数据集上进行的实验,并将本文提出的网络与经典的FCN、SegNet、U-Net和Deeplab-V3+做了对比实验。结果显示本文提出的方法在precision、dice_coeff、recall三种评价指标上(对比其网络在这三种指标上最优的结果),分别提升了1.99%、1.65%、2.23%和3.001%,其效果也得到了专业医生的认可。 The kidney is a very important organ in the human body, and the renal parenchyma is a very common kidney disease. At present, the judgment of renal parenchymal lesions is made manually by clinicians through labeling. This artificial method requires a lot of time and labor cost. Therefore, we urgently need an automatic labeling and segmentation method to improve the efficiency and accuracy of renal parenchyma segmentation. Therefore, aiming at the segmentation of children’s renal parenchyma, this paper draws a set of children’s renal parenchyma data set, and proposes a segmentation method based on transform according to the characteristics of the data set. Transform architecture is different from the traditional convolution feature extraction architecture. Transform pays more attention to the semantic context information. We use it as a part of the encoder to extract semantic information. From the perspective of sequence to sequence learning, it provides a new perspective for image segmentation. This not only improves the problem of reduced resolution leading to the decline of receptive field, but also improves the problem of semantic gap caused by jump connection, and obtains the final segmentation result image, which greatly reduces the cost of manual annotation. The code of this paper is based on the programming of pytorch framework. The experiment is carried out on the proposed nephrogram data set, and the network proposed in this paper is compared with the classical FCN, segnet, u-net and deeplab-v3 +. The results show that the proposed method is effective in precision, dice_coeff and recall on the three evaluation indexes of coeff and recall (comparing the best results of their network in these three indexes) have increased by 1.99%, 1.65%, 2.23% and 3.001% respectively, and their effects have also been recognized by professional doctors.
作者 张容祥
机构地区 上海理工大学
出处 《人工智能与机器人研究》 2022年第2期122-133,共12页 Artificial Intelligence and Robotics Research
  • 相关文献

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部