医学图像分割任务中,为充分利用TransUNet模型能有效捕获全局和局部特征的优势,在其基础上,提出DouTransNet模型,编码器部分针对单分支Transformer模块学习角度单一、容易丢失细节特征的问题,将Transformer设计成双分支并行结构,来提取...医学图像分割任务中,为充分利用TransUNet模型能有效捕获全局和局部特征的优势,在其基础上,提出DouTransNet模型,编码器部分针对单分支Transformer模块学习角度单一、容易丢失细节特征的问题,将Transformer设计成双分支并行结构,来提取不同尺度的特征,融合两个分支的特征,实现特征互补;针对在融合两个分支的特征时可能存在冗余信息问题,添加多核并行池化模块,在保留多尺度特征的同时去除冗余信息;在解码器设计多尺度融合模块(USF),融合来自编码器的三个尺度信息,有效弥补编码器与解码器之间的信息差距。在Synapse和ACDC数据集上进行了多次对比实验,Synapse数据集上平均DSC系数可达79.20%,较TransUNet模型提高3.34%,HD距离为25.24%,降低了11.67%;在ACDC数据集上平均DSC系数可达90.30%,提高1.67%。In the medical image segmentation task, in order to make full use of the advantages of TransUNet model which can effectively capture global and local features, the DouTransNet model is proposed based on it. The encoder part aims at the problems of single learning Angle and easy loss of detail features in single-branch Transformer module. Transformer is designed as a two-branch parallel structure to extract features of different scales and fuse features of two branches to achieve feature complementarity. To solve the problem of redundant information when fusing the features of two branches, a multi-core parallel pooling module is added to remove redundant information while retaining multi-scale features. In the decoder design multi-scale fusion module (USF), the information of three scales from the encoder is fused to effectively bridge the information gap between the encoder and the decoder. Several comparison experiments were conducted on Synapse and ACDC data sets. The average DSC coefficient on Synapse data set can reach 79.20%, which is 3.34% higher than TransUNet model, and the HD distance is 25.24%, which is 11.67% lower. The average DSC coefficient on ACDC dataset can reach 90.30%, an increase of 1.67%.展开更多
The development of oceanic remote sensing artificial intelligence has made possible to obtain valuable information from amounts of massive data.Oceanic internal waves play a crucial role in oceanic activity.To obtain ...The development of oceanic remote sensing artificial intelligence has made possible to obtain valuable information from amounts of massive data.Oceanic internal waves play a crucial role in oceanic activity.To obtain oceanic internal wave stripes from synthetic aperture radar(SAR)images,a stripe segmentation algorithm is proposed based on the TransUNet framework,which is a combination of U-Net and Transformer,which is also optimized.Through adjusting the number of Transformer layer,multi-layer perceptron(MLP)channel,and Dropout parameters,the influence of over-fitting on accuracy is significantly weakened,which is more conducive to segmenting lightweight oceanic internal waves.The results show that the optimized algorithm can accurately segment oceanic internal wave stripes.Moreover,the optimized algorithm can be trained on a microcomputer,thus reducing the research threshold.The proposed algorithm can also change the complexity of the model to adapt it to different date scales.Therefore,TransUNet has immense potential for segmenting oceanic internal waves.展开更多
Aim:This study aims to establish an artificial intelligence model,ThyroidNet,to diagnose thyroid nodules using deep learning techniques accurately.Methods:A novel method,ThyroidNet,is introduced and evaluated based on...Aim:This study aims to establish an artificial intelligence model,ThyroidNet,to diagnose thyroid nodules using deep learning techniques accurately.Methods:A novel method,ThyroidNet,is introduced and evaluated based on deep learning for the localization and classification of thyroid nodules.First,we propose the multitask TransUnet,which combines the TransUnet encoder and decoder with multitask learning.Second,we propose the DualLoss function,tailored to the thyroid nodule localization and classification tasks.It balances the learning of the localization and classification tasks to help improve the model’s generalization ability.Third,we introduce strategies for augmenting the data.Finally,we submit a novel deep learning model,ThyroidNet,to accurately detect thyroid nodules.Results:ThyroidNet was evaluated on private datasets and was comparable to other existing methods,including U-Net and TransUnet.Experimental results show that ThyroidNet outperformed these methods in localizing and classifying thyroid nodules.It achieved improved accuracy of 3.9%and 1.5%,respectively.Conclusion:ThyroidNet significantly improves the clinical diagnosis of thyroid nodules and supports medical image analysis tasks.Future research directions include optimization of the model structure,expansion of the dataset size,reduction of computational complexity and memory requirements,and exploration of additional applications of ThyroidNet in medical image analysis.展开更多
文摘医学图像分割任务中,为充分利用TransUNet模型能有效捕获全局和局部特征的优势,在其基础上,提出DouTransNet模型,编码器部分针对单分支Transformer模块学习角度单一、容易丢失细节特征的问题,将Transformer设计成双分支并行结构,来提取不同尺度的特征,融合两个分支的特征,实现特征互补;针对在融合两个分支的特征时可能存在冗余信息问题,添加多核并行池化模块,在保留多尺度特征的同时去除冗余信息;在解码器设计多尺度融合模块(USF),融合来自编码器的三个尺度信息,有效弥补编码器与解码器之间的信息差距。在Synapse和ACDC数据集上进行了多次对比实验,Synapse数据集上平均DSC系数可达79.20%,较TransUNet模型提高3.34%,HD距离为25.24%,降低了11.67%;在ACDC数据集上平均DSC系数可达90.30%,提高1.67%。In the medical image segmentation task, in order to make full use of the advantages of TransUNet model which can effectively capture global and local features, the DouTransNet model is proposed based on it. The encoder part aims at the problems of single learning Angle and easy loss of detail features in single-branch Transformer module. Transformer is designed as a two-branch parallel structure to extract features of different scales and fuse features of two branches to achieve feature complementarity. To solve the problem of redundant information when fusing the features of two branches, a multi-core parallel pooling module is added to remove redundant information while retaining multi-scale features. In the decoder design multi-scale fusion module (USF), the information of three scales from the encoder is fused to effectively bridge the information gap between the encoder and the decoder. Several comparison experiments were conducted on Synapse and ACDC data sets. The average DSC coefficient on Synapse data set can reach 79.20%, which is 3.34% higher than TransUNet model, and the HD distance is 25.24%, which is 11.67% lower. The average DSC coefficient on ACDC dataset can reach 90.30%, an increase of 1.67%.
基金The National Natural Science Foundation of China under contract No.51679132the Science and Technology Commission of Shanghai Municipality under contract Nos.21ZR1427000 and 17040501600.
文摘The development of oceanic remote sensing artificial intelligence has made possible to obtain valuable information from amounts of massive data.Oceanic internal waves play a crucial role in oceanic activity.To obtain oceanic internal wave stripes from synthetic aperture radar(SAR)images,a stripe segmentation algorithm is proposed based on the TransUNet framework,which is a combination of U-Net and Transformer,which is also optimized.Through adjusting the number of Transformer layer,multi-layer perceptron(MLP)channel,and Dropout parameters,the influence of over-fitting on accuracy is significantly weakened,which is more conducive to segmenting lightweight oceanic internal waves.The results show that the optimized algorithm can accurately segment oceanic internal wave stripes.Moreover,the optimized algorithm can be trained on a microcomputer,thus reducing the research threshold.The proposed algorithm can also change the complexity of the model to adapt it to different date scales.Therefore,TransUNet has immense potential for segmenting oceanic internal waves.
基金supported by MRC,UK (MC_PC_17171)Royal Society,UK (RP202G0230)+8 种基金BHF,UK (AA/18/3/34220)Hope Foundation for Cancer Research,UK (RM60G0680)GCRF,UK (P202PF11)Sino-UK Industrial Fund,UK (RP202G0289)LIAS,UK (P202ED10,P202RE969)Data Science Enhancement Fund,UK (P202RE237)Fight for Sight,UK (24NN201)Sino-UK Education Fund,UK (OP202006)BBSRC,UK (RM32G0178B8).
文摘Aim:This study aims to establish an artificial intelligence model,ThyroidNet,to diagnose thyroid nodules using deep learning techniques accurately.Methods:A novel method,ThyroidNet,is introduced and evaluated based on deep learning for the localization and classification of thyroid nodules.First,we propose the multitask TransUnet,which combines the TransUnet encoder and decoder with multitask learning.Second,we propose the DualLoss function,tailored to the thyroid nodule localization and classification tasks.It balances the learning of the localization and classification tasks to help improve the model’s generalization ability.Third,we introduce strategies for augmenting the data.Finally,we submit a novel deep learning model,ThyroidNet,to accurately detect thyroid nodules.Results:ThyroidNet was evaluated on private datasets and was comparable to other existing methods,including U-Net and TransUnet.Experimental results show that ThyroidNet outperformed these methods in localizing and classifying thyroid nodules.It achieved improved accuracy of 3.9%and 1.5%,respectively.Conclusion:ThyroidNet significantly improves the clinical diagnosis of thyroid nodules and supports medical image analysis tasks.Future research directions include optimization of the model structure,expansion of the dataset size,reduction of computational complexity and memory requirements,and exploration of additional applications of ThyroidNet in medical image analysis.