多尺度深度特征提取的肝脏肿瘤CT图像分类被引量：4

CT image classification of liver tumors based onmulti-scale and deep feature extraction

导出

摘要目的肝脏肿瘤是人体最具侵袭性的恶性肿瘤之一,传统的肿瘤诊断依靠观察患者的CT(computed tomography)图像,工作量大时易造成疲劳,难免会产生误诊,为此使用计算机辅助的方法进行诊断,但现有的深度学习方法中存在肿瘤分类准确率低、网络的特征表达能力和特征提取能力较弱等问题。对此,本文设计了一种多尺度深度特征提取的分类网络模型。方法首先在原始CT图像中选取感兴趣区域,然后根据CT图像的头文件进行像素值转换,并进行数据增强来扩充构建数据集,最后将处理后的数据输入到本文提出的分类网络模型中输出分类结果。该网络通过多尺度特征提取模块来提取图像的多尺度特征并增加网络的感受野,使用深度特征提取模块降低背景噪声信息,并着重关注病灶区域有效特征,通过集成并行的空洞卷积使得尺度多元化,并将普通卷积用八度卷积替换来减少参数量,提升分类性能,最终实现了对肝脏肿瘤的精确分类。结果本文模型达到了87.74%的最高准确率,比原始模型提升了9.92%;与现有主流分类网络进行比较,多项评价指标占优,达到了86.04%的召回率,87%的精准率,86.42%的F1分数;此外,通过消融实验进一步验证了所提方法的有效性。结论本文方法可以较为准确地对肝脏肿瘤进行分类,将此方法结合到专业的医疗软件当中去,能够为医生早期的诊断和治疗提供可靠依据。 Objective Liver tumors are the most aggressive malignancies in the human body. The definition of lesion type and lesion period based on computed tomography(CT) images determines the diagnosis and strategy of the treatment, which requires professional knowledge and rich experience of experts to classify them. Fatigue is easily experienced when the workload is heavy, and even experienced senior experts have difficulty avoiding misdiagnosis. Deep learning can avoid the drawbacks of traditional machine learning that takes a certain amount of time to manually extract the features of the image and perform dimensionality reduction, and is capable of extracting high-dimensional features of an image. Using deep learning to assist doctors in diagnosis is important. In the existing medical image classification task, the challenge of the low accuracy of tumor classification, the weak capability of the feature extraction, and the rough dataset still remain. To address these tasks, this study presents a method with a multi-scale and deep feature extraction classification network. Method First, we extract the region of interest(ROI) according to the contours of the liver tumors that were labeled by experienced radiologists, along with the ROI of healthy livers. The ROI is extracted to capture the features of the lesion area and surrounding tissue, which is relative to the size of the lesion. Due to the different sizes of the lesion area, the size of the extracted ROI is also different. Then, the pixel value is converted and data augmentation is performed. The dataset is Hounsfield windows, the range of CT values is(-1 024, 3 071), and the range of digital imaging and communications in medicine(DICOM) image is(0, 4 096). The pixel values of DICOM images have to be converted to CT values. First, we read rescaleintercept and rescaleslope from the DICOM header file, and then we use the formula to convert. Thereafter, we limit the CT values of liver datasets to [-100, 400] Hounsfield HU to avoid the influence of the background noise of the unrelated organs or tissues. We perform several data augmentation methods such as flipping, rotation, and transforming to expand the diversity of the datasets. Then, these images are sent into the MDSENet for classification. The MDSENet network is a SEResNet-like convolution neural network that can achieve end-to-end classification. The SEResNet learns the important features automatically from each channel to strengthen the useful features and suppress useless ones. MDSENet network is much deeper than SEResNet. Our contributions are the following: 1) Hierarchical residual-like connections are used to improve multi-scale expression and increase the receptive field of each network layer. In the study, the image features after 1×1 convolution layers are divided into four groups. Each group of features passes through the 3×3 residual-like convolution groups, which improves the multi-scale feature extraction of networks and enhances the acquisition of focus areas features. 2) Channel attention and spatial attention are used to further focus on effective information on medical images. We let the feature images first go through the channel attention module, then we multiply its input and output to go through the spatial attention module. Then, we multiply the output of the spatial attention module and its input, which can pay more attention to the features of the lesion area and reduce the influence of background noise. 3) Atrous convolutions connected in parallel which refer to the spatial pyramid pooling, then we use 1×1 convolution layers to strengthen the feature. Finally, we concatenate the output and use softmax in classification. In this way, we can expand the receptive field and increase the image resolution, which can improve the feature expression ability and prevent the loss of information effectively. 4) The ordinary convolution is replaced by octave convolution to reduce the number of parameters and improve the classification performance. In this study, we compared the results of DenseNet, ResNet, MnasNet, MobileNet, ShuffleNet, SKResNet, and SEResNet with those of our MDSENet, all of which were trained on the liver dataset. During the experiment, due to the limitation of graphics processing unit(GPU) memory, we set a batch size of 16 with Adam optimization and learning rate of 0.002 for 150 epochs. We used the dataset in Pytorch framework, Ubuntu 16.04. All experiments used the NVIDIA GeForce GTX 1060 Ti GPU to verify the effectiveness of our proposed method. Result Our training set consists of 4 096 images and the test set consists of 1 021 images for the liver dataset. The classification accuracy of our proposed method is 87.74% and is 9.92% higher than the baseline(SEResNet101). Our module achieves the best result compared with the state-of-the-art network and achieved 86.04% recall, 87% precision, 86.42% F1-score under various evaluation indicators. Ablation experiments are conducted to verify the effectiveness of the method. Conclusion In this study, we proposed a method to classify the liver tumors accurately. We combined the method into professional medical software so that we can provide a foundation that physicians can use in early diagnosis and treatment.

作者毛静怡宋余庆刘哲 Mao Jingyi;Song Yuqing;Liu Zhe(School of Computer Science and Communication Engineering,Jiangsu University,Zhenjiang 212013,China)

机构地区江苏大学计算机科学与通信工程学院

出处《中国图象图形学报》 CSCD 北大核心 2021年第7期1704-1715,共12页 Journal of Image and Graphics

基金国家自然科学基金项目(61976106,61772242,61572239) 中国博士后科学基金项目(2017M611737) 江苏省“六大人才高峰”高层次人才项目(DZXX-122) 镇江市卫生计生科技重点项目(SHW2017019)。

关键词深度学习肝脏肿瘤分类多尺度特征特征提取空洞卷积 deep learning liver lesion classification multi-scale features feature extraction dilated convolution

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1刘哲,张晓林,宋余庆,朱彦,袁德琪.结合改进的U-Net和Morphsnakes的肝脏分割[J].中国图象图形学报,2018,23(8):1254-1262. 被引量：17

二级参考文献5

1吉宏伟,何江萍,杨新.基于层次上下文活动轮廓的三维CT肝脏图像分割[J].生物医学工程学杂志,2014,31(2):405-412. 被引量：6
2张小强,熊博莅,匡纲要.一种基于变化检测技术的SAR图像舰船目标鉴别方法[J].电子与信息学报,2015,37(1):63-70. 被引量：14
3赵于前,闫桂霞,徐效文,邹润民.基于先验信息水平集方法的肝脏CT序列图像自动分割[J].中南大学学报（自然科学版）,2015,46(4):1310-1317. 被引量：6
4韩明,刘教民,孟军英,王震洲,王敬涛.结合局部能量与改进的符号距离正则项的图像目标分割算法[J].电子与信息学报,2015,37(9):2047-2054. 被引量：13
5廖苗,赵于前,曾业战,黄忠朝,邹北骥.基于图割和边缘行进的肝脏CT序列图像分割[J].电子与信息学报,2016,38(6):1552-1556. 被引量：7

共引文献16

1刘晓虹,朱玉全,刘哲,宋余庆,朱彦,袁德琪.基于改进多尺度LBP算法的肝脏CT图像特征提取方法[J].计算机科学,2019,46(3):125-130. 被引量：15
2洪汉玉,孙建国,栾琳,王硕,郑新波.基于U-net模型的航拍图像去绳带方法[J].应用光学,2019,40(5):786-794. 被引量：3
3张欢,赵希梅,魏宾.多任务网络模型对肝脏超声影像的识别和去噪[J].青岛大学学报（自然科学版）,2019,32(4):40-49.
4任欣磊,王阳萍,杨景玉,高德成.基于改进U-net的遥感影像建筑物提取[J].激光与光电子学进展,2019,56(22):187-194. 被引量：27
5亢洁,丁菊敏,万永,雷涛.基于分水岭修正与U-Net的肝脏图像分割算法[J].计算机工程,2020,46(1):255-261. 被引量：12
6陈进,韩梦娜,练毅,张帅.基于U-Net模型的含杂水稻籽粒图像分割[J].农业工程学报,2020,36(10):174-180. 被引量：28
7姚山虎,冯智超,李娜,容鹏飞,罗爱静.基于战略坐标和共现网络的肝脏分割研究可视化分析[J].中华医学图书情报杂志,2020,29(4):53-65.
8吉彬,任建君,郑秀娟,谭聪,吉蓉,赵宇,刘凯.改进U-Net在喉白斑病灶分割中的应用[J].计算机工程,2020,46(9):248-253. 被引量：5
9冯诺,宋余庆,刘哲.特征重用和注意力机制下肝肿瘤自动分类[J].中国图象图形学报,2020,25(8):1695-1707. 被引量：1
10郝华颖,赵昆,苏攀,张辉,赵一天,刘江.一种基于改进ResU-Net的角膜神经分割算法[J].计算机工程,2021,47(1):217-223. 被引量：8

同被引文献11

1王铭,唐红.肝功能评价体系现状和研究进展[J].中国肝脏病杂志（电子版）,2017,9(2):26-31. 被引量：14
2刘晓虹,朱玉全,刘哲,宋余庆,朱彦,袁德琪.基于改进多尺度LBP算法的肝脏CT图像特征提取方法[J].计算机科学,2019,46(3):125-130. 被引量：15
3温静,安国艳,梁宇栋.基于CNN特征提取和加权深度迁移的单目图像深度估计[J].图学学报,2019,40(2):248-255. 被引量：2
4胡文墨,杨华瑜,毛一雷.基于人工智能的影像组学在肝脏疾病中的应用[J].中华普通外科杂志,2019,34(7):646-648. 被引量：9
5顾广华,曹宇尧,崔冬,赵耀.基于形式概念分析和语义关联规则的目标图像标注[J].自动化学报,2020,46(4):767-781. 被引量：9
6曹建芳,赵爱迪,张自邦.融合阈值寻优的卷积神经网络在图像标注中的应用[J].计算机应用,2020,40(6):1587-1592. 被引量：4
7王琳,张素兰,杨海峰.基于CNN和加权贝叶斯的最近邻图像标注方法[J].计算机技术与发展,2021,31(10):63-69. 被引量：4
8黄炜嘉,张正言,杨魏,李垣江,李效龙,王泽辉.基于多尺度方向数值模式的肝功能分级方法[J].计算机工程与设计,2022,43(3):692-697. 被引量：2
9Wenhai Wang,Enze Xie,Xiang Li,Deng-Ping Fan,Kaitao Song,Ding Liang,Tong Lu,Ping Luo,Ling Shao.PVT v2:Improved baselines with Pyramid Vision Transformer[J].Computational Visual Media,2022,8(3):415-424. 被引量：65
10管阳.室内三维环境重建技术中的立体匹配算法研究与仿真[J].电子设计工程,2023,31(19):186-190. 被引量：1

引证文献4

1张正言,黄炜嘉,奚彩萍,杨魏,张惠惠.基于L1范数主成分分析网络的肝功能分级方法[J].江苏科技大学学报（自然科学版）,2023,37(5):65-71.
2刘泽,姜永利,丁志伟,刘永强.一种弱纹理目标立体匹配网络[J].计算机测量与控制,2024,32(4):174-179. 被引量：1
3贾迪,蔡鹏,吴思,王骞,宋慧伦.面向弱纹理目标立体匹配的Transformer网络[J].中国图象图形学报,2024,29(8):2413-2425.
4张国有,崔永强.基于双分支注意力机制的图像自动标注研究[J].计算机技术与发展,2024,34(9):167-173.

二级引证文献1

1顿云.中短波宽带天线及其匹配网络研究[J].电声技术,2024,48(7):131-133.

中国图象图形学报

2021年第7期

浏览历史

内容加载中请稍等...

多尺度深度特征提取的肝脏肿瘤CT图像分类被引量：4

参考文献1

二级参考文献5

共引文献16

同被引文献11

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

多尺度深度特征提取的肝脏肿瘤CT图像分类 被引量：4

参考文献1

二级参考文献5

共引文献16

同被引文献11

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

多尺度深度特征提取的肝脏肿瘤CT图像分类被引量：4