自适应迁移鲁棒特征的个性化联邦医学图像分类

Personalized federated medical image classification with adaptive transfer robust features

导出

摘要目的针对联邦学习中多中心医学数据的异质性特征导致全局模型性能不佳的问题,提出一种基于特征迁移的自适应个性化联邦学习算法(adaptive personalized federated learning via feature transfer,APFFT)。方法首先,为降低全局模型中异质性特征信息影响,提出鲁棒特征选择网络(robust feature selection network,RFS-Net)构建个性化本地模型。RFS-Net通过学习两个迁移权重分别确定全局模型向本地模型迁移时的有效特征以及特征迁移的目的地,并构建基于迁移权重的迁移损失函数以加强本地模型对全局模型中有效特征的注意力,从而构建个性化本地模型。然后,为过滤各本地模型中异质性特征信息,利用自适应聚合网络(adaptive aggregation network,AANet)聚合全局模型。AA-Net基于全局模型交叉熵变化更新迁移权重并构建聚合损失,使各本地模型向全局模型迁移鲁棒特征,提高全局模型的特征表达能力。结果在3种医学图像分类任务上与4种现有方法进行比较实验,在肺结核肺腺癌分类任务中,各中心曲线下面积(area under the curve,AUC)分别为0.7915,0.7981,0.7600,0.7057和0.8069;在乳腺癌组织学图像分类任务中,各中心准确率分别为0.9849、0.9808、0.9835、0.9826和0.9834;在肺结节良恶性分类任务中,各中心AUC分别为0.8097,0.8498,0.7848和0.7923。结论所提出的联邦学习方法,降低了多中心的异质性特征影响,实现基于鲁棒特征的个性化本地模型自适应构建和全局模型自适应聚合,模型性能有较大提升。 Objective Patient data cannot be shared among medical institutions due to medical data confidentiality regula⁃tions,considerably limiting data scale.Federated learning ensures that all clients can train local models and aggregate global models in a decentralized manner without sharing data.However,the heterogeneity of medical data substantially affects the aggregation and deployment of global models in federated learning.In most federated learning methods,the aggregation of global model parameters is achieved by multiplying the fixed weight with the local model parameters and then summing them.The local model personalization method requires a large number of manual experiments to select the appro⁃priate model layer for personalization construction.Although these methods can realize the aggregation of global models or the construction of personalized local models,they cannot automatically aggregate global model parameters and construct personalized local models.Moreover,they lack pertinence to heterogeneity characteristics.Therefore,an adaptive person⁃alized federated learning algorithm via feature transfer(APFFT)is proposed.This algorithm can automatically identify and select robust features for personalized local model construction and global model aggregation.It can also suppress and filter heterogeneous feature information.Method To construct a personalized local model,a robust feature selection network(RFS-Net)was proposed in this study.RFS-Net can automatically identify and select features by calculating transfer weights and the amount of feature transfer on the basis of model representation.When transferring features from a global model to a local model,RFS-Net constructs transfer loss functions on the basis of transfer weights and the amount of feature transfer to constrain the local model and strengthen its attention toward effective transfer features.In the aggregation of the global model,the adaptive aggregation network(AA-Net)was proposed to transfer features from the local model to the global model.AA-Net updated the transfer weight and constructed the aggregation loss on the basis of the cross-entropy change of the global model for filtering the heterogeneity feature information of each local model.In this study,PyTorch was used to build and train the models,while ResNet18 was used for the convolutional neural network(CNN)structure.RFS-Net and AA-Net were composed of fully connected,pooling,softmax,and ReLU6 layers.The parameters of RFSNet,AA-Net,and the CNN were updated via stochastic gradient descent with a momentum of 0.9.Experiments were con⁃ducted on three medical image datasets:the nonpublic dataset of pulmonary adenocarcinoma and tuberculosis classifica⁃tion,the public dataset Camelyon17,and the public dataset LIDC.The dataset of pulmonary adenocarcinoma tuberculosis classification came from 5 hospitals,with 1009 cases.Among which,Center 1(training set n=260,test setn=242),Center 2(training set n=34,test set n=54),Center 3(training set n=39,test set n=40),Center 4(training set n=145,test set n=108),and Center 5(training set n=36,test set n=51)were used in the experiment.The learning rate and decay rate of RFS-Net and AA-Net were both 0.0001,while the learning rate and decay rate of the CNN were 0.001 and 0.0005,respectively.Focal loss was used to calculate cross-entropy.In addition,gender,age,and nodule size in clinical information are of considerable reference value in the diagnosis of tuberculosis and lung adenocarcinoma.There⁃fore,we provided statistics for this information,and the results showed that in Center 2,the overall age and nodule size were small,while in Center 4,the overall nodule size was large,exhibiting a certain gap with the global average level.Camelyon17 was composed of 450000 histological images from 5 hospitals.In the experiment,the learning rate and decay rate of the CNN,RFS-Net,and AA-Net were all 0.0001.Standard cross-entropy was used to constrain CNN training.LIDC data came from 7 research institutions and 8 medical image companies,with 1018 cases.Lesions with Grades 1 to 2 malignancies were classified as benign,while those with Grades 4 to 5 malignancies were classified as malignant.Finally,1746 lesions were included in the dataset to simulate the federated learning application scenario.The lesions were then randomly divided into 4 centers in accordance with the cases.Center 1(training set n=254,test set n=169),Center 2(training set n=263,test set n=190),Center 3(training set n=305,test set n=124),and Center 4(training set n=247,test set n=194)were used in the experiment.The learning rate and decay rate of RFS-Net and AA-Net were both 0.0001.The learning rate and decay rate of the CNN were 0.001 and 0.0001,respectively.The cross-entropy loss was calculated using standard cross-entropy.Result Three types of medical image classification tasks were compared with four existing methods.The evaluation indexes included receiver operating characteristic(ROC)and accuracy.The experimen⁃tal results showed that in the tuberculosis lung adenocarcinoma classification task,the center test sets of the end-to-end area under the ROC curve(AUC)were 0.7915,0.7981,0.76,0.7057,and 0.8069.In the breast cancer histological image classification task,the center test sets of end-to-end accuracy were 0.9849,0.9808,0.9835,0.9826,and 0.9834.In the pulmonary nodule benign and malignancy classification task,the center test sets of the end-to-end AUC were 0.8097,0.8498,0.7848,and 0.7923.Conclusion The federated learning method proposed in this study can reduce the influence of heterogeneous characteristics and realize the adaptive construction of personalized local models and the adaptive aggregation of global models.The results show that our model is superior to several existing federated learning methods,and model performance is considerably improved.

作者陆森良冯宝徐坤财陈业航陈相猛 Lu Senliang;Feng Bao;Xu Kuncai;Chen Yehang;Chen Xiangmeng(School of Electronic Engineering and Automation,Guilin University of Electronic Technology,Guilin 541004,China;Laboratory of Intelligent Detection and Information Processing,Guilin University of Aerospace Technology,Guilin 541004,China;Laboratory of Intelligent Computing and Application of Medical Imaging,Jiangmen Central Hospital,Jiangmen 529000,China)

机构地区桂林电子科技大学电子工程与自动化学院桂林航天工业学院智能检测与信息处理实验室江门市中心医院医学影像智能计算及应用实验室

出处《中国图象图形学报》 CSCD 北大核心 2024年第3期798-810,共13页 Journal of Image and Graphics

基金国家自然科学基金项目(81960324,62176104) 广西自然科学基金项目(2021GXNSFAA075037) 广东省医学科学技术研究基金项目(A2021138) 桂林航天工业学院校级科研基金项目(XJ21KT24)。

关键词特征迁移联邦学习异质性特征鲁棒特征选择网络自适应聚合网络医学图像分类 feature transfer federated learning heterogeneity features robust feature selection network adaptive aggre⁃gation network medical image classification

分类号 TN911.73-34 [电子电信—通信与信息系统] TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1陈弘扬,高敬阳,赵地,汪红志,宋红,苏庆华.深度学习与生物医学图像分析2020年综述[J].中国图象图形学报,2021,26(3):475-486. 被引量：20
2黎英,宋佩华.迁移学习在医学图像分类中的研究进展[J].中国图象图形学报,2022,27(3):672-686. 被引量：18

二级参考文献28

1程波,朱丙丽,熊江.基于多模态多标记迁移学习的早期阿尔茨海默病诊断[J].计算机应用,2016,36(8):2282-2286. 被引量：6
2陈诗慧,刘维湘,秦璟,陈亮亮,宾果,周煜翔,汪天富,黄炳升.基于深度学习和医学图像的癌症计算机辅助诊断研究进展[J].生物医学工程学杂志,2017,34(2):314-319. 被引量：54
3郑光远,刘峡壁,韩光辉.医学影像计算机辅助检测与诊断系统综述[J].软件学报,2018,29(5):1471-1514. 被引量：68
4褚晶辉,吴泽蕤,吕卫,李喆.基于迁移学习和深度卷积神经网络的乳腺肿瘤诊断系统[J].激光与光电子学进展,2018,55(8):196-202. 被引量：26
5张巧丽,迟学斌,赵地.基于深度学习的帕金森病症早期诊断[J].计算机系统应用,2018,27(9):1-9. 被引量：8
6张泽中,高敬阳,吕纲,赵地.基于深度学习的胃癌病理图像分类方法[J].计算机科学,2018,45(B11):263-268. 被引量：24
7汪红志,赵地,杨丽琴,夏天,周皛月,苗志英.基于AI+MRI的影像诊断的样本增广与批量标注方法[J].波谱学杂志,2018,35(4):447-456. 被引量：10
8苏庆华,张姗姗,蔡磊,谷焓,李奕飞,俞戈昊,江方舟,白翰林,赵地.基于三维分类网络的前列腺辅助诊断[J].中国数字医学,2019,14(3):18-21. 被引量：2
9颜嵩林,林溢星,李鹤喜,赵地,迟学斌.基于多重迁移学习的糖尿病视网膜病变检测[J].中国数字医学,2019,14(3):26-30. 被引量：3
10金祝新,秦飞巍,方美娥.深度迁移学习辅助的阿尔兹海默氏症早期诊断[J].计算机应用与软件,2019,36(5):171-177. 被引量：5

共引文献36

1梅少辉,张博威,马明阳,贾森.近红外高光谱图像数据预测技术[J].中国图象图形学报,2021,26(8):1786-1795. 被引量：3
2陈弘扬,高敬阳,赵地,吴忌,陈金军,全显跃,李欣明,薛峰,周沐瑶,柏冰冰.LFSCA-UNet:基于空间与通道注意力机制的肝纤维化区域分割网络[J].中国图象图形学报,2021,26(9):2121-2134. 被引量：7
3赵地,卜刚.脑机接口信号处理的研究进展[J].人工智能,2021(6):26-32. 被引量：1
4田翠杰,刘志友,高峰,张胜.模式识别方法在腹部超声诊断中的研究[J].中国医学装备,2022,19(3):39-42.
5赵人行,徐频捷,刘瑶.基于深度卷积残差网络的心电单导联房颤检测方法[J].计算机科学,2022,49(5):186-193. 被引量：1
6黎英,宋佩华.迁移学习在医学图像分类中的研究进展[J].中国图象图形学报,2022,27(3):672-686. 被引量：18
7黄帅辉,王金凤.融入平滑组稀疏化的脑部MRI图像分类[J].中国图象图形学报,2022,27(3):885-897. 被引量：1
8李延铭,李长升,余佳奇,袁野,王国仁.基于真实数据感知的模型功能窃取攻击[J].中国图象图形学报,2022,27(9):2721-2732.
9管宽岐,蔺雨桐,赵雨薇,秦列列,张楠楠,曹英丽.基于深度学习的航拍光伏板红外图像热斑检测方法研究[J].电子测量技术,2022,45(22):75-81. 被引量：2
10邓辉,张洁.基于改进的ResNet50网络的黑色素瘤分类方法[J].计算机技术与发展,2023,33(2):64-70.

1许跃雯,李明,李莉.基于对比学习MocoV2的COVID-19图像分类[J].计算机与现代化,2024(2):81-87.
2白浩田,谷宇,杨立东,张宝华,李建军,吕晓琪,唐思源,张祥松,贾成一,贺群.改进知识蒸馏Transformer的新冠肺炎医学影像分类[J].激光杂志,2024,45(2):152-160.
3陈伟杰,黄国恒,莫非,林俊宇.层次信息自适应聚合的图像超分辨率重建算法[J].计算机工程与应用,2024,60(5):221-231.
4钟静,方冰,朱江.基于稀疏矩阵结构的特征选择算法现状研究[J].信息网络安全,2024(3):352-362.
5蔡改贫,肖文聪,黄耀锋.领域对抗与分类差异的变工况球磨机负荷识别[J].电子测量与仪器学报,2023,37(12):67-75.
6张思甜,刘军清,康维.基于改进RetinaNet模型的口罩规范佩戴检测方法[J].长江信息通信,2024,37(2):35-38.
7王鹏,李丹青,王恒.基于改进交替迁移学习的滚动轴承故障诊断算法[J].振动与冲击,2024,43(5):239-249.
8王江安,黄乐,庞大为,秦林珍,梁温茜.基于自适应聚合循环递归的稠密点云重建网络[J].图学学报,2024,45(1):230-239.
9张义定,雷锦志.异质性干细胞增殖过程中的熵变化[J].生物信息学,2024,22(1):58-69.
10魏超楠,王红,邓艺杰.基于信息技术的乘务专业大学生身体机能监测与分析研究[J].文体用品与科技,2024(6):126-128.

中国图象图形学报

2024年第3期

浏览历史

内容加载中请稍等...

自适应迁移鲁棒特征的个性化联邦医学图像分类

参考文献2

二级参考文献28

共引文献36

相关作者

相关机构

相关主题

浏览历史