基于文本引导下的多模态医学图像分析算法

A Multi-Modal Medical Image Analysis Algorithm Based on Text Guidance

下载PDF

导出

摘要结合胃镜超声和白光内镜可以更准确地识别胃肠道间质瘤.但是现有的多模态方法往往仅关注于图像特征,忽略了诊断文本信息中所包含的语义信息对于精确理解和诊断医学图像的重要性.为此,本文提出一种新的基于文本引导下的多模态医学图像分析算法框架(Text-guided Multi-modal Medical image analysis framework,TMM-Net).TMM-Net使用多阶段的诊断文本来引导模型学习,以提取图像中的关键诊断信息特征,然后通过交叉模态注意力机制促进多模态特征之间的交互.值得注意的是,TMM-Net通过预测病变属性来模拟临床诊断过程,从而增强了可解释性.验证实验在两个中心包含10 025个模态数据对的数据集上进行.结果表明,该方法相比目前最优的GISTs诊断方法精度提升7.7%,同时获得了最高的(Area Under the Curve,AUC)值:0.927,其可解释性可以更好地适合临床需求. Combining gastroscopy ultrasound and white light endoscopy can improve the accuracy of identifying gas⁃trointestinal stromal tumors(GISTs).However,existing multi-modal methods often focus solely on image features and over⁃look the semantic relevance contained in diagnostic textual information,which is crucial for precise understanding and diag⁃nosis of medical images.To address this issue,we propose a novel text-guided multi-modal medical image analysis frame⁃work(TMM-Net).TMM-Net extracts key diagnostic information features from images through a multi-stage guided model of diagnostic text,and then promotes the interaction of multi-modal features through cross-modal attention mechanisms.Nota⁃bly,TMM-Net simulates the clinical diagnostic process by predicting lesion attributes,enhancing interpretability.Validation experiments were conducted on a dataset consisting of 10025 modality data pairs from two centers.The results show that the proposed method achieves a 7.7%improvement in accuracy compared to the current state-of-the-art GISTs diagnostic meth⁃od,with the highest AUC(Area Under the Curve)value of 0.927,and its interpretability may better suit clinical needs.

作者樊琳龚勋郑岑洋 FAN Lin;GONG Xun;ZHENG Cen-yang(School of Computing and Artificial Intelligence,Southwest Jiaotong University,Chengdu,Sichuan 611756,China;Engineering Research Center of Sustainable Urban Intelligent Transportation,Ministry of Education,Chengdu,Sichuan 611756,China;National Engineering Laboratory of Integrated Transportation Big Data Application Technology,Chengdu,Sichuan 611756,China;Manufacturing Industry Chains Collaboration and Information Support Technology Key Laboratory of Sichuan Province,Chengdu,Sichuan 611756,China)

机构地区西南交通大学计算机与人工智能学院可持续城市交通智能化教育部工程研究中心综合交通大数据应用技术国家工程实验室四川省制造业产业链协同与信息化支撑技术重点实验室

出处《电子学报》 EI CAS CSCD 北大核心 2024年第7期2341-2355,共15页 Acta Electronica Sinica

基金国家自然科学基金(No.62376231) 四川省重点研发项目(No.2023YFG0267) 四川省卫生健康委员会科技项目(No.23LCYJ022)~~。

关键词多模态融合模型可解释性图像-文本匹配胃肠道间质瘤胃镜超声白光内镜 multi-modal fusion model interpretability image-text matching gastrointestinal stromal tumor gastro⁃scopic ultrasound white light endoscopy

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1杜盼盼,王敬如.基于GNN的多模态医学图像分割算法分析[J].集成电路应用,2024,41(7):341-343.
2王颖彬.活用论证方法,增强说理效果[J].演讲与口才,2023(22):8-9.
3邓成黎,赖庆奎.贯彻产业协同赋能城乡融合发展——学习党的二十届三中全会精神[J].当代农村财经,2024(9):38-42.
4叶再元,章笑.胃肠间质瘤的诊治进展和展望[J].中国医师杂志,2024,26(8):1121-1128.
5李维坤,邵欣欣,胡海涛,卢一鸣,王鹏,杜永星,徐泉,田艳涛.腹腔镜胃间质瘤手术切除策略分析[J].中华腔镜外科杂志（电子版）,2024,17(3):141-145.
6章笑,余杰达,吴芳,叶再元.胃肠间质瘤转移或复发后治疗的meta分析和系统评价[J].中国医师杂志,2024,26(8):1140-1145.
7陈振光,罗瑶,于金源,吴松阳,吴宁,叶再元.不行标记和黏膜下注射的内镜黏膜下肿瘤挖除术治疗胃小胃肠间质瘤的疗效和安全性[J].中国医师杂志,2024,26(8):1146-1150.
8林奇忆,陈丽玲,李龙钦,王怀帅,庄奕翔,李银林,蔡志聪,潘健鹏,陈剑鹏,郭滔,林高枫,许国玺.术前自体血定位法在腹腔镜下胃不利部位胃间质瘤切除手术中的应用效果[J].中国医师杂志,2024,26(8):1137-1139.
9杨萍,赵晓芳,唐华丽,李易,毛芸.比较4种疗效评估标准的首次评估在胃肠间质瘤伊马替尼治疗中的预后预测价值[J].重庆医科大学学报,2024,49(8):1045-1051.
10王伟丰,吴芳,蔡徐帆,章笑,叶再元.术中超声定位在非腔外生长的胃肠间质瘤腹腔镜手术中的临床应用[J].中国医师杂志,2024,26(8):1133-1136.

电子学报

2024年第7期

浏览历史

内容加载中请稍等...

基于文本引导下的多模态医学图像分析算法

相关作者

相关机构

相关主题

浏览历史