Automated labelling of radiology reports using natural language processing:Comparison of traditional and newer methods 被引量：1

下载PDF

导出

摘要 Automated labelling of radiology reports using natural language processing allows for the labelling of ground truth for large datasets of radiological studies that are required for training of computer vision models.This paper explains the necessary data preprocessing steps,reviews the main methods for automated labelling and compares their performance.There are four main methods of automated labelling,namely:(1)rules-based text-matching algorithms,(2)conventional machine learning models,(3)neural network models and(4)Bidirectional Encoder Representations from Transformers(BERT)models.Rules-based labellers perform a brute force search against manually curated keywords and are able to achieve high F1 scores.However,they require proper handling of negative words.Machine learning models require preprocessing that involves tokenization and vectorization of text into numerical vectors.Multilabel classification approaches are required in labelling radiology reports and conventional models can achieve good performance if they have large enough training sets.Deep learning models make use of connected neural networks,often a long short-term memory network,and are similarly able to achieve good performance if trained on a large data set.BERT is a transformer-based model that utilizes attention.Pretrained BERT models only require fine-tuning with small data sets.In particular,domain-specific BERT models can achieve superior performance compared with the other methods for automated labelling.

作者 Seo Yi Chng Paul J.W.Tern Matthew R.X.Kan Lionel T.E.Cheng

机构地区 Department of Paediatrics Department of Cardiology NUS High School of Mathematics and Science Department of Diagnostic Radiology

出处《Health Care Science》 2023年第2期120-128,共9页 科学医疗（英文）

关键词 automated labelling machine learning natural language processing neural network RADIOLOGY

分类号 O15 [理学—基础数学]

引文网络
相关文献

同被引文献4

1胡君花,黄倩,胡安宁.人工智能在医学影像数字X线摄影质量控制方面的技术优化[J].影像技术,2020,32(6):12-14. 被引量：3
2余淑洁.“互联网+人工智能”的全新质控模式在放射诊断质控工作中的应用[J].中医药管理杂志,2021,29(7):211-212. 被引量：2
3乔远罡,韩晓东,韩顺霞,廉洁,刘敬禹,李秀丽,杨艳.深度学习技术在胸部X射线片质量评价中的价值研究[J].中国医学装备,2021,18(8):25-28. 被引量：2
4孟宇,马之骋,阮敬儒,高阳,杨柏林,何林阳,龚向阳.基于Faster R卷积神经网络构建胸部X线片异物智能检测模型的可行性研究[J].中华放射学杂志,2022,56(12):1359-1364. 被引量：2

引证文献1

1张禹萱,谷宗运,鲁文豪,王倩,宋亮亮,李传富.基于深度学习的X线腰椎正侧位片的智能质量控制研究[J].中国中西医结合影像学杂志,2024,22(4):406-412.

1Qing Lyu,Josh Tan,Michael E.Zapadka,Janardhana Ponnatapura,Chuang Niu,Kyle J.Myers,Ge Wang,Christopher T.Whitlow.Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning:results,limitations,and potential[J].Visual Computing for Industry,Biomedicine,and Art,2023,6(1):109-118.
2Qian Wang,Qinwei Zhou,Di Zhang,Jiajia Shi,Siqian Cheng.Semantic Foundation and Evolution Mechanism of the“好不X”(haobu X)Structure[J].Journal of Contemporary Educational Research,2024,8(10):1-15.
3张成姝,林捷,曹辉,姜丽.基于OBE的《操作系统》课程多途径混合式教学研究[J].创新教育研究,2024,12(11):366-374.
4Francky Fouedjio.Random forest for spatial prediction of censored response variables[J].Artificial Intelligence in Geosciences,2021,2(1):115-127.
5Ruibin Lin,Xing Lv,Huanling Hu,Liwen Ling,Zehui Yu,Dabin Zhang.Dual-stage ensemble approach using online knowledge distillation for forecasting carbon emissions in the electric power industry[J].Data Science and Management,2023,6(4):227-238.
6刘琪,肖克晶,曹少中,张寒,姜丹.融合DeBERTa模型与图卷积网络的文本分类方法研究[J].人工智能与机器人研究,2024,13(4):715-725.
7王俊利,朱大磊,李安明,滕达.“四学一体”混合式教学模式下“通信原理”课程思政的探索与实[J].职业教育发展,2024,13(6):2485-2491.
8李芳.跨境电商中的知识产权法律风险及应对[J].电子商务评论,2024,13(4):4654-4659.
9Binsen Xu,Zhou Feng,Jun Zhou,Rongbo Shao,Hongliang Wu,Peng Liu,Han Tian,Weizhong Li,Lizhi Xiao.Transfer learning for well logging formation evaluation using similarity weights[J].Artificial Intelligence in Geosciences,2024,5(1):294-309.
10颜志远,解壁伟,包云岗.HVMS:基于混合向量化的SpMV优化机制[J].计算机研究与发展,2024,61(12):2969-2984.

Health Care Science

2023年第2期

浏览历史

内容加载中请稍等...

Automated labelling of radiology reports using natural language processing:Comparison of traditional and newer methods 被引量：1

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史