中医药大语言模型的关键技术与构建策略

Key technologies and construction strategies of large language models for traditional Chinese medicine

导出

摘要大语言模型(large language model,LLM)通过处理和理解自然语言数据,实现高质量的信息检索、知识提取等功能,为中医药研究提供了新机遇。基于中医药大模型发展现状,梳理了LLM开发过程中的数据存储与处理方法,概述了检索增强生成、混合专家模型、人类反馈强化学习、知识蒸馏等人工智能方法,归纳了LLM训练微调与性能评价方法。针对中医药数据的特点,从高质量数据集构建、多领域专家系统融合、信息快速提取、训练与调优等方面入手,提出了中医药LLM的构建策略,并分析了LLM在中医药领域的具体应用场景,为中医药领域LLM的构建和应用提供参考,推动中医药现代化和智能化发展。 By processing and understanding natural language data,large language models(LLM)enable the high-quality information retrieval,knowledge extraction,etc.,and provide new opportunities for traditional Chinese medicine(TCM)research.Based on recent developments of LLM in TCM,the present work summarizes the data storage and processing algorithms,as well as artificial intelligence methods,such as retrieval-augmented generation,mixture of experts,reinforcement learning from human feedback,and knowledge distillation for developing LLM.It also summarizes methods for training fine-tuning and performance evaluation of LLM.In response to the characteristics of TCM data,strategies for developing LLM for TCM are proposed,which focuses on developing high-quality datasets,integrating mixture of experts,rapid information extraction,and model training and optimization.Additionally,it outlines specific application scenarios of LLM in TCM.The aim of this work is to provide insights for the development and application of LLM in TCM,promoting the modernization and intelligent development of TCM.

作者萧文科宋驰陈士林陈伟 XIAO Wenke;SONG Chi;CHEN Shilin;CHEN Wei(Innovative Institute of Chinese Medicine and Pharmacy/Academy for Interdiscipline,Chengdu University of Traditional Chinese Medicine,Chengdu 611137,China;Institute of Herbgenomics,Chengdu University of Traditional Chinese Medicine,Chengdu 611137,China)

机构地区成都中医药大学中医药创新研究院/交叉学科研究院成都中医药大学本草基因组学研究院

出处《中草药》 CAS CSCD 北大核心 2024年第17期5747-5756,共10页 Chinese Traditional and Herbal Drugs

基金成都中医药大学引进人才项目(030041225)。

关键词中医药大语言模型混合专家系统检索增强生成人类反馈强化学习知识蒸馏 traditional Chinese medicine large language models mixture of experts retrieval-augmented generation reinforcement learning from human feedback knowledge distillation

分类号 R28 [医药卫生—中药学] TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1周细斌,李秀梅,杨繁,唐志强,谢经荣,周鸣惊.基于地空雷达数据的人工池杉林单木信息提取[J].湖北林业科技,2024,53(4):34-38.
2胡铁骊,周博翔,于浩,朱静,陈纯玉.中医药数据资源目录体系构建与研究[J].医学信息学杂志,2024,45(3):46-50.
3雷银香,熊科云.中医药领域不平衡数据的特征选择和分类方法研究[J].信息与电脑,2023,35(24):55-57. 被引量：1
4苟思媛,杨先照,王龙珠,田羽佳,茹淑瑛.基于文献挖掘探析肝癌前病变的中医临床用药规律[J].药学前沿,2024,28(9):80-89.
5岳龙.VBA技术在测量标志普查数据入库中的应用[J].测绘与空间地理信息,2024,47(3):180-181.
6闫锦崴,郑蔚恒,于鹏.基于谷歌地球引擎平台的海上养殖信息提取方法研究——以福建省平潭县为例[J].应用海洋学学报,2024,43(2):360-370.
7闫菁,唐淑芬.无人机遥感在城市园林绿化调查的应用研究[J].江西通信科技,2024(1):37-41.
8唐明.类风湿性关节炎实验模型的研究进展[J].湖北职业技术学院学报,2024,27(4):109-112.
9高爽.“互联网+”背景下企业会计电算化的实施问题与对策[J].中国管理信息化,2024,27(18):83-85.
10王燕妮,周志雄.人工智能在中小学生身体活动监测中的应用[J].体育教学,2024,44(10):81-82.

中草药

2024年第17期

浏览历史

内容加载中请稍等...

中医药大语言模型的关键技术与构建策略

相关作者

相关机构

相关主题

浏览历史