Knowledge Augmentation on Traditional Chinese Medicine Language Model

Abstract: In recent years, large language models (LLMs) have achieved significant results in many fields. Applying them to traditional Chinese medicine (TCM) remains challenging, however, because they lack specialized knowledge and because TCM and modern medicine rest on different ways of thinking. Existing knowledge augmentation methods also fail to preserve the inherent structure of TCM prescriptions. To address these problems, a new knowledge augmentation method is proposed, consisting of three parts: model training, knowledge graph construction, and knowledge augmentation. In the model training phase, a base LLM is pre-trained and then fine-tuned on TCM datasets to obtain a TCM-domain LLM. In the knowledge graph construction phase, a TCM knowledge graph is built from a cleaned dataset of nearly 100,000 classical TCM prescriptions together with prescriptions from ancient books. In the knowledge augmentation phase, results are computed from the specialized knowledge retrieved from the graph and from the graph structure itself, so that the structural properties of TCM prescriptions are preserved. For the TCM prescription optimization (herb compatibility) task, a set of task-specific evaluation criteria, comprising subjective and objective indicators, is proposed to assess model performance. Experiments show that the method improves substantially over the baseline models on both subjective and objective indicators: BLEU-1 increases by up to 0.09 and ROUGE-1 by up to 0.21. Ablation experiments show that knowledge augmentation contributes substantially to performance on this task: without it, the model's BLEU-1 is about 37% lower than with it.
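The objective indicators reported in the abstract, BLEU-1 and ROUGE-1, are both unigram-overlap scores. The following is a minimal sketch of how such scores are commonly computed, not the authors' evaluation code. The herb lists are hypothetical, the tokenization into herb names is an assumption, and ROUGE-1 is shown in its recall form, since the abstract does not say which variant was used.

```python
from collections import Counter
import math


def unigram_overlap(candidate, reference):
    """Clipped count of unigrams shared by candidate and reference."""
    cand_counts = Counter(candidate)
    ref_counts = Counter(reference)
    return sum(min(n, ref_counts[tok]) for tok, n in cand_counts.items())


def bleu1(candidate, reference):
    """BLEU-1: clipped unigram precision times the brevity penalty."""
    if not candidate:
        return 0.0
    precision = unigram_overlap(candidate, reference) / len(candidate)
    if len(candidate) >= len(reference):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(reference) / len(candidate))
    return brevity_penalty * precision


def rouge1_recall(candidate, reference):
    """ROUGE-1 recall: share of reference unigrams covered by the candidate."""
    if not reference:
        return 0.0
    return unigram_overlap(candidate, reference) / len(reference)


def relative_drop(with_augmentation, without_augmentation):
    """Relative decrease of a score when augmentation is removed."""
    return (with_augmentation - without_augmentation) / with_augmentation


# Hypothetical token lists (one token per herb), not data from the paper.
reference = ["ginseng", "atractylodes", "poria", "licorice"]
candidate = ["ginseng", "poria", "licorice", "ginger"]

print(bleu1(candidate, reference))
print(rouge1_recall(candidate, reference))
```

Under this reading, a statement such as "BLEU-1 is about 37% lower" corresponds to `relative_drop` returning roughly 0.37 for the augmented and non-augmented scores, while "increased by up to 0.09" is an absolute difference between two scores.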

Authors: JI Xiangyu; WANG Xin; ZHANG Heyi; MENG Zhaopeng; ZHANG Junhua; ZHUANG Pengwei; JIA Yongzhe; XU Dawei

Affiliations: College of Intelligence and Computing, Tianjin University, Tianjin 300350, China; Tianjin University of Traditional Chinese Medicine, Tianjin 300193, China; National Clinical Research Center for Chinese Medicine Acupuncture and Moxibustion, First Teaching Hospital of Tianjin University of Traditional Chinese Medicine, Tianjin 300193, China; Tiandazhitu (Tianjin) Technology Co., Ltd., Tianjin 300192, China

Source: Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), 2024, No. 10, pp. 2616-2629 (14 pages). Indexed in CSCD and the Peking University Core Journals list.

Funding: General Program of the National Natural Science Foundation of China (61972275).

Keywords: large language model (LLM); traditional Chinese medicine; prescription optimization; retrieval augmented generation

Classification: TP391.1 [Automation and Computer Technology: Computer Application Technology]

