融合多粒度语义信息和知识图谱的中文医疗问答匹配模型

Chinese Medical Q&A Matching Model Based on Multi-Granularity Semantic Information and Knowledge Graph

下载PDF

导出

摘要中文医疗领域问答容易受到医疗特定词汇的噪声影响,相对于开放领域问答其更具有挑战性。以往的中文医疗问答研究主要依赖于字符级别的细粒度信息,忽略了携带更多语义信息的单词级别的粗粒度信息。此外,引入外部医学知识图谱可以进一步丰富问答句子中的细粒度信息,然而目前大多数研究通常只采用句子和外部知识共同表示的简单方式。由此提出一种融合多粒度语义信息和知识图谱的中文医疗问答匹配模型(CMQA-MGSI)。该模型引入Lattice网络,结合Word2Vec和BERT设计了两种特征向量提取模型来选择问答句子中最相关的字符序列和单词序列以获得更丰富的多粒度语义信息;为了更好地融合外部领域知识,设计双通道注意力模块提取问答句子和知识图谱中实体嵌入以及关系嵌入之间多个角度的知识表征信息。该模型在数据集cMedQA1.0和cMedQA2.0上的实验表明,效果优于现有的问答匹配模型。 Chinese medical Q&A is easily affected by the noise of medical-specific terminology,making it more challenging than open-domain Q&A.Previous studies on Chinese medical Q&A mainly relied on character-level fine-grained information,neglecting word-level coarse-grained information that carries more semantic information.In addition,introducing external medical knowledge graph can further enrich the fine-grained information in Q&A sentences,but most existing studies usually adopt a simple way of joint representation of sentences and external knowledge.Therefore,this paper proposes a Chinese medical Q&A matching model based on multi-granularity semantic information and knowledge graph(CMQA-MGSI).The model employs a Lattice network to select the most relevant character-level and word-level sequences from the Q&A sentences,and leverages Word2Vec and BERT to enhance the semantic information;to better exploit the external domain knowledge,a dual-channel attention mechanism is devised to capture the multi-angle knowledge representations between the Q&A sentences and the entity embeddings and relation embeddings in the knowledge graph.Experiments on the cMedQA1.0 and cMedQA2.0 datasets demonstrate that the proposed model outperforms existing Chinese medical Q&A matching models.

作者管立本李实 GUAN Liben;LI Shi(College of Computer and Control Engineering,Northeast Forestry University,Harbin 150040,China)

机构地区东北林业大学计算机与控制工程学院

出处《计算机工程与应用》 CSCD 北大核心 2024年第14期152-161,共10页 Computer Engineering and Applications

关键词中文医疗问答多粒度信息知识图谱 Lattice网络注意力机制 Chinese medical Q&A multi-granularity information knowledge graph Lattice network attention mechanism

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1刘芳,于斐.面向医疗行业的智能问答系统研究与实现[J].微电子学与计算机,2012,29(11):95-98. 被引量：9

二级参考文献5

1吴晓珊,王茜.加强统计信息工作促进医院信息化管理[J].中国医院统计,2004,11(1):66-67. 被引量：36
2季永华,许华虎,沈敏,万杰.自动答疑系统的研究与实现[J].计算机工程与应用,2005,41(14):224-225. 被引量：15
3Li SJ. Semantic computation in a Chinese question an-swer system[Jl. Computer. Science and Technology,2002, 17(16): 933-939.
4Burke R D, Hammond K J, Kulyukin. Question an-swering from frequently asked question files: experi-ences with the FAQ finder system P[J]. Al Magazine,1997(18):57-66.
5钟义信.中国人工智能进展[M].北京:北京邮电大学出版社,2001:1129-1132.

共引文献8

1丁东.基于E-Learning的医疗行业知识创新服务组织[J].科技创业月刊,2013,26(8):21-23. 被引量：2
2徐启菊.基于CFN的家庭医生问答系统设计[J].商,2013,0(20):211-212.
3孙程琳,夏宇航,刘旭利,高炬,刘珉,殷亦超,阮彤.基于自然语言问题的电子病历分析工具—QReport[J].山西大学学报（自然科学版）,2018,41(1):23-33. 被引量：8
4颜昕.基于自然语言处理的医疗健康问答系统[J].通讯世界,2018,0(6):255-256. 被引量：5
5王浩畅,李斌.聊天机器人系统研究进展[J].计算机应用与软件,2018,35(12):1-6. 被引量：25
6黄梦禧,张青川,陈龙,王世锦.面向医学领域的智能问答APP设计与实现[J].软件导刊,2019,18(3):94-99. 被引量：2
7刘羿,冯子恩,万晓娴.基于知识图谱的急诊问答系统[J].电脑与电信,2020(4):51-55. 被引量：6
8赵沛时,葛亮,张晓阳,.基于交通知识的移动智能问答系统[J].电子测试,2016,27(12):25-28.

1王霄,万玉晴.一种面向法律文书的命名实体识别方法[J].计算机应用与软件,2023,40(8):180-186. 被引量：1
2张智雄.在开放科学和AI时代塑造新型学术交流模式[J].中国科技期刊研究,2024,35(5):561-567.
3陈灏,钟锦宸.高水平金融开放拓展中国式现代化发展空间的机制与路径研究[J].新疆大学学报（哲学社会科学版）,2024,52(4):82-92.
4徐徕,陈陶然.最优金融开放度探析:金融开放水平与经济增长速度之间的“倒U形”关系研究[J].世界经济研究,2024(6):30-44.

计算机工程与应用

2024年第14期

浏览历史

内容加载中请稍等...

融合多粒度语义信息和知识图谱的中文医疗问答匹配模型

参考文献1

二级参考文献5

共引文献8

相关作者

相关机构

相关主题

浏览历史