期刊文献+

面向中医古籍的隐式关系抽取方法研究

Research on Traditional Chinese Medical Text Implicit Relation Extraction Method
下载PDF
导出
摘要 自然语言种类丰富、形式灵活多变的特征使得隐式关系抽取成为目前关系抽取领域中富有难度和挑战性的任务之一。通过引入构式语法理论和依存句法分析两种认知语言学范畴的理论技术,构建了一种面向中医古籍中隐式关系的抽取方法。首先利用构式语法理论制定文本构式化策略、分析并定义出8种构式特征与5种构式类型,并使用CART(classification and regression tree,CART)分类模型完成文本分类;其次对其中4类构式使用依存句法分析技术构建句法树,通过分析句法树中的特定结构,制定医学类实体间的关系三元组抽取规则,实现隐式关系抽取;最后在经典中医古籍《黄帝内经》数据集上进行测试,实验结果表明了方法的有效性。 The variety of natural languages and their flexible forms are very rich,which makes implicit relation extraction one of the difficult and challenging tasks in the field of relation extraction.Two theoretical techniques were introduced in the field of cognitive linguistics,namely constructive grammar theory and dependent syntactic analysis,to construct a method for extracting implicit relations in traditional Chinese medical texts.Firstly,the constructive grammar theory was used to formulate a text structuring strategy,analyze and define eight constructive features and five constructive types,and the CART classification model was used to classify the text.Secondly,the dependent syntactic analysis technique was used to construct a syntactic tree for four constructs,and by analyzing the specific structure of the syntactic tree,the extraction rules of the relational triad between medical entities were formulated to realize the implicit relation extraction.Finally,the tests were conducted on the dataset of the classic traditional Chinese medical text,i.e.,Huangdi Neijing,and the experimental results showed the effectiveness of the method.
作者 马月坤 冯烨琛 MA Yuekun;FENG Yechen(College of Artificial Intelligence,North China University of Science and Technology,Tangshan 063210,China;Hebei Provincial Key Laboratory of Industrial Intelligent Perception,Tangshan 063210,China;School of Computer&Communication Engineering,University of Science&Technology Beijing,Beijing 100083,China;Beijing Key Laboratory of Knowledge Engineering for Materials Science,Beijing 100083,China)
出处 《郑州大学学报(理学版)》 CAS 北大核心 2024年第2期34-42,共9页 Journal of Zhengzhou University:Natural Science Edition
基金 河北省三三三人才项目(A201803082)。
关键词 关系抽取 中医古籍 隐式关系 构式语法理论 依存句法分析 relation extraction traditional Chinese medical implicit relation constructive grammar theory dependent syntactic analysis
  • 相关文献

参考文献7

二级参考文献57

共引文献59

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部