【目的/意义】为了明晰科学实体在学术文本中承担的语义角色,进而建立特定领域的术语的知识结构,本文提出一种以科学实体词间关系为特征工程的术语分类方法,从学术研究的语义属性角度,将学术文本中出现的科学实体分为“研究领域”“研...【目的/意义】为了明晰科学实体在学术文本中承担的语义角色,进而建立特定领域的术语的知识结构,本文提出一种以科学实体词间关系为特征工程的术语分类方法,从学术研究的语义属性角度,将学术文本中出现的科学实体分为“研究领域”“研究问题”“研究方法”“研究工具”“其他”五类。【方法/过程】采用依存句法分析的方法,对于学术文本中存在两个及两个以上科学实体的句子,挖掘它们之间的最短依存路径,将最短依存路径上的谓词成分作为实体之间的关系进行提取,构造2D矩阵作为卷积神经网络的输入,完成实体的分类研究。【结果/结论】该模型在Web of Science上获取的“人工智能”领域的学术文献进行验证,精确率为89.38%,召回率为92.46%,F1值为0.9089。【创新/局限】由科学实体关系构成的矩阵是稀疏矩阵,在计算过程中会对计算速度产生不利影响;在关系抽取的环节比较依赖依存句法分析分析工具的处理效果。展开更多
In academic papers, there are many knowledge elements about the method. How to extract these methods to form a structured knowledge base is an important research of knowledge organization. Through content analysis of ...In academic papers, there are many knowledge elements about the method. How to extract these methods to form a structured knowledge base is an important research of knowledge organization. Through content analysis of a large number of documents, method knowledge elements are classified as the following five types: knowledge element of method definition, knowledge element of method classification, knowledge element of method features, knowledge element of method process, and knowledge element of method functions. Sentences of the method are extracted from academic papers, then the sentence description structure is formed by filtering keywords in the sentences, and the description rules of method are formed after revising and classification in order to extract knowledge elements of the method.展开更多
文摘【目的/意义】为了明晰科学实体在学术文本中承担的语义角色,进而建立特定领域的术语的知识结构,本文提出一种以科学实体词间关系为特征工程的术语分类方法,从学术研究的语义属性角度,将学术文本中出现的科学实体分为“研究领域”“研究问题”“研究方法”“研究工具”“其他”五类。【方法/过程】采用依存句法分析的方法,对于学术文本中存在两个及两个以上科学实体的句子,挖掘它们之间的最短依存路径,将最短依存路径上的谓词成分作为实体之间的关系进行提取,构造2D矩阵作为卷积神经网络的输入,完成实体的分类研究。【结果/结论】该模型在Web of Science上获取的“人工智能”领域的学术文献进行验证,精确率为89.38%,召回率为92.46%,F1值为0.9089。【创新/局限】由科学实体关系构成的矩阵是稀疏矩阵,在计算过程中会对计算速度产生不利影响;在关系抽取的环节比较依赖依存句法分析分析工具的处理效果。
基金an outcome of the project “Mining and Discovery of Knowledge Cell from Academic Papers” (No. 2014M560857) supported by China Postdoctoral Science Foundation
文摘In academic papers, there are many knowledge elements about the method. How to extract these methods to form a structured knowledge base is an important research of knowledge organization. Through content analysis of a large number of documents, method knowledge elements are classified as the following five types: knowledge element of method definition, knowledge element of method classification, knowledge element of method features, knowledge element of method process, and knowledge element of method functions. Sentences of the method are extracted from academic papers, then the sentence description structure is formed by filtering keywords in the sentences, and the description rules of method are formed after revising and classification in order to extract knowledge elements of the method.