摘要
通过比较句子级知识抽取与词语级知识抽取的差异性,分析句子级知识抽取在情报学中的意义,表现在四类典型应用系统:学术抄袭检测系统、参考文献自动标注系统、文献自动综述系统、知识库构建系统。分析了知识抽取的难点与关键技术,针对难点与关键技术提出了知识抽取的3个转向:抽取对象转向以学术文献为主;抽取技术转向以内容结构分析为主;抽取目标转向以构建知识元数据库为主。
Based on the comparison of the differences between sentence-level knowledge extraction and word- level knowledge extraction, this paper analyzes the significance of the sentence-level knowledge extraction in information science. It's represented in the 4 typical application systems : academic plagiarism detection system, automatic reference-labeling system, automatic literature summarizing system and knowledge base construction system. After analyzing the difficulties and key technologies of knowledge extraction, this paper proposes 3 shifts of knowledge extraction in accordance with the difficulties and key technologies : extraction object shifts to giving priority to academic literature; extraction technology shifts to giving priority to content structure analysis; and extraction objective shifts to giving priority to constructing knowledge metadata base.
出处
《情报理论与实践》
CSSCI
北大核心
2011年第12期1-4,共4页
Information Studies:Theory & Application
基金
国家自然科学基金项目"基于句子匹配分析的知识抽取研究与实现"的研究成果之一
项目编号:70803048
关键词
知识抽取
情报学
应用研究
knowledge extraction
information science
application study