Abstract
Information extraction is one of the effective ways to build databases from free-text corpora and to collect information automatically. This paper proposes an information extraction method that builds its extraction rules on frame semantic tagging. Extraction based on frame semantic tagging uses a uniform approach to guide the extraction process. Because it works at a fine granularity, the method is, to a certain extent, generally applicable to domains with strongly regular semantics. A frame-semantics-based system, BAIE (Book Abstract Information Extraction), was designed and applied on a trial basis to the content summaries of books. The results indicate that frame-semantics-based information extraction is feasible and applicable to a certain degree.
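The idea of tying extraction rules to semantic frames can be sketched in a few lines. The abstract does not describe BAIE's actual rule format, so everything below is an assumption made for illustration: the `FrameRule` class, the frame name `Introducing`, the frame elements `work` and `topic`, and the use of a regular expression as the rule body are all hypothetical.

```python
import re
from dataclasses import dataclass


@dataclass
class FrameRule:
    """A hypothetical extraction rule tied to one semantic frame."""
    frame: str           # frame name (illustrative, not taken from the paper)
    pattern: re.Pattern  # named groups stand in for frame elements

    def extract(self, sentence):
        """Return the filled frame as a dict, or None if the rule does not apply."""
        match = self.pattern.search(sentence)
        if match is None:
            return None
        return {"frame": self.frame, **match.groupdict()}


# Assumed rule: the target word "introduces" evokes the frame, and the
# named groups capture the text that fills each frame element.
RULES = [
    FrameRule(
        frame="Introducing",
        pattern=re.compile(r"^(?P<work>.+?) introduces (?P<topic>.+?)\.?$"),
    ),
]


def extract_all(sentence):
    """Apply every rule to the sentence and collect the frames that match."""
    results = []
    for rule in RULES:
        filled = rule.extract(sentence)
        if filled is not None:
            results.append(filled)
    return results


print(extract_all("This book introduces frame semantics."))
```

Each filled frame maps directly onto a database row, which is the use the abstract describes. A real system would presumably match over tagged text rather than raw strings.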
Source
《计算机工程与应用》
CSCD
PKU Core (Peking University Core Journals)
2008, No. 25, pp. 143-145, 151 (4 pages in total)
Computer Engineering and Applications
Funding
Special Project of the Ministry of Science and Technology (No. 2006FY11070903)
Keywords
information extraction
frame semantics
extraction rules