摘要
信息抽取是从自由文本语料库构建数据库 ,实现情报自动收集的有效途径之一。近十多年来 ,信息抽取技术逐步走向成熟 ,已成为与信息检索相平行的技术之一。对信息抽取技术进行系统的归类、总结 ,已显得较为迫切。在对当前多种主要的信息抽取技术进行分析、比较的基础上 ,结合信息抽取所面临的挑战 。
Information extraction is a main approach for constructing database from free text corpus and for automatic collecting intelligence information.In the recent decades,information extraction is maturing and became a parallel technique to information retrieval.So classifying and summarizing the main techniques used in information extraction is an urgent task.Based on the anlysis and comparison of the main techniques used in information extraction,we present the challengs to information extraction,and analyze three trends in information extraction.
出处
《情报科学》
CSSCI
北大核心
2004年第7期815-821,829,共8页
Information Science
关键词
信息抽取
自由文本
知识获取
Information extraction Free text Knowledge acquisition