Abstract
Web information extraction techniques are used to query, restructure, and reuse information on the Web. This paper discusses DOM-based Web information extraction, examines how to construct extraction rules so as to improve both the precision of extraction and the adaptability of the rules, and presents the procedure by which the rules are generated.
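As a rough illustration of what a DOM-based extraction rule can look like, the sketch below parses a page into a tree and applies a rule made of node paths. It is not the rule format proposed in the paper: the page fragment, the `RULE` layout, and the `extract` function are all hypothetical, and the input is assumed to be well-formed XHTML so that Python's standard-library parser can read it.

```python
import xml.etree.ElementTree as ET

# Hypothetical page fragment; well-formed XHTML is assumed so ElementTree can parse it.
PAGE = """
<html><body>
  <div id="nav"><a href="/">Home</a></div>
  <table class="products">
    <tr><td>Widget</td><td>9.50</td></tr>
    <tr><td>Gadget</td><td>12.00</td></tr>
  </table>
</body></html>
"""

# Hypothetical rule layout: one path that locates each record node in the tree,
# plus one path per field, evaluated relative to the record node.
RULE = {
    "record": ".//table[@class='products']/tr",
    "fields": {"name": "td[1]", "price": "td[2]"},
}

def extract(page, rule):
    """Apply a path-based rule to the parsed tree and return one dict per record."""
    root = ET.fromstring(page)
    records = []
    for node in root.findall(rule["record"]):
        record = {}
        for field, path in rule["fields"].items():
            cell = node.find(path)
            has_text = cell is not None and cell.text
            record[field] = cell.text.strip() if has_text else None
        records.append(record)
    return records

rows = extract(PAGE, RULE)
for row in rows:
    print(row["name"], row["price"])
```

Here the record path is anchored on an attribute (`class='products'`) rather than on an absolute position such as the first table in the body, so the rule still matches if unrelated elements are added before the table. That is one simple way to think about the adaptability of a rule, offered here only as an assumption about what adaptability could mean in practice.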
Source
Journal of Hebei University (Natural Science Edition)
CAS
PKU Core (北大核心)
2007, No. 2, pp. 209-212 (4 pages in total)