Abstract
This paper first introduces Web-Harvest, an open-source information extraction tool, in detail. Building on Web-Harvest with functional extensions and improvements, it then designs a general-purpose Web information extraction system. The paper focuses on the system's design ideas and workflow, and briefly describes the design of its database tables. Finally, it presents applications of the system.
Source
《现代图书情报技术》
CSSCI
PKU Core Journal (北大核心)
2010, No. 3, pp. 76-81 (6 pages)
New Technology of Library and Information Service