摘要
在分析比较通用网络爬虫和主题网络爬虫的定义和处理流程基础上,结合主题网络爬虫的功能,提出了网络舆情监控系统中主题网络爬虫的设计模块。针对主题爬虫要实现的目标,分别研究了系统所要实现的关键算法。基于主题爬虫的舆情监控系统能满足面向特定领域的信息采集及监测需求,具有较强的实用价值。
On the basis of analysis and comparison of the definition and process of general crawler and focused crawler, combined with the function of the Focused crawler, the design module on Focused Crawler of the network public opinion mo- nitoring system was put forward. In view of the focused crawler goal, the key algorithm of system implementation was studied. The public opinion monitoring system based on focused crawler could meet the needs for specific areas of information collection and monitoring with a strong practical value.
出处
《舰船电子工程》
2014年第9期104-107,共4页
Ship Electronic Engineering
关键词
网络舆情监控系统
主题网络爬虫
信息采集
public opinion monitoring system, focused crawler, information collection