摘要
本文提出一种基于主题网络爬虫思想的Web数据挖掘算法,通过主题网络爬虫尽可能对Web数据进行分类整合处理,促进页面检索效率的提升,在此基础之上与贝叶斯网络算法相结合,基于关联规则对Web数据进行挖掘,并通过仿真实验的方式验证整套算法的可操作性。
Based on the idea of topical Web crawler,this paper puts forward the algorithm for Web data mining that Web data can be sorted and integrated through topical Web crawlers in order to promote the efficiency of page retrieval.On this basis,the algorithm combines with Bayesian network algorithm.The web data can be mined on grounds of association rules,and the whole algorithm will be verified by simulation experiments.
作者
景冰
JING Bing(Shanxi Vocational and Technical College of Finance and Trade,Taiyuan 030031,Shanxi Province,China)
出处
《景德镇学院学报》
2020年第3期66-68,共3页
Journal of JingDeZhen University
关键词
主题网络爬虫
数据挖掘
算法
topical web crawler
data mining
algorithm