摘要
内容分发网络(CDN)系统对内容热度的估计主要依靠管理员的经验,所以主观性比较大,无法保证服务质量(QoS)。首先对数据进行预处理,得到预测影片的初始知识库,利用数据挖掘技术对已有知识进行学习,对新加入的影片热度进行预测,将影片合理部署到CDN系统中。比较基于贝叶斯网络的影片热度预测和基于决策树模型的影片热度预测,在正确分类率和其他分类参数相同的前提下,贝叶斯网络所用的时间更短,所以选择贝叶斯网络分类器,解决管理员部署时不准确的问题,提高CDN系统的效率。
The estimation of the content popularity in the Content Distributed/Delivery Network(CDN) system mainly relies on the experience of administrators,which implies strong subjectivity and cannot guarantee the Quality of Service(QoS).In the paper,the authors firstly preprocessed the data,and obtained the initial knowledge base to predict the film popularity.This paper used data mining techniques to learn the existing knowledge and predict the popularity of films.Thus,the films in the CDN system could be deployed more effectively and efficiently.The movie popularity predicted by Bayesian network classier was compared with the movie popularity predicted by decision tree.On the premise of the same correct classification rate and other classification parameters,the time taken to build model in the Bayesian network classifier can be shorter.Therefore,the Bayesian network classifier was preferred.The method can solve the inaccurate deployment caused by the administrators' subjectivities and improve the efficiency of the CDN system.
出处
《计算机应用》
CSCD
北大核心
2011年第7期1737-1739,共3页
journal of Computer Applications
基金
科技部支撑计划项目(2008BAH28B04)
关键词
数据挖掘
贝叶斯网络
决策树
热度
广延指数模型
data mining
Bayesian network
decision tree
heat
streched exponential model