期刊文献+

基于t分布混合模型的半监督网络流分类方法 被引量:2

Semi-supervised network traffic classification based on t-distribution mixture model
下载PDF
导出
摘要 针对传统高斯分布容易受到数据样本边缘值和离群点噪声的影响,改用t分布替代原有的高斯混合模型,并使用期望最大化(Expectation Maximization,EM)算法对网络流数据样本进行t分布混合模型的建模。为降低EM算法的迭代次数,对t分布混合模型进行了改进,用理论和实验验证了算法的有效性,并对网络多媒体业务流进行了分类研究。实验表明,提出的算法有较高的分类准确率,拟合的模型要优于传统的K-Means算法和传统的高斯混合模型的EM算法。 Traditional Gaussian distribution is susceptible to the influence of edge points and outliers in data samples,therefore student’s t-distribution is adopted to improve Gaussian distribution.The EM(Expectation Maximization)algorithm is used to build T-distribution Mixture Model(TMM)for network multimedia traffic dataset.Then,A new Limited T-distribution Mixture Model(LTMM)is presented to reduce the number of iterations for EM algorithm,whose effectiveness is demonstrated by theoretical analysis and experiments.The flows of multimedia services are classified in this paper.Experiments show that the proposed algorithm can achieve higher accuracy than existing methods and the fitted model is better than K-Means algorithm and EM algorithm for Gaussian mixture model.
作者 董育宁 朱善胜 赵家杰 DONG Yuning;ZHU Shansheng;ZHAO Jiajie(College of Telecommunications and Information Engineering,Nanjing University of Posts and Telecommunications,Nanjing 210003,China)
出处 《计算机工程与应用》 CSCD 北大核心 2018年第10期31-38,共8页 Computer Engineering and Applications
基金 国家自然科学基金(No.61271233) 华为HIRP创新项目
关键词 网络流分类 t分布混合模型 期望最大化算法 半监督分类 internet traffic classification t-distribution mixture model Expectation Maximization(EM)algorithm semisupervised learning
  • 相关文献

参考文献2

二级参考文献22

  • 1Moore A W, Zuev D. Internet traffic classification using bayesian analysis techniques [J]. ACM SIOMETRICS Performance Evaluation Review, 2005, 33(1):50-60.
  • 2Erman J, Mahanti A, Arlitt M. Traffic classification using clustering algorithms[C] // The 2006 SIGCOMM Workshop on Mining Network Data. USA:ACM Press, 2006:281-286.
  • 3McGregor A, Hall M, Lorier P, et al. Flow clustering using machine learning techniques [C]//The Fifth International Workshop in PAM 2004. France:Springer, 2004:205-214.
  • 4Erman J, Mahanti A, Arlitt M. Internet traffic identification using machine learning[C] //The 49th IEEE Global Telecommunications Conference ( GLOBECOM 2006 ). USA: IEEE Computer Society, 2006 : 1-6.
  • 5张景顺,李名楚.基于统计学的网络业务流建模及分类研究[EB/OL].[2009-4-20].http://www.paper.edu.cn/down-loadpaper.php?seriaLnumber=200811-288.
  • 6Antoniou I, Ivanov V V, Ivanov Valery V, etal, On the lognormal distribution of network traffic[J].Physica D: Nonlinear Phenomena, 2002,167(1-2) :72-85.
  • 7van de Meent R, Mandjes M R H, Pras A. Gaussian traffic everywhere? [C]//IEEE International Conference on Communications. Turkey: IEEE Computer Society, 2006: 573- 578.
  • 8Schwardt Ludwig. Gaussian mixture models [ EB/OL]. [2009-4-20]. http://staff.ee. sun. ac, za/-dupreez/pr813/ lectures/lecture06/pr813_lecture06, pdf.
  • 9Lan K, Heidemann J. A measurement study of correlations of Internet flow characteristics [J]. Computer Networks, 2006,50(1) :46-62.
  • 10Mitzenmacher M. A history of and new directions for power law research[EB/OL]. [2009-4-20]. http://www, eecs. harvard. edu/-michaelm/TALKS/Buffalo, ppt.

共引文献6

同被引文献39

引证文献2

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部