期刊文献+

基于SBN模型的Internet应用协议识别方法

Internet application protocol identification method based on SBN model
原文传递
导出
摘要 针对网络流量协议标注比较困难的问题,提出一种基于贝叶斯网络的半监督学习模型,以提高Inter-net协议的识别精度.该模型首先使用少量的标注样本训练贝叶斯网络分类模型,并对未标注样本进行初始分类,然后从未标注样本中挑选分类损失最小的样本加入到训练集中并重复训练分类模型,经过多次循环训练出最终的分类器.该模型可以使用未标注样本和标注样本共同训练分类模型,非常适合于标注比较困难的Internet应用协议的识别.实验结果表明:在标注样本较少的情况下,该模型的识别精度和稳定性均优于朴素贝叶斯模型和贝叶斯网络模型,对于提高Internet协议的识别精度是有效的. As it is difficult to label the protocol of Internet traffic,a semi-supervised learning model based on Bayesian network was proposed to improve the accuracy of Internet protocol identification.First of all,a few labeled samples were used to train classification model of Bayesian network and the model was used to classify unlabeled samples,and then the unlabeled sample which has the lowest classification loss was selected to join the training set and retrain the classification model.After several cycles,the final classifier was trained to complete.It is an important advantage of the model that it can be trained by labeled samples and unlabeled samples,which is fit for the identification of Internet application protocol,because it is difficult to label the Internet traffic.The results of experiment show that the accuracy and stability of the model are better than Naive Bayes and Bayesian network,and it is an effective way to improve the accuracy of Internet protocol identification.
出处 《华中科技大学学报(自然科学版)》 EI CAS CSCD 北大核心 2012年第9期44-47,71,共5页 Journal of Huazhong University of Science and Technology(Natural Science Edition)
基金 国家高技术研究发展计划资助项目(2009AA01Z424) 陕西省教育厅专项基金资助项目(12JK0933) 西北工业大学基础研究基金资助项目(JC201149)
关键词 贝叶斯网络 互联网 半监督学习 损失函数 流量识别 Bayesian networks Internet semi-supervised learning loss function traffic identification
  • 相关文献

参考文献15

  • 1徐鹏,刘琼,林森.基于支持向量机的Internet流量分类研究[J].计算机研究与发展,2009,46(3):407-414. 被引量:59
  • 2Moore A W, Zuev D. lnternet traffic classification u sing Bayesian analysis techniques[C]//Proceedings of the 2005 ACM SIGMETRICS Conference on Meas- urement and Modeling of Computer Systems. New York: ACM, 2005: 50-60.
  • 3LI Wei, Canini M, Moore A W. Efficient application identification and the temporal and spatial stability of classification schema[J]. Computer Networks, 2009, 53(6): 790-809.
  • 4Company of Sourceforge. L7-filter[EB/OL]. E2011- 06-021. http://17-filter, sourceforge, net/.
  • 5Zhu Xiaojin. Semi-supervised learning literature sur- vey, TR1530[R]. Madison: University of Wiscon- sin, 2008: 1-14.
  • 6Rotsos C, Gael ] V, Moore A W. Probabilistic graphical models for semi-supervised traffic classifica- tion[C]//Proceedings of the 2010 ACM International Wireless Communications and Mobile Computing Conference. Caen: ACM, 2010: 752-757.
  • 7Koller D, Friedman N. Probabilistic graphical mod- els: principles and teehniques[M]. Cambridge: The MIT Press, 2009: 45- 140.
  • 8宫秀军,刘少辉,史忠植.一种增量贝叶斯分类模型[J].计算机学报,2002,25(6):645-650. 被引量:55
  • 9Roy N, Mccallum A. Toward optimal active learning through sampling estimation of error reduction[C] ff Proceedings of the 2001 International Conference on Machine Learning (ICML). Western Massachusetts: ACM, 2001.- 441-448.
  • 10刘琼,刘珍,黄敏.基于机器学习的IP流量分类研究[J].计算机科学,2010,37(12):35-40. 被引量:20

二级参考文献27

  • 1刘琼,徐鹏,杨海涛,彭芸.Peer-to-Peer文件共享系统的测量研究[J].软件学报,2006,17(10):2131-2140. 被引量:36
  • 2宫秀军 史忠植.基于贝叶斯潜在语义模型的半监督Web挖掘[J].软件学报,已录用,.
  • 3Madhukar A, Williamson C. A longitudinal study of P2P traffic classification [C]//Proc of the 14th IEEE Int Syrup on Modeling, Analysis, and Simulation. Washington, DC IEEE Computer Society, 2006:179-188
  • 4Moore A W, Papagiannaki K. Toward the accurate identification of network applications [G]//Dovrolis C. LNCS 3431: Proc of the PAM 2005. Heidelberg: Springer, 2005:41-54
  • 5Karagiannis T, Papagiannaki K, Faloutsos M. BLINC: Multilevel traffic classification in the dark [C]//Proc of ACM SIGCOMM. New York: ACM, 2005.. 229-240
  • 6Roughan M, Sen S, Spatscheck O, et al. Class of service mapping for QoS: A statistical signature-hased approach to IP traffic classification [C]//Proc of ACM SIGCOMM Internet Measurement Conf 2004. New York: ACM, 2004: 135-148
  • 7Zuev D. Moore A W. Traffic classification using a statistical approach [G]//Dovrolis C. LNCS 3431: Proc of the PAM. Heidelberg, Germany: Springer, 2005:321-324
  • 8Moore A W, Zuev D. Internet traffic classification using Bayesian analysis techniques [C] //Proc of the 2005 ACM SIGMETRICS Int Conf on Measurement and Modeling of Computer Systems. New York: ACM, 2005: 50-60
  • 9Tan P N, Steinbach M, Kumar V. Introduction to Data Mining [M]. Boston: Addison Wesley, 2006
  • 10Moore A W, Zuev D, Crogan M. Discriminators for use in flow-based classification, RR-05-13 [R]. London: Queen Mary University of London, 2005

共引文献131

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部