基于SBN模型的Internet应用协议识别方法

Internet application protocol identification method based on SBN model

导出

摘要针对网络流量协议标注比较困难的问题,提出一种基于贝叶斯网络的半监督学习模型,以提高Inter-net协议的识别精度.该模型首先使用少量的标注样本训练贝叶斯网络分类模型,并对未标注样本进行初始分类,然后从未标注样本中挑选分类损失最小的样本加入到训练集中并重复训练分类模型,经过多次循环训练出最终的分类器.该模型可以使用未标注样本和标注样本共同训练分类模型,非常适合于标注比较困难的Internet应用协议的识别.实验结果表明:在标注样本较少的情况下,该模型的识别精度和稳定性均优于朴素贝叶斯模型和贝叶斯网络模型,对于提高Internet协议的识别精度是有效的. As it is difficult to label the protocol of Internet traffic,a semi-supervised learning model based on Bayesian network was proposed to improve the accuracy of Internet protocol identification.First of all,a few labeled samples were used to train classification model of Bayesian network and the model was used to classify unlabeled samples,and then the unlabeled sample which has the lowest classification loss was selected to join the training set and retrain the classification model.After several cycles,the final classifier was trained to complete.It is an important advantage of the model that it can be trained by labeled samples and unlabeled samples,which is fit for the identification of Internet application protocol,because it is difficult to label the Internet traffic.The results of experiment show that the accuracy and stability of the model are better than Naive Bayes and Bayesian network,and it is an effective way to improve the accuracy of Internet protocol identification.

作者丁要军蔡皖东姚烨

机构地区西北工业大学计算机学院咸阳师范学院信息工程学院

出处《华中科技大学学报（自然科学版）》 EI CAS CSCD 北大核心 2012年第9期44-47,71,共5页 Journal of Huazhong University of Science and Technology(Natural Science Edition)

基金国家高技术研究发展计划资助项目(2009AA01Z424) 陕西省教育厅专项基金资助项目(12JK0933) 西北工业大学基础研究基金资助项目(JC201149)

关键词贝叶斯网络互联网半监督学习损失函数流量识别 Bayesian networks Internet semi-supervised learning loss function traffic identification

分类号 TP393 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献15

1徐鹏,刘琼,林森.基于支持向量机的Internet流量分类研究[J].计算机研究与发展,2009,46(3):407-414. 被引量：59
2Moore A W, Zuev D. lnternet traffic classification u sing Bayesian analysis techniques[C]//Proceedings of the 2005 ACM SIGMETRICS Conference on Meas- urement and Modeling of Computer Systems. New York: ACM, 2005: 50-60.
3LI Wei, Canini M, Moore A W. Efficient application identification and the temporal and spatial stability of classification schema[J]. Computer Networks, 2009, 53(6): 790-809.
4Company of Sourceforge. L7-filter[EB/OL]. E2011- 06-021. http://17-filter, sourceforge, net/.
5Zhu Xiaojin. Semi-supervised learning literature sur- vey, TR1530[R]. Madison: University of Wiscon- sin, 2008: 1-14.
6Rotsos C, Gael ] V, Moore A W. Probabilistic graphical models for semi-supervised traffic classifica- tion[C]//Proceedings of the 2010 ACM International Wireless Communications and Mobile Computing Conference. Caen: ACM, 2010: 752-757.
7Koller D, Friedman N. Probabilistic graphical mod- els: principles and teehniques[M]. Cambridge: The MIT Press, 2009: 45- 140.
8宫秀军,刘少辉,史忠植.一种增量贝叶斯分类模型[J].计算机学报,2002,25(6):645-650. 被引量：55
9Roy N, Mccallum A. Toward optimal active learning through sampling estimation of error reduction[C] ff Proceedings of the 2001 International Conference on Machine Learning (ICML). Western Massachusetts: ACM, 2001.- 441-448.
10刘琼,刘珍,黄敏.基于机器学习的IP流量分类研究[J].计算机科学,2010,37(12):35-40. 被引量：20

二级参考文献27

1刘琼,徐鹏,杨海涛,彭芸.Peer-to-Peer文件共享系统的测量研究[J].软件学报,2006,17(10):2131-2140. 被引量：36
2宫秀军史忠植.基于贝叶斯潜在语义模型的半监督Web挖掘[J].软件学报，已录用,.
3Madhukar A, Williamson C. A longitudinal study of P2P traffic classification [C]//Proc of the 14th IEEE Int Syrup on Modeling, Analysis, and Simulation. Washington, DC IEEE Computer Society, 2006:179-188
4Moore A W, Papagiannaki K. Toward the accurate identification of network applications [G]//Dovrolis C. LNCS 3431: Proc of the PAM 2005. Heidelberg: Springer, 2005:41-54
5Karagiannis T, Papagiannaki K, Faloutsos M. BLINC: Multilevel traffic classification in the dark [C]//Proc of ACM SIGCOMM. New York: ACM, 2005.. 229-240
6Roughan M, Sen S, Spatscheck O, et al. Class of service mapping for QoS: A statistical signature-hased approach to IP traffic classification [C]//Proc of ACM SIGCOMM Internet Measurement Conf 2004. New York: ACM, 2004: 135-148
7Zuev D. Moore A W. Traffic classification using a statistical approach [G]//Dovrolis C. LNCS 3431: Proc of the PAM. Heidelberg, Germany: Springer, 2005:321-324
8Moore A W, Zuev D. Internet traffic classification using Bayesian analysis techniques [C] //Proc of the 2005 ACM SIGMETRICS Int Conf on Measurement and Modeling of Computer Systems. New York: ACM, 2005: 50-60
9Tan P N, Steinbach M, Kumar V. Introduction to Data Mining [M]. Boston: Addison Wesley, 2006
10Moore A W, Zuev D, Crogan M. Discriminators for use in flow-based classification, RR-05-13 [R]. London: Queen Mary University of London, 2005

共引文献131

1熊刚,孟姣,曹自刚,王勇,郭莉,方滨兴.网络流量分类研究进展与展望[J].集成技术,2012,1(1):32-42. 被引量：23
2姜卯生,王浩,姚宏亮.朴素贝叶斯分类器增量学习序列算法研究[J].计算机工程与应用,2004,40(14):57-59. 被引量：10
3杨森,徐海涛,柴乔林.应用支持向量机实现增量入侵检测[J].计算机工程与应用,2004,40(27):142-143. 被引量：1
4张晓辉,李莹,常桂然,赵宏.适于Internet新闻文本实时分类的动态向量空间模型DVSM[J].计算机科学,2004,31(6):64-67. 被引量：1
5李家春,李之棠.分布式入侵告警关联分析[J].计算机研究与发展,2004,41(11):1919-1923. 被引量：6
6王利民,苑森淼.具有抗噪音能力的增量式混合贝叶斯网络[J].仪器仪表学报,2005,26(3):221-225. 被引量：1
7钱玲飞,刘玉树,李侃.朴素贝叶斯分类器在地形评估中的应用方法[J].计算机工程与应用,2005,41(12):189-191. 被引量：6
8王小英,赵海,林涛,张文波,尹震宇.基于信任的普适计算服务选择模型[J].通信学报,2005,26(5):1-8. 被引量：17
9胡学钢,郭亚光.一种基于粗糙集的朴素贝叶斯分类算法[J].合肥工业大学学报（自然科学版）,2006,29(2):169-172. 被引量：11
10周俐霞,何安元.基于流量均衡的垃圾邮件过滤网关集群系统模型[J].盐城工学院学报（自然科学版）,2006,19(1):23-27. 被引量：2

1杨正新,苏德,乐光明.YSBNC—Ⅱ型微机数控系统干扰问题及抗干扰措施[J].烟台大学学报（自然科学与工程版）,1993,6(4):55-61.
2夏命辉,王小平,林秦颖,狄方旭,王哲.复杂环境下基于动态贝叶斯网络的目标识别[J].空军工程大学学报（自然科学版）,2016,17(4):24-28. 被引量：9
3张兵利,裴亚辉.贝叶斯网络模型概述[J].电脑与信息技术,2008,16(5):41-42. 被引量：11
4吴巧敏,林亚平.一种基于重复训练的支持向量机方法[J].计算机工程与应用,2007,43(31):165-168. 被引量：2
5张良,蔡跃明.基于多示例学习的图像检索算法研究[J].军事通信技术,2009,30(2):12-15.
6翁胜龙,单外平,何国林.基于萤火虫BP神经网络的轴承故障诊断研究[J].电子设计工程,2014,22(24):4-7. 被引量：6
7唐焱,胡正国.SBN:一种新的Peer-to-Peer覆盖网络构造协议[J].航空学报,2003,24(5):447-451. 被引量：2
8黄玉倩.任务重复训练在对外汉语训练式课堂中的作用[J].科技风,2014(7):192-193.
9孙宁,邓承志,汪胜前.基于滤波后处理的主动学习高光谱遥感图像分类[J].南昌工程学院学报,2015,34(1):7-11. 被引量：4
10张兵利,裴亚辉.贝叶斯网络模型在反垃圾邮件中的应用[J].计算机系统应用,2009,18(3):137-140. 被引量：1

华中科技大学学报（自然科学版）

2012年第9期

浏览历史

内容加载中请稍等...

基于SBN模型的Internet应用协议识别方法

参考文献15

二级参考文献27

共引文献131

相关作者

相关机构

相关主题

浏览历史