Out-of-Distribution Network Traffic Data Detection Technology Based on Calculation of Likelihood Ratio

Cited by: 1

Abstract: Deep learning models sometimes misclassify data from unknown classes as belonging to known classes. Such unknown-class data are defined as out-of-distribution data. In fields such as bioinformatics, healthcare, autonomous driving, and network security, this kind of misclassification can lead to serious consequences. This paper briefly introduces network traffic identification and classification techniques as well as out-of-distribution data, and proposes a method for detecting out-of-distribution data among test samples. Based on the characteristics of out-of-distribution data, two models are trained and the likelihood ratio of their outputs is computed to decide whether a sample is out-of-distribution. The method is tested on the public Moore network traffic dataset and on four self-collected datasets, where its detection accuracy reaches up to 92.3%.
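The two-model likelihood-ratio decision described in the abstract can be illustrated with a minimal sketch. The abstract does not say what the two models are or how they are trained, so everything below is an assumption made for illustration: a "foreground" model fitted on known-class flows, a "background" model fitted on noise-perturbed copies of the same flows, diagonal Gaussians standing in for the paper's models, synthetic two-feature data, and a threshold of 0. All function and variable names are hypothetical.

```python
import math
import random


def fit_diag_gaussian(samples):
    """Fit one independent Gaussian per feature; returns (means, variances)."""
    n = len(samples)
    dims = len(samples[0])
    means = [sum(s[d] for s in samples) / n for d in range(dims)]
    variances = [
        # Floor the variance so the log-density below never divides by zero.
        max(sum((s[d] - means[d]) ** 2 for s in samples) / n, 1e-6)
        for d in range(dims)
    ]
    return means, variances


def log_likelihood(x, model):
    """Log-density of feature vector x under a diagonal Gaussian model."""
    means, variances = model
    return sum(
        -0.5 * math.log(2 * math.pi * v) - (xi - m) ** 2 / (2 * v)
        for xi, m, v in zip(x, means, variances)
    )


def llr_score(x, foreground, background):
    """Log-likelihood ratio: log p_foreground(x) - log p_background(x)."""
    return log_likelihood(x, foreground) - log_likelihood(x, background)


def is_out_of_distribution(x, foreground, background, threshold=0.0):
    """Flag x as out-of-distribution when the ratio falls below the threshold."""
    return llr_score(x, foreground, background) < threshold


# Synthetic stand-in for known-class traffic features (assumption, not real data).
rng = random.Random(0)
known_flows = [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(500)]
# Assumed background training set: the same flows with added Gaussian noise.
perturbed_flows = [[v + rng.gauss(0.0, 2.0) for v in flow] for flow in known_flows]

foreground = fit_diag_gaussian(known_flows)
background = fit_diag_gaussian(perturbed_flows)

for sample in ([0.0, 0.0], [8.0, -8.0]):
    print(sample, is_out_of_distribution(sample, foreground, background))
```

In this toy setup a sample near the training data scores higher under the tighter foreground model, while a distant sample is explained better by the broader background model and is flagged. In practice the threshold would be tuned on held-out data rather than fixed at 0.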

Authors: ZHUO Zihan; LYU Xinrun; LIU Likun; CHE Jiazhen; YU Xiangzhan; YE Lin; ZHANG Xiaohui (National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, China; School of Cyberspace Science, Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China)

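Affiliations: National Computer Network Emergency Response Technical Team/Coordination Center of China; School of Cyberspace Science, Faculty of Computing, Harbin Institute of Technology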

Source: Radio Engineering (PKU Core Journal), 2022, No. 8, pp. 1322-1329 (8 pages)

Funding: National Natural Science Foundation of China, General Program (61872111).

Keywords: deep learning; out-of-distribution data; machine learning; likelihood ratio

Classification code: TP399 [Automation and Computer Technology: Computer Application Technology]

