一种HTTPS应用的层次分类方法

A Hierarchical Classification Method for HTTPS Applications

下载PDF

导出

摘要 HTTPS协议用以网站服务器的身份认证,提供交换数据的保密性和完整性。然而一些不法分子利用HTTPS页面散布不良信息,这给通信流量的管理和安全分析带来了新的挑战。因此,准确识别基于SSL/TLS的HTTPS加密应用,对于提高网络服务质量、优化网络带宽分配、加强安全管控有着重要意义。现有的方法大多侧重于直接识别网站和应用程序,而很少关注类别的层次性结构。本文提出一种根据HTTPS应用类别的树状层次结构,自顶向下,逐层分类识别的方法,在顶层根据签名和样本流的关联关系将业务流识别为对应的大类,在次顶层提取检测流的特征值,使用随机森林模型分类为对应的最底层子类。实验结果表明,该方法能克服直接识别方法分类误差高的缺点,提高业务识别的精确率。 The HTTPS protocol uses the identity authentication of the web server to provide confidentiality and integrity of the exchanged data. However, some criminals use HTTPS pages to spread bad information, which brings new challenges to the management and security analysis of communication traffic. Therefore, accurately identifying SSL/TLS-based HTTPS encryption applications is of great significance for improving network service quality, optimizing network bandwidth allocation, and strengthening security management. Most of the existing methods focus on directly identifying websites and applications, and rarely pay attention to the hierarchical structure of categories. This paper proposes a tree-level hierarchical structure based on the HTTPS application category, which is a top-down, layer-by-layer classification and recognition method. At the top level, the business flow is identified as the corresponding large class according to the association relationship between the signature and the sample stream, and is extracted at the top level. The feature values of the stream are detected and classified into the corresponding lowest level subclasses using a random forest model. The experimental results show that the proposed method can overcome the shortcomings of the direct recognition method and improve the accuracy of business identification.

作者张磊赵辉 ZHANG Lei;ZHAO Hui(College of Cybersecurity,Sichuan University,Chengdu,610065,China)

机构地区四川大学网络空间安全学院

出处《网络新媒体技术》 2020年第3期14-20,共7页 Network New Media Technology

基金国家重点研发计划(2016YFB0800604,2016YFB0800605) 国家自然科学基金项目(61572334,U1736212) 四川省重点研发项目(2018GZ0183)

关键词流量识别 SSL/TLS协议 HTTPS 随机森林 Traffic identification SSL/TLS protocol HTTPS Random forest

分类号 TP393.08 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献3

1王勇,周慧怡,俸皓,叶苗,柯文龙.基于深度卷积神经网络的网络流量分类方法[J].通信学报,2018,39(1):14-23. 被引量：66
2邱密,阳爱民,刘永定,何震凯.使用贝叶斯学习算法分类网络流量[J].计算机工程与应用,2010,46(25):78-81. 被引量：6
3张泽鑫,李俊,常向青.基于特征加权的朴素贝叶斯流量分类方法研究[J].高技术通讯,2016,26(2):119-128. 被引量：8

二级参考文献37

1Sen S, Wang J.Analyzing peer-to-peer traffic across large networks[J].IEEE/ACM Transactions on Networking, 2004, 12(2): 219-232.
2Gerber A, Houle J, Nguyen H, et al.P2P, the gorilla in the cable[R]. AT & T Labs-Research,2004.
3Yuan Huang, Tseng Shian-Shyong,Wu Gang-shan, et al.A two-phase feature selection method using both filter and wrapper[C]//Proc of 1999 IEEE Inter'l Conf on Systems,Man,and Cybernetics, 1999,2: 132-136.
4Mitchell T M.Machine leaming[M].[S.l.]: McGraw-Hill Education, 1997.
5Mitchell T M.Does machine learning really work[J].AI Magazine, 1997,18 (3) : 11-20.
6Williams N, Zander S, Armitage G.Evaluating machine leaming algorithms for automated network application identification, Technical Report 060410B[R].2006.
7Kohavi R,John G H.Wrappers for feature subset selection[J].Artificial Intelligence Journal, 1997,97(1/2) :273-324.
8Liu H, Setiono R.A probabilistic approach to feature selection: A filter solution[C]//Proc of Intel Conf on Machine Learning, 1996:319-327.
9Das S.Filters, wrappers and a boosting based hybrid for feature selection[C]//Proc of the 8th Intel Conf on Machine Learning, 2001 : 74-81.
10Yu Lei,Liu Huan.Feature selection for high-dimensional data: A fast correlation-based filter solution[C]//Proeeedings of the 20th International Conference on Machine Learning(ICML 2003),2003.

共引文献75

1潘嘉,翟江涛,刘伟伟.基于改进递归残差网络的恶意流量分类算法[J].计算机应用研究,2020,37(S02):227-229. 被引量：4
2周剑峰,阳爱民,刘吉财.基于改进的C4.5算法的网络流量分类方法[J].计算机工程与应用,2012,48(5):71-74. 被引量：18
3林锥,王立德,周洁琼,刘力源.基于自适应SPI总线的列车PIS系统研究[J].电子测量与仪器学报,2012,26(4):312-319. 被引量：5
4张建伟,王玲艳,姚云磊.一种基于OPTICS聚类的流量分类算法[J].郑州轻工业学院学报（自然科学版）,2013,28(2):83-86. 被引量：4
5马力,王致,张丹,洪永健,王天安.基于深度学习的人脸识别技术在电力巡检机器人中的应用研究[J].自动化与仪器仪表,2019(2):36-38. 被引量：3
6黄琳凯.一种基于OPTICS聚类的流量分类算法[J].中国新通信,2017,19(8):38-39.
7陈雷,肖创柏,禹晶,张亚红,王真理.自适应聚类Hough变换及地震断层检测[J].高技术通讯,2017,27(3):193-202. 被引量：2
8冯军军,贺晓春,王海沛.基于朴素贝叶斯网络的微博话题追踪技术研究[J].计算机与数字工程,2017,45(11):2244-2247. 被引量：5
9郭丽,刘磊.基于多层感知器的流量分类方法研究[J].电子测量与仪器学报,2019,0(7):56-64. 被引量：6
10吴迪,方滨兴,崔翔,刘奇旭.BotCatcher:基于深度学习的僵尸网络检测系统[J].通信学报,2018,39(8):18-28. 被引量：14

1王丹丹.网络级联抗毁攻击信息层次化分类仿真研究[J].计算机仿真,2020,37(2):329-333.
2薛艳锋,高志娥,高文莲.基于PC端网站的移动阅读解决方案[J].软件工程,2018,21(2):27-29.
3投稿须知[J].包装工程,2020,41(19).
4冉亚鑫,韩红旗,张运良,翁梦娟,高雄,彭柯芸.基于Stacking集成学习的大规模文本层次分类方法[J].情报理论与实践,2020,43(10):171-176. 被引量：15
5杨伟凤.小学英语核心素养教学探究[J].今天,2020(21):182-182.
6周雄飞.湘西旅游公路景观资源利用研究[J].公路工程,2020,45(3):223-228. 被引量：4
7夏炎.C++Builder中的地级市电子地图制作[J].电脑编程技巧与维护,2020(9):145-146.
8赵金龙.诗韵和鸣,情感共振——类文阅读理念下组诗教学的实践研究[J].学园,2020(13):97-98.
9彭光生,陈怡.甲状腺结节过度超声诊断及应对策略[J].中国超声医学杂志,2020,36(10):883-886. 被引量：14
10余其鹏,汪波.基于WebGIS的地震应急快速反应系统设计与实现[J].地震地磁观测与研究,2020,41(4):239-250. 被引量：2

网络新媒体技术

2020年第3期

浏览历史

内容加载中请稍等...

一种HTTPS应用的层次分类方法

参考文献3

二级参考文献37

共引文献75

相关作者

相关机构

相关主题

浏览历史