期刊文献+

基于深度森林的网络流量分类方法 被引量:8

Network traffic classification method based on deep forest
下载PDF
导出
摘要 随着网络应用的迅猛发展,流量分类在网络资源分配、流量调度和网络安全等诸多研究领域受到广泛关注。现有的机器学习流量分类方法对流量数据特征的选取和分布要求苛刻,导致在实际应用中的复杂流量场景下分类精确度和稳定度难以提高。为了解决样本特征属性的复杂性给分类性能带来的不利影响,引入了基于深度森林的流量分类方法。该算法通过级联森林和多粒度扫描机制,能够在样本数量规模和特征属性选取规模有限的情况下,有效地提高流量整体分类性能。通过网络流量公开数据集Moore对支持向量机、随机森林和深度森林机器学习算法进行训练和测试,结果表明基于深度森林的网络流量分类器的分类准确率能够达到96.36%,性能优于其他机器学习模型。 With the rapid development of network applications,the Internet traffic classification has a profound impact on the research fields of network resource allocation,traffic scheduling and network security.The traditional flow analysis method based on machine learning has strict requirements for the feature selection and distribution of network flows,which makes it difficult to accurately and stably classify the complex and changeable flow data in practical application.In order to solve the adverse impact of the complexity of sample features on the traffic classification,a new classification method based on deep forest,which utilizes the cascade forest of decision trees and the multi-grained scanning mechanisms aiming to improve classification performance in the case of limited scale of samples and features,was proposed.The machine learning algorithms including support vector machine,random forest and deep forest were trained and tested by using Moore,which is a flow data set in public domain.The experiment results show that the classification accuracy using deep forest model reaches 96.36%,which outperforms the other machine learning models.
作者 戴瑾 王天宇 王少尉 DAI Jin;WANG Tianyu;WANG Shaowei(School of Information Science and Engineering, Jinling College, Nanjing University, Nanjing 210089, China;School of Electronic Science and Engineering, Nanjing University, Nanjing 210023, China;National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China)
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2020年第4期30-34,共5页 Journal of National University of Defense Technology
基金 国家自然科学基金资助项目(61801208,61671233,61931023,U1936202)。
关键词 特征选取 多粒度级联森林 机器学习 网络流量分类 feature selection multi-grained cascade forest machine learning network traffic classification
  • 相关文献

参考文献2

二级参考文献18

  • 1Dainotti A,Pescapè A,Claffy K C.Issues and future directions in traffic classification[J].Network,IEEE,2012,26(1):35-40.
  • 2Zhang H,Lu G,Qassrawi M T,et al.Feature selection for optimizing traffic classification[J].Computer Communications,2012,35(12):1457-1471.
  • 3Moore A,Zuev D,Crogan M.Discriminators for Use in Flow-Based Classification[R].London:Queen Mary and Westfield College,Department of Computer Science,2005.
  • 4Jain A,Zongker D.Feature selection:Evaluation,application,and small sample performance[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1997,19(2):153-158.
  • 5Hall M A.Correlation-based feature selection for machine learning[D].Waikato,New Zealand:The University of Waikato,1999.
  • 6E Rich,K Knight.Artificial Intelligence[M].New York,US: McGraw-Hill,1991.
  • 7Sun M,Chen J,Zhang Y,et al.A new method of feature selection for flow classification[J].Physics Procedia,2012,24:1729-1736.
  • 8Jamil H A,Zarei R,Fadlelssied N O,et al.Analysis of features selection for P2P traffic detection using support vector machine[A].Information and Communication Technology (ICoICT)[C].Bandung:IEEE,2013.116-121.
  • 9Alazab A,Hobbs M,Abawajy J,et al.Using feature selection for intrusion detection system[A].Communications and Information Technologies (ISCIT)[C].Gold Coast,QLD:IEEE,2012.296-301.
  • 10Moore A W,Zuev D.Internet traffic classification using bayesian analysis techniques[A].ACM SIGMETRICS Performance Evaluation Review[C].Banff,Alberta,Canada:ACM,2005.50-60.

共引文献68

同被引文献74

引证文献8

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部