期刊文献+

基于随机森林算法的网络流量分类方法 被引量:9

Research on Classification of Network Traffic Based on Random Forests Algorithm
下载PDF
导出
摘要 精确的网络流量分类是实现互联网可控可管的关键,传统的单一分类算法需要构建基于特定假设的某种模型,算法对于待分类数据的分布要求高,不能满足复杂多变的网络流量的分类要求。基于此,采用多决策树组合的随机森林算法实现网络流量分类。通过实际网络流量数据实验表明,在各种情况下,随机森林算法都能显著改善网络流量特别是小比例样本的分类效果,算法降低了单一算法过于依赖特定假设模型的要求,对于待分类样本的分布要求低,随机森林算法具有良好的分类效果和鲁棒性。 The accurate classification of network traffic is the key of implementing the function of the controllable and manageability of Internet, which is important for network management and network security. The common single classification algorithms usually need to abide by specific hypothesis for modeling and have strict restriction on the distribution of datasets to be processed. Thus, these algorithms can't satisfy the need of classification of network traffic with the property of muhifractal and burst at all time. On the basis, the Random Forests (RF) algorithm which combines with multiple decision trees is used to classify traffic. The experiments on network traffic show that RF is capable of improving the classification effect of traffic especially for small flows whose ratio is small in the datasets and loosening the restriction of single classifier which abides by specific hypothesis obviously. Meanwhile, RF has a weaker restriction on the distribution of network traffic and it performs perfectly on the classification of network traffic and has stronger adaptability and robustness than any single classifier.
出处 《中国电子科学研究院学报》 2013年第2期184-190,共7页 Journal of China Academy of Electronics and Information Technology
基金 陕西省自然科学基础研究计划重点项目(2012JZ8005) 全军军事学研究生课题项目(2010XXXX-488 2011XXX-X23)
关键词 流量分类 流量特征选择 组合分类器 随机森林算法 traffic classification traffic feature selection ensemble classifier Random Forests algorithm
  • 相关文献

参考文献11

  • 1ETHEMALPAYDIN.机器学习导论[M].范明,昝红英,牛常勇,译.北京:机械工业出版社,2009:230-231.
  • 2BREIMAN L. Random Forests [ J]. Machine Learning, 2001, 45(1) : 5-32.
  • 3NGUYEN THUY T T, GRENVILLE ARMITAGE. A Sur- vey of Techniques for Internet Traffic Classification Using Machine Learning[J]. IEEE Communications Surveys & Tutorials, 2008, 10(4) : 56-76.
  • 4HYUNCHUL KIM, KIMBERLY C CLAFFY, MARINA FO- MENKOV, et al. Internet Traffic Classification Demysti- fied : Myths, Caveats, and the Best Practices [ C ]//2008 ACM CoNEXT Conference, New York : ACM ,2008 : 1-12.
  • 5WEI LI, MARCO CANINI, ANDREW W MOORE, et al. Efficient Application Identification and the Temporal and Spatial Stability of Classification Schema [ J ]. Computer Networks, 2009, 53 (6) : 790-809.
  • 6ARTHUR CALLADO,JUDITH KELENER, DJAMEL SA- DOK,et al. Better Network Traffic Identification Through the Independent Combination of Techniques [ J ]. Journal of Network and Computer Applications, 2010,33 ( 4 ) :433- 446.
  • 7ALBERTO DAINOTYI, ANTONIO PESCAPE, KIMBER- LY C CLAFFY. Issues and Future Directions in Traffic Classification[ J]. IEEE Network, 2012, 26(1) : 35-40.
  • 8MOORE A W,ZUEV D. Internet Traffic Classification Using Bayesian Analysis Techniques [ C ]//in Proc. ACM Sigmet- rics, 2005:50-60.
  • 9PIETRZYK M, URVOY-KEI.I.ER G, COSTEUX J-L. Re- vealing the Unknown ADSL Traffic Using Statistical Meth- ods [ J ]. Lecture Notes in Computer Science, 2009,5 537 ( 1 ) : 75-83.
  • 10GRINGOLI F, SALGARELLI L, DUSI M, et al. GT: Pick- ing up the Truth from the Ground for Interuet Traffic [ J ]. ACM SIGCOMM Computer Communication Review, 2009, 39(5) : 13-18.

同被引文献68

引证文献9

二级引证文献54

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部