期刊文献+

基于Spark的大规模网络流量分类研究 被引量:1

Study on large scale network traffic classification on Spark platform
下载PDF
导出
摘要 机器学习算法处理流量分类问题已经成为网络安全领域一个研究热点。为了提高大规模网络流的分类效率,引入并行SVM算法来识别网络流量,提出了一种基于Spark平台的大规模网络流在线分类方案。该方案利用置信域牛顿法(TRON)并行优化线性SVM算法构建流量分类模型,然后融合最新的实时计算框架,实现对大规模网络流的在线识别。实验结果表明,利用并行SVM算法在损失较小精度的前提下可以加快网络流的模型训练和分类速度,符合大规模网络流在线分类的需要。 Internet traffic classification using machine learning has become a hot research topic in the field of network security. In order to improve the classification efficiency of large scale network flow, this paper introduces a parallel SVM algorithm to identify the network traffic, and proposes a real-time classification scheme for large scale network flow based on Spark. This method builds a classification model using parallel SVM algorithm, and then it is integrated with the latest flow processing framework for real-time classification of large-scale networks. Experimental results show that parallel SVM algorithm can greatly improve the training and classification speed of the network flow model, on the premise of little loss of precision.
出处 《计算机时代》 2016年第4期1-5,共5页 Computer Era
基金 国家自然科学基金项目(61473149)
关键词 流量分类 网络安全 SPARK 并行SVM 大规模数据 traffic classification network security Spark parallel SVM large scale data
  • 相关文献

参考文献13

  • 1Madhukar A, Williamson C. A Longitudinal Study of P2P Traffic Classification[C]//Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, 2006. MASCOTS 2006. 14th IEEE Intemational Symposium on. IEEE,2006:179-188.
  • 2Kumar, Sailesh, Dharmapurikar, Sarang, Yu, Fang, et al. Algorithms to accelerate multiple regular expressions matching for deep packet inspection[M].ACM,2006.
  • 3Erman J, Arlitt M, Mahanti A. Traffic Classification Using Clustering Algorithms[C]//In ACM SIGCOMM MineNet Workshop,2006:281-286.
  • 4Moore A W, Zuev D, Internet traffic classification using bayesian analysis techniques[J]. Acm Sigmetrics Performance Evaluation Review, 2005.33(1):50-60.
  • 5徐鹏,林森.基于C4.5决策树的流量分类方法[J].软件学报,2009,20(10):2692-2704. 被引量:169
  • 6徐鹏,刘琼,林森.基于支持向量机的Internet流量分类研究[J].计算机研究与发展,2009,46(3):407-414. 被引量:59
  • 7Yang L, Hu G, Li D, et al. Anomaly detection based on efficient Euclidean projection[J]. Security & Communication Networks,2015.
  • 8Groleat T, Arzel M, Vaton S. Hardware Acceleration of SVM-Based Traffic Classification on FPGA[C]// Wireless Communications and Mobile Computing Conference (IWCMC), 2012 8th International. IEEE, 2012:443-449.
  • 9M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica,"Spark: cluster computing with working sets," in Proceedings of the 2nd USENIX conference on Hot topics in cloud computing USENIX Association, 2010:10-10.
  • 10M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley,M. J. Franklin, S. Shenker, and I. Stoica, "Resilient distributed datasets:A fault-tolerant abstraction for in-memory cluster computing," in Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation,2012.

二级参考文献29

  • 1Madhukar A, Williamson C. A longitudinal study of P2P traffic classification [C]//Proc of the 14th IEEE Int Syrup on Modeling, Analysis, and Simulation. Washington, DC IEEE Computer Society, 2006:179-188
  • 2Moore A W, Papagiannaki K. Toward the accurate identification of network applications [G]//Dovrolis C. LNCS 3431: Proc of the PAM 2005. Heidelberg: Springer, 2005:41-54
  • 3Karagiannis T, Papagiannaki K, Faloutsos M. BLINC: Multilevel traffic classification in the dark [C]//Proc of ACM SIGCOMM. New York: ACM, 2005.. 229-240
  • 4Roughan M, Sen S, Spatscheck O, et al. Class of service mapping for QoS: A statistical signature-hased approach to IP traffic classification [C]//Proc of ACM SIGCOMM Internet Measurement Conf 2004. New York: ACM, 2004: 135-148
  • 5Zuev D. Moore A W. Traffic classification using a statistical approach [G]//Dovrolis C. LNCS 3431: Proc of the PAM. Heidelberg, Germany: Springer, 2005:321-324
  • 6Moore A W, Zuev D. Internet traffic classification using Bayesian analysis techniques [C] //Proc of the 2005 ACM SIGMETRICS Int Conf on Measurement and Modeling of Computer Systems. New York: ACM, 2005: 50-60
  • 7Tan P N, Steinbach M, Kumar V. Introduction to Data Mining [M]. Boston: Addison Wesley, 2006
  • 8Moore A W, Zuev D, Crogan M. Discriminators for use in flow-based classification, RR-05-13 [R]. London: Queen Mary University of London, 2005
  • 9Witten I H, Frank E. Data Mining: Practical Machine Learning Tools and Techniques [M]. 2nd ed. Amsterdam: Elsevier Inc. , 2005
  • 10Chang C C, Lin C J. LIBSVM: A library for support vector machines[EB/OL]. 2001 [2007-08-06]. http://www.csie. ntu. edu. tw/-ejlin/libsvm

共引文献209

同被引文献3

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部