期刊文献+

基于选择性集成策略的嵌入式网络流特征选择 被引量:10

An Embedded Feature Selection Using Selective Ensemble for Network Traffic
下载PDF
导出
摘要 机器学习在网络流量分类中存在特征选择度量指标单一、类别不平衡和概念漂移等问题,使得模型复杂度提高、泛化能力下降.该文提出基于选择性集成策略的嵌入式特征选择方法,根据选择性集成策略选取部分特征选择器集成,再改进序列前向搜索和封装器组合方法二次搜索最优特征子集.实验结果表明该算法在保证分类效果的同时有效降低了特征子集复杂度,从而达到了分类效果、效率和稳定性的最优平衡. The problems of feature selection metrics single,class imbalance and concept drift exist in machine learning for network traffic classification,leading the model complexity increased,the generalization ability decreased.Therefore,an embedded feature selection method based on selective ensemble is proposed,according to the selective ensemble strategy to ensemble part of feature selectors,and then through the combination method of improved sequence forward search and wrapper secondary search optimal feature subset.Experimental results show that the proposed algorithm can reduce the complexity of feature subset effectively while ensuring the classification performance,so as to achieve the optimal balance of the classification performance,efficiency and stability.
出处 《计算机学报》 EI CSCD 北大核心 2014年第10期2128-2138,共11页 Chinese Journal of Computers
基金 江苏省未来网络前瞻性研究项目(BY2013095-5-03) 江苏省科技支撑计划(工业)项目(BE2011173) 江苏省"六大人才高峰"高层次人才项目(2011-DZ024)资助
关键词 选择性集成 特征选择 嵌入式 稳定性 selective ensemble feature selection embedded stability
  • 相关文献

参考文献23

  • 1Grimaudo L,Mellia M,Baralis E,et al.Self-learning classifier for Internet traffic//Proceedings of the IEEE INFOCOM 2013.Turin,Italy,2013:3381-3386.
  • 2Ding L,Yu F,Peng S,et al.A classification algorithm for network traffic based on improved Support Vector Machine.Journal of Computers,2013,8(4):1090-1096.
  • 3Zhou Z H,Wu J,Tang W.Ensembling neural networks:Many could be better than all.Artificial intelligence,2002,137(1):239-263.
  • 4Saeys Y,Inza I,Larraiaga P.A review of feature selection techniques in bioinformatics.Bioinformatics,2007,23(19):2507-2517.
  • 5Jin Y,Duffield N,Erman J,et al.A modular machine learning system for flow level traffic classification in large networks.ACM Transactions on Knowledge Discovery from Data,2012,6(1):4.
  • 6Xie G,Iliofotou M,Keralapura R,et al.Subflow:Towards practical flowlevel traffic classification//Proceedings of the IEEEINFOCOM2012.Orlando,USA,2012:2541-2545.
  • 7Lee S,Kim H,Barman D,et al.Netramark:A network traffic classification benchmark.ACM SIGCOMM Computer Communication Review,2011,41(1):22-30.
  • 8Zhang H,Lu G,Qassrawi M T,et al.Feature selection for optimizing traffic classification.Computer Communications,2012,35(12):1457-1471.
  • 9Chen X,Wasikowski M.Fast:A roc-based feature selection metric for small samples and imbalanced data classification problems//Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Las Vegas,USA,2008:124-132.
  • 10Nguyen T T T,Armitage G.A survey of techniques for internet traffic classification using machine learning.IEEE Communications Surveys & Tutorials,2008,10(4):56-76.

二级参考文献35

  • 1Moore AW, Zuev D. Internet traffic classification using Bayesian analysis techniques. In: Proc. of the 2005 ACM SIGMETRICS Int'l Conf. on Measurement and Modeling of Computer Systems, Banff, 2005. 50-60. http://www.cl.cam.ac.uk/-awm22 /publications/moore2005internet.pdf.
  • 2Madhukar A, Williamson C. A longitudinal study of P2P traffic classification. In: Proc. of the 14th IEEE Int'l Syrup. on Modeling, Analysis, and Simulation. Monterey, 2006. http://ieeexplore.ieee.org/xpl/ffeeabs_all.jsp?arnumber=1698549.
  • 3Moore AW, Papagiannaki K. Toward the accurate identification of network applications. In: Dovrolis C, ed. Proc. of the PAM 2005. LNCS 3431, Heidelberg: Springer-Verlag, 2005.41-54.
  • 4Karagiannis T, Papagiannaki K, Faloutsos M. BLINC: Multilevel traffic classification in the dark. In: Proc. of the ACM SIGCOMM. Philadelphia, 2005. 229-240. http://conferences.sigcomm.org/sigcomm/2005/paper-KarPap.pdf.
  • 5Roughan M, Sen S, Spatscheck O, Dutfield N. Class-of-Service mapping for QoS: A statistical signature-based approach to IP traffic classification. In: Proc. of the ACM SIGCOMM Internet Measurement Conf. Taormina, 2004. 135-148. http://www.imconf.net/imc-2004/papers/p 135-roughan.pdf.
  • 6Zuev D, Moore AW. Traffic classification using a statistical approach. In: Dovrolis C, ed. Proc. of the PAM 2005. LNCS 3431, Heidelberg: Springer-Verlag, 2005. 321-324.
  • 7Nguyen T, Armitage G. Training on multiple sub-flows to optimise the use of Machine Learning classifiers in real-world IP networks. In: Proc. of the 31 st IEEE LCN 2006. Tampa, 2006. http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=4116573.
  • 8Eerman J, Mahanti A, Arlitt M. Internct traffic identification using machine learning techniques. In: Proc. of the 49th IEEE GLOBECOM. San Francisco, 2006. http://pages.cpsc.ucalgary.ca/-mahanti/papers/globecom06.pdf.
  • 9Erman J, Arlitt M, Mahanti A. Traffic classification using clustering algorithms. In: Proc. of the ACM SIGCOMM Workshop on Mining Network Data (MineNet). Pisa, 2006. http://conferences.sigcomm.org/sigcomm/2006/papers/minenet-01.pdf.
  • 10Bernaille L, Teixeira R, Salamatian K. Early application identification. In: Proc. of the Conf. on Future Networking Technologies 2006 (CoNEXT 2006). Lisboa, 2006. http://portal.acm.org/citation.efm?id=1368445.

共引文献321

同被引文献127

引证文献10

二级引证文献103

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部