期刊文献+

一种基于统计排序的网络流量特征选择方法 被引量:10

A feature selection method of network traffic based on statistic and ranking strategy
下载PDF
导出
摘要 利用网络流量的统计特征进行流量的分类识别需要从众多的特征中选取最优特征集合,以避免冗余和不相关特征造成的系统模型复杂度过高、分类准确率和效率下降等问题。针对该问题,提出一种基于统计排序的网络流量特征选择方法。首先利用基于统计方法定义的特征选择系数生成初始特征子集,再将基于分类准确率构建的特征影响系数作为特征评估排序的依据,对初始特征子集进行二次特征选择,生成最优特征子集。实验结果表明,该方法在保证分类整体准确率的同时有效减少了流量统计特征的个数,在分类效果、效率以及稳定性之间实现了较好的平衡。 It is required to select the best features from so many ones in order to avoid the high complexity of the model, the low classification accuracy and efficiency caused by redundant and irrelevant features, if network traffic classification is obtained by us-ing the statistical characteristics. To solve the problem, a network traffic feature selection method based on statistic and ranking is proposed, according to generate the initial feature subset by using feature selection coefficient defined by statistic and then generate the optimal feature subset through the second feature selection of the initial feature subset by using feature influence coefficient defined by classification accuracy as the reference of extraction and ranking. Experimental results show that the proposed algorithm can reduce the number of features effectively while ensuring the overall classification accuracy and a good balance is achieved be-tween classification effectiveness, efficiency and stability.
出处 《电子技术应用》 2018年第1期84-87,共4页 Application of Electronic Technique
基金 国家计算机网络与信息安全技术研究专项(242研究计划)(2016QN027)
关键词 网络流量分类 特征选择 统计排序 特征影响 network traffic classification feature selection statistic and ranking feature influence
  • 相关文献

参考文献6

二级参考文献59

  • 1谢宏,程浩忠,牛东晓.基于信息熵的粗糙集连续属性离散化算法[J].计算机学报,2005,28(9):1570-1574. 被引量:134
  • 2杨建华,谢高岗,张广兴,李忠诚.一种高效的业务流分类算法[J].电子学报,2006,34(3):549-552. 被引量:2
  • 3Cover TM1. The best two independent measurements are not the two best [ J]. IEEE Transactions on Systems,Man, and Cybernet- ics, 1974,1:116-1171.
  • 4CacheLogic[ EB/OL]. http ://www. cachelogic, corn, 2008-01.
  • 5Hyunchul Kim, KC Claffy, Marina Fomenkov, et al. Internot traf- fic classification demystified: myths, caveats, and the best prac- tices [C]. Proceedings of the 2008 ACM CoNEXT Conference, December 09-12, 2008:1-12.
  • 6Moore A W, Zuev Z. lnternet traffic classification using Bayesian analysis techniques [C]. Proceedings of ACM SIGMETRICS, Banff, Canada, June,2005:50-60.
  • 7Wimams N, Zander S, Annitage G. A preHraJnary performance comparison of five machine learning algorithms for practical ip traf- fic flow classification[J]. ACM SIGCOMM CCR,October,2006, 36(5) :7-15.
  • 8Yang Yue-xiang, Wang Rui-wang, Liu Yang, et al. Solving P2P traffic identification problems via optimized support vector ma- chines [ C]. Proceeding of IEEF_/ACS International Conference on Computer Systems and Applications ( aieesa ), Amman, Jordan, 2007 : 165-171.
  • 9Zander S, Nguyen T, Armitagc G. Self-learning IP traffic classifi- cation based on statistical flow characteristics [ C ]. PAM 2005, Boston, USA, March 31-April 1, 2005.
  • 10Li Zhu, Yuan Rui-xi, Guan Xiao-hong. Accurate classification of the Intermet traffic based on the SVM method CC]. Proceeding of the 42th ]FEE International Cortference on Communication (ICC), Glasgow, Scfland, 2007 : 1373-1378.

共引文献31

同被引文献85

引证文献10

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部