期刊文献+

Support Vector Machine and Random Forest Modeling for Intrusion Detection System (IDS) 被引量:17

Support Vector Machine and Random Forest Modeling for Intrusion Detection System (IDS)
下载PDF
导出
摘要 The success of any Intrusion Detection System (IDS) is a complicated problem due to its nonlinearity and the quantitative or qualitative network traffic data stream with many features. To get rid of this problem, several types of intrusion detection methods have been proposed and shown different levels of accuracy. This is why the choice of the effective and robust method for IDS is very important topic in information security. In this work, we have built two models for the classification purpose. One is based on Support Vector Machines (SVM) and the other is Random Forests (RF). Experimental results show that either classifier is effective. SVM is slightly more accurate, but more expensive in terms of time. RF produces similar accuracy in a much faster manner if given modeling parameters. These classifiers can contribute to an IDS system as one source of analysis and increase its accuracy. In this paper, KDD’99 Dataset is used and find out which one is the best intrusion detector for this dataset. Statistical analysis on KDD’99 dataset found important issues which highly affect the performance of evaluated systems and results in a very poor evaluation of anomaly detection approaches. The most important deficiency in the KDD’99 dataset is the huge number of redundant records. To solve these issues, we have developed a new dataset, KDD99Train+ and KDD99Test+, which does not include any redundant records in the train set as well as in the test set, so the classifiers will not be biased towards more frequent records. The numbers of records in the train and test sets are now reasonable, which make it affordable to run the experiments on the complete set without the need to randomly select a small portion. The findings of this paper will be very useful to use SVM and RF in a more meaningful way in order to maximize the performance rate and minimize the false negative rate. The success of any Intrusion Detection System (IDS) is a complicated problem due to its nonlinearity and the quantitative or qualitative network traffic data stream with many features. To get rid of this problem, several types of intrusion detection methods have been proposed and shown different levels of accuracy. This is why the choice of the effective and robust method for IDS is very important topic in information security. In this work, we have built two models for the classification purpose. One is based on Support Vector Machines (SVM) and the other is Random Forests (RF). Experimental results show that either classifier is effective. SVM is slightly more accurate, but more expensive in terms of time. RF produces similar accuracy in a much faster manner if given modeling parameters. These classifiers can contribute to an IDS system as one source of analysis and increase its accuracy. In this paper, KDD’99 Dataset is used and find out which one is the best intrusion detector for this dataset. Statistical analysis on KDD’99 dataset found important issues which highly affect the performance of evaluated systems and results in a very poor evaluation of anomaly detection approaches. The most important deficiency in the KDD’99 dataset is the huge number of redundant records. To solve these issues, we have developed a new dataset, KDD99Train+ and KDD99Test+, which does not include any redundant records in the train set as well as in the test set, so the classifiers will not be biased towards more frequent records. The numbers of records in the train and test sets are now reasonable, which make it affordable to run the experiments on the complete set without the need to randomly select a small portion. The findings of this paper will be very useful to use SVM and RF in a more meaningful way in order to maximize the performance rate and minimize the false negative rate.
出处 《Journal of Intelligent Learning Systems and Applications》 2014年第1期45-52,共8页 智能学习系统与应用(英文)
关键词 INTRUSION Detection KDD’99 SVM KERNEL Random FOREST Intrusion Detection KDD’99 SVM Kernel Random Forest
  • 相关文献

同被引文献90

  • 1孙焕良,鲍玉斌,于戈,赵法信,王大玲.一种基于划分的孤立点检测算法[J].软件学报,2006,17(5):1009-1016. 被引量:16
  • 2孙云,李舟军,陈火旺.孤立点检测算法及其在数据流挖掘中的可用性[J].计算机科学,2007,34(10):200-203. 被引量:15
  • 3Alvarez J M, Lopez A M. Combining priors, appearance, and context for road detection[J]. IEEE Transactions on Intelligent Transportation Systems, 2013, 15(3): 1168-1178.
  • 4Nguyen D V, Kuhnert L, Thamke S, et al. A novel approach for a double-check of passable vegetation detection in au- tonomous ground vehicles[C]//l 5th IEEE International Confer- ence on Intelligent Transportation Systems. Piscataway, USA: IEEE, 2012: 230-236.
  • 5Nguyen D V, Kuhnert L, Jiang T, et al. Vegetation detection for outdoor automobile[C]//IEEE International Conference Guid- ance on Industrial Technology. Piscataway, USA: IEEE, 21)11: 358-364.
  • 6Bradley D M, Unnikrishnan R, Bagnell J. Vegetation detec- tion for driving in complex environments[C]//IEEE Internation- al Conference on Robotics and Automation. Piscataway, USA: IEEE, 2007: 503-508.
  • 7Zhao Y P, Wang H, Yan R C. Unstructured road edge detec- tion and initial positioning approach based on monocular vi- sion[C]//AASRI Conference on Computational Intelligence and Bioinformatics. Amsterdam, Netherlands: Elsevier Science, 2012: 486-491.
  • 8Salim N N A, Cheng X, Xiao D G. Improved shadow re- moval for unstructured road detection[C/OL]//Proceedings of the International Conference on Image Processing, Comput- er Vision, and Pattern Recognition. 2013: 1-5. [2015-01-01]. http://worldcomp-proceedings.com/proc/p2013/IPC4037.pdf.
  • 9Gu Y J, Jin Z. Grass detection based on color features[C[// Proceedings of Chinese Conference on Pattern Recognition. Piscataway, USA: IEEE, 2010: 1-5.
  • 10Ren X, Malik J. Learning a classification model for segmen- tation[C]//9th IEEE International Conference on Computer Vi- sion. Piscataway, USA: IEEE, 2003: 10-17.

引证文献17

二级引证文献98

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部