Performance analysis of machine learning models for intrusion detection system using Gini Impurity-based Weighted Random Forest (GIWRF) feature selection technique 被引量：5

导出

摘要 To protect the network, resources, and sensitive data, the intrusion detection system (IDS) has become a fundamental component of organizations that prevents cybercriminal activities. Several approaches have been introduced and implemented to thwart malicious activities so far. Due to the effectiveness of machine learning (ML) methods, the proposed approach applied several ML models for the intrusion detection system. In order to evaluate the performance of models, UNSW-NB 15 and Network TON_IoT datasets were used for offline analysis. Both datasets are comparatively newer than the NSL-KDD dataset to represent modern-day attacks. However, the performance analysis was carried out by training and testing the Decision Tree (DT), Gradient Boosting Tree (GBT), Multilayer Perceptron (MLP), AdaBoost, Long-Short Term Memory (LSTM), and Gated Recurrent Unit (GRU) for the binary classification task. As the performance of IDS deteriorates with a high dimensional feature vector, an optimum set of features was selected through a Gini Impurity-based Weighted Random Forest (GIWRF) model as the embedded feature selection technique. This technique employed Gini impurity as the splitting criterion of trees and adjusted the weights for two different classes of the imbalanced data to make the learning algorithm understand the class distribution. Based upon the importance score, 20 features were selected from UNSW-NB 15 and 10 features from the Network TON_IoT dataset. The experimental result revealed that DT performed well with the feature selection technique than other trained models of this experiment. Moreover, the proposed GIWRF-DT outperformed other existing methods surveyed in the literature in terms of the F1 score.

作者 Raisa Abedin Disha Sajjad Waheed

机构地区 Department of Information and Communication Technology Department of Information and Communication Technology

出处《Cybersecurity》 EI CSCD 2022年第2期119-140,共22页 网络空间安全科学与技术（英文）

关键词 Cyber security Feature selection Intrusion Detection System Machine learning Network security

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

参考文献1

1Ansam Khraisat,Iqbal Gondal,Peter Vamplew,Joarder Kamruzzaman.Survey of intrusion detection systems:techniques,datasets and challenges[J].Cybersecurity,2019,2(1):1-22. 被引量：30

二级参考文献1

1Saravanan Subramanian,Vijay Bhanu Srinivasan,Chandrasekaran Ramasamy.Study on Classification Algorithms for Network Intrusion Systems[J].通讯和计算机（中英文版）,2012,9(11):1242-1246. 被引量：5

共引文献29

1金志刚,吴桐.基于特征选取与树状Parzen估计的入侵检测[J].系统工程与电子技术,2021,43(7):1954-1960. 被引量：7
2李舒沁.欧盟网络安全战略新动向及其启示[J].网络安全技术与应用,2021(7):175-177. 被引量：1
3张玲翠,许瑶冰,李凤华,房梁,郭云川,李子孚.天地一体化信息网络安全动态赋能架构[J].通信学报,2021,42(9):87-95. 被引量：4
4Ansam Khraisat,Ammar Alazab.A critical review of intrusion detection systems in the internet of things:techniques,deployment strategy,validation strategy,attacks,public datasets and challenges[J].Cybersecurity,2021,4(1):251-277. 被引量：4
5Ke LIU,Mufeng WANG,Rongkuan MA,Zhenyong ZHANG,Qiang WEI.Detection and localization of cyber attacks on water treatment systems:an entropy-based approach[J].Frontiers of Information Technology & Electronic Engineering,2022,23(4):587-603. 被引量：1
6Lewis Nkenyereye,Bayu Adhi Tama,Sunghoon Lim.A Stacking-Based Deep Neural Network Approach for Effective Network Anomaly Detection[J].Computers, Materials & Continua,2021(2):2217-2227. 被引量：3
7Shumaila Shahzadi,Fahad Ahmad,Asma Basharat,Madallah Alruwaili,Saad Alanazi,Mamoona Humayun,Muhammad Rizwan,Shahid Naseem.Machine Learning Empowered Security Management and Quality of Service Provision in SDN-NFV Environment[J].Computers, Materials & Continua,2021(3):2723-2749. 被引量：6
8Nahida Islam,Fahiba Farhin,Ishrat Sultana,M.Shamim Kaiser,Md.Sazzadur Rahman,Mufti Mahmud,A.S.M.Sanwar Hosen,Gi Hwan Cho.Towards Machine Learning Based Intrusion Detection in IoT Networks[J].Computers, Materials & Continua,2021(11):1801-1821. 被引量：3
9Magdy M.Fadel,Sally M.El-Ghamrawy,Amr M.T.Ali-Eldin,Mohammed K.Hassan,Ali I.El-Desoky.HDLIDP: A Hybrid Deep Learning Intrusion Detection and Prevention Framework[J].Computers, Materials & Continua,2022(11):2293-2312.
10郭森森,王同力,慕德俊.基于生成对抗网络与自编码器的网络流量异常检测模型[J].信息网络安全,2022(12):7-15. 被引量：6

同被引文献37

1童小钟,魏俊宇,苏绍璟,孙备,左震.融合注意力和多尺度特征的典型水面小目标检测[J].仪器仪表学报,2023,44(1):212-222. 被引量：12
2祁荣,陈军,余邵民.关于抑郁症的研究综述[J].心理月刊,2020(17):238-240. 被引量：11
3徐琳宏,林鸿飞,潘宇,任惠,陈建美.情感词汇本体的构造[J].情报学报,2008,27(2):180-185. 被引量：384
4沈玲玲,刘连友,许冲,王静璞.基于多模型的滑坡易发性评价——以甘肃岷县地震滑坡为例[J].工程地质学报,2016,24(1):19-28. 被引量：26
5蔡雅婷(图/文).掌上听歌、玩游戏京东叮咚mini2智能音箱体验评测[J].消费电子,2018,0(10):69-71. 被引量：1
6卢兵.小米发布新款小爱音箱组合立体声,支持蓝牙Mesh网关[J].计算机与网络,2019,45(18):27-27. 被引量：1
7王瑞琪,王学良,刘海洋,孙娟娟,王新辉,张苏.基于精细DEM的崩塌滑坡灾害识别及主控因素分析——以雅鲁藏布江缝合带加查—朗县段为例[J].工程地质学报,2019,27(5):1146-1152. 被引量：22
8于晓明.语音识别技术的发展及应用[J].计算机时代,2019,0(11):28-31. 被引量：18
9赵永辉.雅鲁藏布江流域嘎贡沟巨型滑坡变形破坏模式及演化过程研究[J].防灾科技学院学报,2019,21(4):1-7. 被引量：8
10袁钦湄,王星,帅建伟,林海,曹玉萍.基于人工智能技术的抑郁症研究进展[J].中国临床心理学杂志,2020,28(1):82-86. 被引量：29

引证文献5

1林琴,郭永刚,吴升杰,臧烨祺,王国闻.基于梯度提升的优化集成机器学习算法对滑坡易发性评价:以雅鲁藏布江与尼洋河两岸为例[J].西北地质,2024,57(1):12-22. 被引量：1
2Ali Hamid Farea,Omar H.Alhazmi,Kerem Kucuk.Advanced Optimized Anomaly Detection System for IoT Cyberattacks Using Artificial Intelligence[J].Computers, Materials & Continua,2024,78(2):1525-1545.
3倪志伟,行鸿彦,侯天浩,梁欣怡,王心怡.基于生成对抗网络和混合时空神经网络的入侵检测[J].电子测量技术,2024,47(2):17-24.
4张楠,蔡莉,杨文洁,余治国.面向抑郁症群体的情感化智能音箱设计与实现[J].计算机仿真,2024,41(3):334-341.
5Asad Raza,Shahzad Memon,Muhammad Ali Nizamani,Mahmood Hussain Shah.Intrusion Detection System for Smart Industrial Environments with Ensemble Feature Selection and Deep Convolutional Neural Networks[J].Intelligent Automation & Soft Computing,2024,39(3):545-566.

二级引证文献1

1刘伊铭,徐胜华,刘春阳,马钰.顾及多方法集成特征选择与负样本优化的滑坡易发性评价[J].测绘通报,2024(9):74-79.

1Guo Pu,Lijuan Wang,Jun Shen,Fang Dong.A Hybrid Unsupervised Clustering-Based Anomaly Detection Method[J].Tsinghua Science and Technology,2021,26(2):146-153. 被引量：7
2胡艳羽,赵龙,董祥军.一种用于癌症分类的两阶段深度特征选择提取算法[J].计算机科学,2022,49(7):73-78.
3吴喆君,黄睿.基于依赖最大化和稀疏回归的多标签特征选择[J].计算机工程与设计,2022,43(7):1898-1904. 被引量：1
4时林,时绍森,文伟平.基于LSTM的Linux系统下APT攻击检测研究[J].信息安全研究,2022,8(8):736-750. 被引量：1
5张蓝天.数据驱动下基于GPSO-FFS算法的吞吐量评估[J].信息技术与信息化,2022(6):55-59.
6Jialing Zhang,Stephan Stanislaw Späth,Sadie L.Marjani,Wengeng Zhang,Xinghua Pan.Characterization of cancer genomic heterogeneity by next-generation sequencing advances precision medicine in cancer treatment[J].Precision Clinical Medicine,2018,1(1):29-48. 被引量：7
7姜新盈,江开忠,严涛,王舒梵.不平衡数据中基于权重的边界混合采样[J].计算机工程与设计,2022,43(5):1265-1272. 被引量：3
8王佳.中国区域金融发展差异的度量[J].科技经济市场,2022(5):58-60. 被引量：1
9王飞雪,戴蓉.基于投票ELM和黑洞优化的云计算DDoS攻击检测[J].西南大学学报（自然科学版）,2022,44(8):205-215. 被引量：9
10张海翔,李培培,胡学钢.基于自适应领域粗糙集的多标签在线流特征选择[J].微电子学与计算机,2022,39(7):44-53. 被引量：1

Cybersecurity

2022年第2期

浏览历史

内容加载中请稍等...