摘要
为了有效处理迅速增长的海量信息数据安全问题,在Hadoop云计算平台上,应用朴素贝叶斯算法和Logistic回归算法对入侵检测大数据进行并行计算分析。实验在伪分布模式和分布模式下进行计算,结果表明2种算法分类准确率均超过90%,Logistic回归算法比朴素贝叶斯算法运行时间更长;集群环境下运行的朴素贝叶斯算法可以有效降低运行时间。综合算法运行时间和分类准确率等因素,朴素贝叶斯算法比Logistic回归算法更能有效处理入侵检测大数据;并行计算下朴素贝叶斯算法可以有效分析入侵检测大数据。
To handle huge amounts of network data effectively which is increasing rapidly, Naive Bayesian parallel algorithm and Logistic Regression parallel algorithm were used to analyze the intrusion detection big data based on Hadoop which is a cloud computing system. The intrusion detection data was computed in the model of pseudo-distribution model and distribution model. The experimental results show that the classification accuracy of the two algorithms can exceed 90% and Logistic Regression algorithm spent more time than Naive Bayesian algorithm. Naive Bayesian algorithm can reduce run time effectively in Hadoop cluster. So Naive Bayesian algorithm is more effectively than Logistic Regression algorithm with the classification accuracy and the algorithm running time considered. Naive Bayesian algorithm can analyze the intrusion detection big data.
出处
《计算机与现代化》
2015年第12期43-47,共5页
Computer and Modernization
基金
教育部人文社会科学研究青年基金资助项目(11YJCZH005)