摘要
随着因特网的日益发展,网络空间的安全形势也愈发严峻。其中,以盗取用户敏感信息或者用户名口令为目的的网络钓鱼活动是网络犯罪行为中危害较大、影响较为严重的一种。针对网络钓鱼频发的现状,文中提出了一种基于Hadoop和mahout的钓鱼邮件检测方法,此方法采用hadoop平台的HDFS作为存储基础,Map Reduce作为并行计算框架.该方法对邮件信息进行特征提取,利用mahout的贝叶斯算法对钓鱼邮件进行检测。使用真实邮件数据集对该方法进行测试,取得了良好的效果。
With the development of the Internet, the network space safety situation is increasingly serious.Phishing with purpose of stealing users' sensitive information and password is one of cyber crime acitivity which harm a lot.In view of the situation of frequent phishing, this paper puts forward a fishing mail detection method based on Hadoop and mahout.This method uses the HDFS of Hadoop platform as the foundation of storage, Map Reduce as the parallel computing framework.It extracts feature for E-mail messages and uses the bayesian algorithm of mahout to test the phishing emails.Using real email data set to test the method which has obtained good effect.
出处
《电脑知识与技术(过刊)》
2016年第4X期27-30,共4页
Computer Knowledge and Technology