期刊文献+
共找到353篇文章
< 1 2 18 >
每页显示 20 50 100
Mobile SMS Spam Filtering for Nepali Text Using Naive Bayesian and Support Vector Machine 被引量:2
1
作者 Tej Bahadur Shahi Abhimanu Yadav 《International Journal of Intelligence Science》 2014年第1期24-28,共5页
Spam is a universal problem with which everyone is familiar. A number of approaches are used for Spam filtering. The most common filtering technique is content-based filtering which uses the actual text of message to ... Spam is a universal problem with which everyone is familiar. A number of approaches are used for Spam filtering. The most common filtering technique is content-based filtering which uses the actual text of message to determine whether it is Spam or not. The content is very dynamic and it is very challenging to represent all information in a mathematical model of classification. For instance, in content-based Spam filtering, the characteristics used by the filter to identify Spam message are constantly changing over time. Na?ve Bayes method represents the changing nature of message using probability theory and support vector machine (SVM) represents those using different features. These two methods of classification are efficient in different domains and the case of Nepali SMS or Text classification has not yet been in consideration;these two methods do not consider the issue and it is interesting to find out the performance of both the methods in the problem of Nepali Text classification. In this paper, the Na?ve Bayes and SVM-based classification techniques are implemented to classify the Nepali SMS as Spam and non-Spam. An empirical analysis for various text cases has been done to evaluate accuracy measure of the classification methodologies used in this study. And, it is found to be 87.15% accurate in SVM and 92.74% accurate in the case of Na?ve Bayes. 展开更多
关键词 SMS spam filtering Classification Support Vector Machine Naive Bayes PREPROCESSING Feature Extraction Nepali SMS Datasets
下载PDF
Efficient Spam Filtering System Based on Smart Cooperative Subjective and Objective Methods
2
作者 Samir A. Elsagheer Mohamed 《International Journal of Communications, Network and System Sciences》 2013年第2期88-99,共12页
Most of the spam filtering techniques are based on objective methods such as the content filtering and DNS/reverse DNS checks. Recently, some cooperative subjective spam filtering techniques are proposed. Objective me... Most of the spam filtering techniques are based on objective methods such as the content filtering and DNS/reverse DNS checks. Recently, some cooperative subjective spam filtering techniques are proposed. Objective methods suffer from the false positive and false negative classification. Objective methods based on the content filtering are time consuming and resource demanding. They are inaccurate and require continuous update to cope with newly invented spammer’s tricks. On the other side, the existing subjective proposals have some drawbacks like the attacks from malicious users that make them unreliable and the privacy. In this paper, we propose an efficient spam filtering system that is based on a smart cooperative subjective technique for content filtering in addition to the fastest and the most reliable non-content-based objective methods. The system combines several applications. The first is a web-based system that we have developed based on the proposed technique. A server application having extra features suitable for the enterprises and closed work groups is a second part of the system. Another part is a set of standard web services that allow any existing email server or email client to interact with the system. It allows the email servers to query the system for email filtering. They can also allow the users via the mail user agents to participate in the subjective spam filtering problem. 展开更多
关键词 ANTI-spam SYSTEM Objective spam filterING Cooperative SUBJECTIVE spam filterING WEB Application WEB Services
下载PDF
Survey on Spam Filtering Techniques
3
作者 Saadat Nazirova 《Communications and Network》 2011年第3期153-160,共8页
In the recent years spam became as a big problem of Internet and electronic communication. There developed a lot of techniques to fight them. In this paper the overview of existing e-mail spam filtering methods is giv... In the recent years spam became as a big problem of Internet and electronic communication. There developed a lot of techniques to fight them. In this paper the overview of existing e-mail spam filtering methods is given. The classification, evaluation, and comparison of traditional and learning-based methods are provided. Some personal anti-spam products are tested and compared. The statement for new approach in spam filtering technique is considered. 展开更多
关键词 E-MAIL spam Unsolicited BULK MESSAGES filterING Traditional METHODS Learning-Based METHODS Classification
下载PDF
Large margin classification for combatingdisguise attacks on spam filters 被引量:1
4
作者 Xi-chuan ZHOU Hai-bin SHEN +1 位作者 Zhi-yong HUANG Guo-jun LI 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2012年第3期187-195,共9页
This paper addresses the challenge of large margin classification for spam filtering in the presence of an adversary who disguises the spam mails to avoid being detected. In practice, the adversary may strategically a... This paper addresses the challenge of large margin classification for spam filtering in the presence of an adversary who disguises the spam mails to avoid being detected. In practice, the adversary may strategically add good words indicative of a legitimate message or remove bad words indicative of spam. We assume that the adversary could afiord to modify a spam message only to a certain extent, without damaging its utility for the spammer. Under this assumption, we present a large margin approach for classification of spam messages that may be disguised. The proposed classifier is formulated as a second-order cone programming optimization. We performed a group of experiments using the TREC 2006 Spam Corpus. Results showed that the performance of the standard support vector machine (SVM) degrades rapidly when more words are injected or removed by the adversary, while the proposed approach is more stable under the disguise attack. 展开更多
关键词 大边缘 垃圾过滤 秒顺序锥编程(SOCP ) 对手的分类
原文传递
基于SpamAssassin的中文垃圾邮件过滤系统的设计与实现 被引量:9
5
作者 李玉峰 《内蒙古农业大学学报(自然科学版)》 CAS 北大核心 2012年第3期245-249,共5页
随着电子邮件在人们生活中的广泛应用,垃圾邮件的防范也日益引起人们的重视。本文详细介绍了在linux下基于SpamAssassin中文垃圾邮件过滤系统的设计与实现。
关键词 spamAssassin 垃圾邮件过滤 LINUX 中文分词 特征选取
下载PDF
A Solution for Fighting Spammer's Resources and Minimizing the Impact of Spam
6
作者 Samir A. Elsagheer Mohamed 《International Journal of Communications, Network and System Sciences》 2012年第7期416-422,共7页
Spam or unsolicited emails constitute a major threat to the Internet, the corporations, and the end-users. Statistics show that about 70% - 80% of the emails are spam. There are several techniques that have been imple... Spam or unsolicited emails constitute a major threat to the Internet, the corporations, and the end-users. Statistics show that about 70% - 80% of the emails are spam. There are several techniques that have been implemented to react to the spam on its arrival. These techniques consist in filtering the emails and placing them in the Junk or Spam folders of the users. Regardless of the accuracy of these techniques, they are all passive. In other words, they are like someone is hitting you and you are trying by all the means to protect yourself from these hits without fighting your opponent. As we know the proverbs 'The best defense is a good offense' or 'Attack is the best form of defense'. Thus, we believe that attacking the spammers is the best way to minimize their impact. Spammers send millions of emails to the users for several reasons and usually they include some links or images that direct the user to some web pages or simply to track the users. The proposed idea of attacking the spammers is by building some software to collect these links from the Spam and Junk folders of the users. Then, the software periodically and actively visit these links and the subsequent redirect links as if a user clicks on these links or as if the user open the email containing the tracking link. If this software is used by millions of users (included in the major email providers), then this will act as a storm of Distributed Denial of Service attack on the spammers servers and there bandwidth will be completely consumed by this act. In this case, no human can visit their sites because they will be unavailable. In this paper, we describe this approach and show its effectiveness. In addition, we present an application we have developed that can be used for this reason. 展开更多
关键词 spam Emails Attacking spammers spam filterING Distributed DENIAL of Service ATTACKS Software Development
下载PDF
An Improved Bayesian with Application to Anti-Spam Email 被引量:2
7
作者 詹川 卢显良 +1 位作者 周旭 侯孟书 《Journal of Electronic Science and Technology of China》 2005年第1期30-33,共4页
Along with the wide application of e-mail nowadays, many spam e-mails flood into people’s email-boxes and cause catastrophes to their study and life. In anti-spam e-mails campaign, we depend on not only legal measure... Along with the wide application of e-mail nowadays, many spam e-mails flood into people’s email-boxes and cause catastrophes to their study and life. In anti-spam e-mails campaign, we depend on not only legal measures but also technological approaches. The Bayesian classifier provides a simple and effective approach to discriminate classification. This paper presents a new improved Bayesian-based anti-spam e-mail filter. We adopt a way of attribute selection based on word entropy, use vector weights which are represented by word frequency, and deduce its corresponding formula. It is proved that our filter improves total performances apparently in our experiment. 展开更多
关键词 word entropy Bayesian classification anti-spam e-mail filter attribute selection VECTOR
下载PDF
A Heuristic Reputation Based System to Detect Spam Activities in a Social Networking Platform, HRSSSNP
8
作者 Manoj Rameshchandra Thakur Sugata Sanyal 《Social Networking》 2013年第1期42-45,共4页
The introduction of the social networking platform has drastically affected the way individuals interact. Even though most of the effects have been positive, there exist some serious threats associated with the intera... The introduction of the social networking platform has drastically affected the way individuals interact. Even though most of the effects have been positive, there exist some serious threats associated with the interactions on a social networking website. A considerable proportion of the crimes that occur are initiated through a social networking platform [1]. Almost 33% of the crimes on the internet are initiated through a social networking website [1]. Moreover activities like spam messages create unnecessary traffic and might affect the user base of a social networking platform. As a result preventing interactions with malicious intent and spam activities becomes crucial. This work attempts to detect the same in a social networking platform by considering a social network as a weighted graph wherein each node, which represents an individual in the social network, stores activities of other nodes with respect to itself in an optimized format which is referred to as localized data set. The weights associated with the edges in the graph represent the trust relationship between profiles. The weights of the edges along with the localized data set are used to infer whether nodes in the social network are compromised and are performing spam or malicious activities. 展开更多
关键词 spam Social GRAPH Collaborative filtering Weighted GRAPH LOCALIZED Data-Set Trust Level
下载PDF
稀疏矩阵的概念与应用
9
作者 许春荣 买买提依明·哈斯木 《信息与电脑》 2023年第21期254-256,共3页
稀疏矩阵在数据存储、机器学习、图像处理及文本处理方面有广泛的应用。在数据结构等一些课程中,教材会介绍稀疏矩阵,包括压缩存储方法等内容。由于稀疏矩阵的讲解比较少,理论的内容比较多,案例讲解较少,学生不易理解。为了让学生更好... 稀疏矩阵在数据存储、机器学习、图像处理及文本处理方面有广泛的应用。在数据结构等一些课程中,教材会介绍稀疏矩阵,包括压缩存储方法等内容。由于稀疏矩阵的讲解比较少,理论的内容比较多,案例讲解较少,学生不易理解。为了让学生更好地理解稀疏矩阵及其应用,通过垃圾邮件过滤分类的案例,对比采用稠密型矩阵和稀疏矩阵两种形式,验证稀疏矩阵在存储和训练运算方面的优越性。 展开更多
关键词 稀疏矩阵 数据结构 垃圾邮件过滤 案例理解
下载PDF
数据挖掘技术在计算机软件工程中的应用
10
作者 郑盼盼 《移动信息》 2023年第9期208-211,共4页
文中阐述了数据挖掘的定义和分类,然后从垃圾邮件过滤、用户行为分析、软件代码分析、深度学习和自动化数据挖掘技术等方面,详细介绍了数据挖掘技术在软件工程中的应用现状和未来发展趋势。这些应用展示了数据挖掘技术在计算机软件工程... 文中阐述了数据挖掘的定义和分类,然后从垃圾邮件过滤、用户行为分析、软件代码分析、深度学习和自动化数据挖掘技术等方面,详细介绍了数据挖掘技术在软件工程中的应用现状和未来发展趋势。这些应用展示了数据挖掘技术在计算机软件工程中的多样性和重要性,同时也提出了一些问题和挑战,如数据隐私和安全问题、算法的解释和解释性评估等。因此,在数据挖掘技术的发展和应用中,需要继续加强对技术的研究和创新,推进技术与法律、伦理等方面的平衡发展。 展开更多
关键词 数据挖掘 计算机软件工程 垃圾邮件过滤 用户行为分析
下载PDF
基于改进朴素贝叶斯算法的垃圾邮件过滤器的研究 被引量:26
11
作者 郑炜 沈文 张英鹏 《西北工业大学学报》 EI CAS CSCD 北大核心 2010年第4期622-627,共6页
基于朴素贝叶斯算法的垃圾邮件过滤器是目前比较高效、经济的垃圾邮件过滤技术之一,它已经广泛应用到垃圾邮件过滤领域。文章在对朴素贝叶斯过滤器分析的基础上,针对朴素贝叶斯算法的缺陷结合损失最小化的思想,并根据垃圾邮件的特性对... 基于朴素贝叶斯算法的垃圾邮件过滤器是目前比较高效、经济的垃圾邮件过滤技术之一,它已经广泛应用到垃圾邮件过滤领域。文章在对朴素贝叶斯过滤器分析的基础上,针对朴素贝叶斯算法的缺陷结合损失最小化的思想,并根据垃圾邮件的特性对朴素贝叶斯算法做了改进,提出了改进朴素贝叶斯算法,该算法能够通过调整k值,降低合法邮件被错判为垃圾邮件的概率,从而最大程度减少用户的损失。 展开更多
关键词 概率 朴素贝叶斯 垃圾邮件过滤器
下载PDF
基于TF*IDF的垃圾邮件过滤特征选择改进算法 被引量:6
12
作者 陈琦 伍朝辉 +2 位作者 姚芳 宋秀荣 张付志 《计算机应用研究》 CSCD 北大核心 2009年第6期2165-2167,共3页
随着电子邮件的普及与应用,垃圾邮件的泛滥也越来越受到人们的关注。而如何进行邮件特征选择,是邮件分类中的重要问题。在介绍词频和倒文档频度的基础上,对几种常用的特征选择算法进行了分析和比较,针对现有特征选择算法过于机械的缺点... 随着电子邮件的普及与应用,垃圾邮件的泛滥也越来越受到人们的关注。而如何进行邮件特征选择,是邮件分类中的重要问题。在介绍词频和倒文档频度的基础上,对几种常用的特征选择算法进行了分析和比较,针对现有特征选择算法过于机械的缺点,将关键字权重引入到邮件分类中,提出了一种基于关键词权重的TF*IDF特征选择改进算法,并进行了实验验证。实验结果表明,采用该算法改进后的贝叶斯过滤器具有更好的过滤效果。 展开更多
关键词 垃圾邮件 过滤器 贝叶斯 特征选择 TF*IDF
下载PDF
改进的朴素贝叶斯算法在垃圾邮件过滤中的研究 被引量:20
13
作者 杨雷 曹翠玲 +1 位作者 孙建国 张立国 《通信学报》 EI CSCD 北大核心 2017年第4期140-148,共9页
提出了一种利用支持向量机改进的朴素贝叶斯算法——TSVM-NB算法。首先利用NB算法对样本集进行初次训练,利用支持向量机构造一个最优分类超平面,每个样本根据与其距离最近样本的类型是否相同进行取舍,这样既降低样本空间规模,又提高每... 提出了一种利用支持向量机改进的朴素贝叶斯算法——TSVM-NB算法。首先利用NB算法对样本集进行初次训练,利用支持向量机构造一个最优分类超平面,每个样本根据与其距离最近样本的类型是否相同进行取舍,这样既降低样本空间规模,又提高每个样本类别的独立性,最后再次用朴素贝叶斯算法训练样本集从而生成分类模型。仿真实验结果表明,该算法在样本空间进行取舍过程当中消除了冗余属性,可以快速得到分类特征子集,提高了垃圾邮件过滤的分类速度、召回率和正确率。 展开更多
关键词 邮件过滤 朴素贝叶斯 支持向量机 修剪策略
下载PDF
一种基于人工免疫的多层垃圾邮件过滤算法 被引量:16
14
作者 张泽明 罗文坚 王煦法 《电子学报》 EI CAS CSCD 北大核心 2006年第9期1616-1620,共5页
随着电子邮件日益广泛的使用,如何有效地避免和防范垃圾邮件的侵扰已成为一个亟待解决的问题.受生物免疫系统自我保护机制的启发,本文提出了一种基于人工免疫的多层垃圾邮件过滤算法,利用分层检测的思想来过滤垃圾邮件.文中给出了针对... 随着电子邮件日益广泛的使用,如何有效地避免和防范垃圾邮件的侵扰已成为一个亟待解决的问题.受生物免疫系统自我保护机制的启发,本文提出了一种基于人工免疫的多层垃圾邮件过滤算法,利用分层检测的思想来过滤垃圾邮件.文中给出了针对多层过滤算法中获得性免疫层的垃圾邮件过滤测试实验,实验结果表明本算法在垃圾邮件过滤中能得到较高的召回率、精确率和正确率.文中也指出了可以通过合理地设置各检测器层之间的与或关系来得到更好的垃圾邮件过滤效果. 展开更多
关键词 人工免疫 获得性免疫 垃圾邮件 多层垃圾邮件过滤
下载PDF
基于粗糙集的加权朴素贝叶斯邮件过滤方法 被引量:20
15
作者 邓维斌 王国胤 洪智勇 《计算机科学》 CSCD 北大核心 2011年第2期218-221,共4页
邮件过滤中有两个关键问题,一是如何选择有效的邮件特征集,二是设计较好的邮件过滤算法。在对邮件特性进行分析的基础上,综合邮件头及邮件内容的主要形象特征给出了一种新的邮件特征集提取方法。用粗糙集的信息观点度量了各属性的重要性... 邮件过滤中有两个关键问题,一是如何选择有效的邮件特征集,二是设计较好的邮件过滤算法。在对邮件特性进行分析的基础上,综合邮件头及邮件内容的主要形象特征给出了一种新的邮件特征集提取方法。用粗糙集的信息观点度量了各属性的重要性,并以此为权重进行加权朴素贝叶斯垃圾邮件过滤,有效地解决了朴素贝叶斯分类中的条件依赖性问题。通过在中英文邮件集上的测试实验,证明了所提出的邮件过滤方法的有效性。 展开更多
关键词 垃圾邮件过滤 特征选择 粗糙集 加权朴素贝叶斯
下载PDF
基于文本区域特征的图像型垃圾邮件过滤算法 被引量:8
16
作者 耿技 万明成 +1 位作者 程红蓉 周俊怡 《计算机应用》 CSCD 北大核心 2008年第8期1904-1906,共3页
垃圾邮件图像中通常含有大量文本区域,且这些区域常含有较多区分能力强的特征。提出一种基于图像中文本区域特征的垃圾邮件图像识别算法。首先提取出图像中文本区域的特征,包括:文本区域数量和面积、色饱和度、文字数量和颜色数量,以及... 垃圾邮件图像中通常含有大量文本区域,且这些区域常含有较多区分能力强的特征。提出一种基于图像中文本区域特征的垃圾邮件图像识别算法。首先提取出图像中文本区域的特征,包括:文本区域数量和面积、色饱和度、文字数量和颜色数量,以及图像的一些属性特征如图像面积等;然后利用支持向量机分类算法来识别垃圾邮件图像。实验表明,对于真实的邮件图像集,算法能够识别出98.5%的垃圾邮件图像,且正确率超过98%。 展开更多
关键词 图像型垃圾邮件 文本区域 垃圾邮件过滤 支持向量机
下载PDF
基于改进的局部敏感哈希算法实现图像型垃圾邮件过滤 被引量:13
17
作者 曹玉东 刘艳洋 +1 位作者 贾旭 王冬霞 《计算机应用研究》 CSCD 北大核心 2016年第6期1693-1696,共4页
提出一种快速的图像型垃圾邮件过滤方案,结合半监督机器学习技术改进局部敏感哈希(LSH)算法,基于改进的LSH算法构建垃圾图像特征库索引,提高图像的查找速度。搜集并构造了60 000个垃圾图像样本,实验结果表明,利用改进的LSH算法能有效地... 提出一种快速的图像型垃圾邮件过滤方案,结合半监督机器学习技术改进局部敏感哈希(LSH)算法,基于改进的LSH算法构建垃圾图像特征库索引,提高图像的查找速度。搜集并构造了60 000个垃圾图像样本,实验结果表明,利用改进的LSH算法能有效地提高垃圾图像的过滤速度。 展开更多
关键词 垃圾图像过滤 局部敏感哈希 图像特征提取 高维数据索引
下载PDF
基于变精度粗糙集决策树垃圾邮件过滤 被引量:14
18
作者 王靖 王兴伟 赵悦 《系统仿真学报》 CAS CSCD 北大核心 2016年第3期705-710,共6页
电子邮件以方便快捷、收费低廉的特点,深受人们青睐,成为最常用的通信手段之一。近年来,电子邮件被恶意利用,导致网络上垃圾邮件泛滥,浪费了网络资源,干扰邮件系统的正常运行,给用户的日常生活带来影响。为了过滤垃圾邮件,决策树算法被... 电子邮件以方便快捷、收费低廉的特点,深受人们青睐,成为最常用的通信手段之一。近年来,电子邮件被恶意利用,导致网络上垃圾邮件泛滥,浪费了网络资源,干扰邮件系统的正常运行,给用户的日常生活带来影响。为了过滤垃圾邮件,决策树算法被引入,根据提取出的邮件头部信息进行分析训练,并构建一棵决策树用于垃圾邮件的过滤。为了减少正常邮件被当作垃圾邮件情况的发生,降低给用户造成的损失,变精度粗糙集模型被引入,将少数特定实例或噪声数据分到合适的类别中。实验结果表明,该机制可用于垃圾邮件过滤,降低了正常邮件被判定为垃圾邮件的误报率。 展开更多
关键词 垃圾邮件 过滤 特征信息 变精度粗糙集 决策树
下载PDF
基于用户反馈的反垃圾邮件技术 被引量:9
19
作者 李洋 方滨兴 王申 《计算机工程》 CAS CSCD 北大核心 2007年第8期130-132,共3页
在分析传统垃圾邮件过滤技术的基础上,提出了一种基于用户反馈的反垃圾邮件技术。该技术通过引入用户反馈机制,使用改进的朴素贝叶斯方法,构建面向特定用户的过滤器,从而进行垃圾邮件过滤。邮件语料库实验和原型系统的测试证明,该方法... 在分析传统垃圾邮件过滤技术的基础上,提出了一种基于用户反馈的反垃圾邮件技术。该技术通过引入用户反馈机制,使用改进的朴素贝叶斯方法,构建面向特定用户的过滤器,从而进行垃圾邮件过滤。邮件语料库实验和原型系统的测试证明,该方法能够有效地降低误报率,提高反垃圾邮件系统的可用性,具有较好的实用效果。 展开更多
关键词 垃圾邮件过滤 机器学习 朴素贝叶斯方法 用户反馈
下载PDF
基于改进Nave Bayes的垃圾邮件过滤模型研究 被引量:10
20
作者 王涛 裘国永 何聚厚 《计算机工程与应用》 CSCD 北大核心 2007年第13期186-190,共5页
分析了目前在垃圾邮件过滤中广泛应用的NaveBayes过滤模型(NBF),指出了期望交叉熵(ECE)特征词选取方法的不足。提出了改进的NaveBayes垃圾邮件过滤模型(A-NBF),用改进的期望交叉熵(AECE)选取垃圾邮件特征词,并在邮件分类过程中对特征词... 分析了目前在垃圾邮件过滤中广泛应用的NaveBayes过滤模型(NBF),指出了期望交叉熵(ECE)特征词选取方法的不足。提出了改进的NaveBayes垃圾邮件过滤模型(A-NBF),用改进的期望交叉熵(AECE)选取垃圾邮件特征词,并在邮件分类过程中对特征词进行加权,从而提高对垃圾邮件过滤的精度。实验结果可以看出A-NBF比NBF在过滤精度方面有明显的提高。 展开更多
关键词 垃圾邮件过滤 朴素贝叶斯 期望交叉熵 特征选取
下载PDF
上一页 1 2 18 下一页 到第
使用帮助 返回顶部