期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
Mobile SMS Spam Filtering for Nepali Text Using Naive Bayesian and Support Vector Machine 被引量:2
1
作者 Tej Bahadur Shahi Abhimanu Yadav 《International Journal of Intelligence Science》 2014年第1期24-28,共5页
Spam is a universal problem with which everyone is familiar. A number of approaches are used for Spam filtering. The most common filtering technique is content-based filtering which uses the actual text of message to ... Spam is a universal problem with which everyone is familiar. A number of approaches are used for Spam filtering. The most common filtering technique is content-based filtering which uses the actual text of message to determine whether it is Spam or not. The content is very dynamic and it is very challenging to represent all information in a mathematical model of classification. For instance, in content-based Spam filtering, the characteristics used by the filter to identify Spam message are constantly changing over time. Na?ve Bayes method represents the changing nature of message using probability theory and support vector machine (SVM) represents those using different features. These two methods of classification are efficient in different domains and the case of Nepali SMS or Text classification has not yet been in consideration;these two methods do not consider the issue and it is interesting to find out the performance of both the methods in the problem of Nepali Text classification. In this paper, the Na?ve Bayes and SVM-based classification techniques are implemented to classify the Nepali SMS as Spam and non-Spam. An empirical analysis for various text cases has been done to evaluate accuracy measure of the classification methodologies used in this study. And, it is found to be 87.15% accurate in SVM and 92.74% accurate in the case of Na?ve Bayes. 展开更多
关键词 SMS spam filtering Classification Support Vector Machine Naive Bayes PREPROCESSING Feature Extraction Nepali SMS Datasets
下载PDF
Large margin classification for combatingdisguise attacks on spam filters 被引量:1
2
作者 Xi-chuan ZHOU Hai-bin SHEN +1 位作者 Zhi-yong HUANG Guo-jun LI 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2012年第3期187-195,共9页
This paper addresses the challenge of large margin classification for spare filtering in the presence of an adversary who disguises the spam mails to avoid being detected. In practice, the adversary may strategically ... This paper addresses the challenge of large margin classification for spare filtering in the presence of an adversary who disguises the spam mails to avoid being detected. In practice, the adversary may strategically add good words indicative of a legitimate message or remove bad words indicative of spam. We assume that the adversary could afford to modify a spam message only to a certain extent, without damaging its utility for the spammer. Under this assumption, we present a large margin approach for classification of spare messages that may be disguised. The proposed classifier is formulated as a second-order cone programming optimization. We performed a group of experiments using the TREC 2006 Spam Corpus. Results showed that the performance of the standard support vector machine (SVM) degrades rapidly when more words are injected or removed by the adversary, while the proposed approach is more stable under the disguise attack. 展开更多
关键词 Large margin spam filtering Second-order cone programming (SOCP) Adversarial classification
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部