期刊文献+

垃圾短信的智能识别和实时处理 被引量:7

Intellectual Recognition and Real-time Processing of the Junk Short Message
下载PDF
导出
摘要 本文提出了短信语义和号码特征相结合的垃圾信息智能识别方法。在分类器的设计上,采用了BP神经网络与支撑矢量机(SVM)的分类集成技术,使得分类识别效果明显。垃圾短信的学习样本识别正确率达99.86%,测试样本识别正确率达到97.4%。由于本文方法提取的特征构成了稀疏矩阵,因此大大缩短了机器学习时间,使得系统具有实时学习和实时提高分类能力的功能。 In this paper, an intellectual recognition method with the features of the number combined with short message's semantic is taken out. Considering the design of collecting classifier, both the BP neural network and the support vector machine (for short SVM) are adopted. Therefore the effect of classifying and recognizing is obvious. Correct rate reaches 99.86% as for learning samples and is up to 97.4% as for testing samples, Based on the features forming sparse-matrix, learning time is shorter and the function of real-tlme learning and improving the classifying ability is held.
出处 《电信科学》 北大核心 2008年第8期61-64,共4页 Telecommunications Science
关键词 垃圾短信 BP神经网络 SVM 机器学习 junk short message, BP neural network, SVM, machine learning
  • 相关文献

参考文献11

  • 1许建宏,李慧.移动短信业务发展中存在的问题及解决方案探讨[J].邮电设计技术,2004(6):25-29. 被引量:6
  • 2张燕,傅建明.垃圾短信的识别与追踪研究[J].计算机应用研究,2006,23(3):245-247. 被引量:21
  • 3Sebastiani F. Machine learning in automated text categorization. ACM Computing Surveys, 2002,34(1).
  • 4Bigi B. Using Kullback-1eibler distance for text categorization. In: Proc of the 25th European Conf on Information Retrieval (ECIR-03), Pisa, Springer-Verlag, 2003.
  • 5Li F, Yang Y. A loss function analysis for classification methods in text categorization. In: Proc of the ICML 2003, Washington, USA,2003.
  • 6http://www.nlplab.com/chinese/source.htm
  • 7加卢什金[俄].神经网络理论.北京:清华大学出版社,2002.
  • 8Lanckriet G, Cristianini N, Bartlett P, et al. Learning the kernel matrix with semidefinte programming. J Mach Learn Res, 2004(5).
  • 9Amari S, Wu S. Improving support vector machine classifiers by modifying kernel functions. Neural Networks, 1999, 12(6).
  • 10Smits G, Jordan E. Improved SVM regression using mixtures Of kernels .In : IJCNN, Honolulu,USA, 2002.

二级参考文献4

共引文献26

同被引文献35

  • 1费洪晓,戴宏伟,肖新华.Snort中BM模式匹配算法的研究与改进[J].计算机系统应用,2007,16(8):113-116. 被引量:6
  • 2何效金.垃圾短信过滤系统的设计与实现[D].成都:电子科技大学,2009.
  • 3万晓枫,惠孛.基于贝叶斯分类法的智能垃圾短信过滤系统[J].实验科学与技术,2013,11(5):44-47,76.
  • 4Schmidhuber J. Deep learning in neural networks:an over- view [ J]. Neural Networks,2015,61 ( 1 ) :85 - 117.
  • 5Bengio, Ducharme R, Vincent P, et al. A neural probabilistic language model [ J ]. Journal of Machine Learning Research, 2003(3) :1137 - 1155.
  • 6Mikolov T, Chen K, Corrado G, et al. Efficient estimation of word representations in vector space [ C]. Scottsdale, Arizo- na : ICLR Workshop ,2013.
  • 7Hinton G E, Osindero S, The Y W. A fast learning algorithm for deep belief nets [ J ]. Neural Computation, 2006 ( 18 ) : 1527 - 1554.
  • 8Tieleman. Training restricted bohzmann machines using ap- preximations to the likelihood gradient [ C]. Helsinki, Fin- land : ICML, 2008.
  • 9Kazuhiro Shin - ike. A two phase method for determining the number of neurons in the hidden layer of a 3 - Layer neural network [ C ]. Taipei, Taiwan: SICE Annual Conference,2010.
  • 10LE Q V.Building high-level features using large scale unsupervised learning[C]//Proceedings of 2013 IEEE International Conference on Acoustics,Speech and Signal Processing.Vancouver:IEEE,2013:8595-8598.

引证文献7

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部