期刊文献+

图像垃圾邮件中文本区域的自动提取方法 被引量:1

Text region extraction in image-based spam email
下载PDF
导出
摘要 图像中的文本区域为判别图像垃圾邮件提供了重要依据。为了获得图像中的文本区域信息,提出了基于Hough变换提取图像中倾斜文本区域的算法和降低图像背景干扰的八邻域细小边缘去除算法,实现了一种不受图像中文本颜色、字体、大小、位置、方向限制的文本区域的自动提取方法。在包含100幅垃圾图像的数据集上进行提取图像文本区域的实验。实验结果显示,新方法具有良好的文本区域提取性能。 Text regions provide an important clue for filtering image spam. To get the information of the text region in image spam, an algorithm based on Hough transform was proposed for slant text region extraction, and a tiny region removal algorithm based on eight-neighbor pixels was also proposed for effectively eliminating the disturbance of background image. The two algorithms were integrated to implement an approach of automatic extraction of the text region. The new approach was insensitive to the orientation, location, color, font, and size of the text. The simulation experiments were carried on among a collection of 100 spam images. Results show a good performance of text region extraction.
出处 《解放军理工大学学报(自然科学版)》 EI 北大核心 2009年第3期258-261,共4页 Journal of PLA University of Science and Technology(Natural Science Edition)
基金 国家863计划资助项目(2006AA01Z411)
关键词 HOUGH变换 文本区域提取 图像垃圾邮件判别 彩色边缘检测 Hough transformation text region extraction image-based spare filtering color edge detection
  • 相关文献

参考文献8

  • 1WIKIPEDIA.Image spam[EB/OL].http://en.wikipedia.org/wiki/Image-spam.[2008-12-01].
  • 2许洋洋,袁华.一种基于内容的广告垃圾图像过滤方法[J].山东大学学报(理学版),2006,41(3):73-78. 被引量:9
  • 3HRISHIKESH B A,GREGORY K M,JAMES A H.Image analysis for efficient categorization of image-based spam e-mail[C].Seoul:Proceedings of the 2005 Eight International Conference on Document Analysis and Recognition (ICDAR'05),2005.
  • 4WU Ching-tung,CHENG Kwang-ting,ZHU Qing,et al.Using visual features for anti-spam filtering[C].Genova:IEEE International Conference on Image Processing(ICIP 2005),2005.
  • 5KEECHUL J,ANIL K J.Hybrid approach to efficient text extraction in complex color images[J].Pattern Recognition Letters,2004,25(1):679-699.
  • 6Céline Mancas-Thillou BEMARD G.Color text extractopm from camera-based images the impact of the choice of the clustering distance[C].Seoul:Proceedings of the 2005 Eight International Conference on Document Analysis and Recognition (ICDAR'05),2005.
  • 7Céline Mancas-Thillou BEMARD G.Spatial and color spaces combination for natural scene text extraction[C].Atlanta:Proceedings of the International Conference on Image Processing (ICIP'06),2006.
  • 8张引,潘云鹤.复杂背景下文本提取的彩色边缘检测算子设计[J].软件学报,2001,12(8):1129-1135. 被引量:20

二级参考文献11

共引文献27

同被引文献7

  • 1王斌,潘文锋.基于内容的垃圾邮件过滤技术综述[J].中文信息学报,2005,19(5):1-10. 被引量:129
  • 2YIH W T,MCCANN R,KOLCZ A.Improving spam filtering by detecting gray mail[C] //Fourth Conference on Email and Anti-Sparn.Mountain View,CA:CEAS,2007.
  • 3CLEARY J G,WRITTEN I H.Data compressing using adaptive coding and partial string matching[J].IEEE Transaction on Communications,1984,32(4):396-402.
  • 4ZEITOUN I K,YEH L.Join indices as a tool for spatial data mining[C] //International Workshop on Temporal,Spatial and Spatio-Temporal Data Mining,Lecture Notes in Artificial Intelligence.Paris:Springer Press,2007:102-114.
  • 5刘洋,杜孝平,罗平,等.垃圾邮件的智能分析、过滤及Rough集讨论[R].武汉:第十二届中国计算机学会网络与数据通信学术会议,2002.
  • 6万明成,耿技,程红蓉,陈佳.图像型垃圾邮件过滤技术综述[J].计算机应用研究,2008,25(9):2579-2582. 被引量:6
  • 7王龙,李晓光,钟绍春.基于K-近邻法及移动agent技术的垃圾邮件检测系统研究[J].计算机应用研究,2009,26(7):2630-2632. 被引量:3

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部