期刊文献+

基于协作式标注图像数据的垃圾标签检测方法 被引量:1

Spam Tag Detection Method Based on Collaborative Annotation Image Data
下载PDF
导出
摘要 由于用户标签的不准确和语义模糊使得协作式标注图像检索正确率低,而现有垃圾标签过滤方法往往关注标签本身,忽略了协作式标签与图像的关联性。本文在分析协作式标注图像视觉内容与标签的关联性的基础上,提出一种基于协作式标注图像视觉内容的垃圾标签检测方法。该方法分析同一标签下图像视觉内容,设计不同的核函数用于颜色和SIFT(Scale-invariant feature transform)特征子集,同时将2种低维特征映射到高维多模特征空间形成混合核函数,对同一标签下的图像进行基于混合核的最大最小距离聚类,少数群体的标签说明与图像内容关联性小则为用户标注错误的标签,从而检测垃圾标签。实验结果表明,该方法能够提高协作式图像垃圾标签检测的正确性。 The accuracy of the collaborative tagging image retrieval is lower because of the inaccuracy of user’ s annotation. Exist-ing spam tag detection methods tend to focus on label itself, ignoring the correlation between collaborative label and image. Ana-lyzing the correlation of collaborative tagging image visual content and image tags, the spam tag detection method of collaboration annotation based on visual content of collaborative tagging image is proposed. The method analyze visual content of images which have the same tag and design different kernel functions for color and SIFT feature subset. The two features will be mapped form low dimensional space to high dimensional character space, while the mixed-kernel function is established. Finally, the images which have the same tag is clustered by max-min distance means, and the tag of images in the class which has a few images are spam tags because of weak correlation. The experimental results show that the method can improve the accuracy of the tag spam detection on collaborative annotation images.
出处 《计算机与现代化》 2015年第6期41-45,共5页 Computer and Modernization
基金 河南省高等学校重点科研项目(15A520025)
关键词 高斯核 混合核 最大最小聚类 协作式标注 垃圾标签 SIFT Gaussian kernel mixed-kernel SIFT max-min cluster collaborative annotation spam tag
  • 相关文献

参考文献16

  • 1Saini J S. A study of spam detection algorithm on social media networks[J]. Computational Intelligence, Cyber Security and Computational Models, 2014,246:195-202.
  • 2Zhou Ning, Shen Yi, Fan Jianping. Automatic image annotation by using relevant keywords extracted from auxiliary text documents[C]// Proceedings of the International Workshop on Verylargescale Multimedia Corpus, Mining and Retrieval. New York:ACM. 2010:7-12.
  • 3易明,王学东,邓卫华.基于社会网络分析的社会化标签网络分析与个性化信息服务研究[J].中国图书馆学报,2010,36(2):107-114. 被引量:34
  • 4Wang De, Irani D, Pu C. SPADE: A socialspam analytics and detection framework[J]. Social Network Analysis and Mining, 2014,4(1):1-18.
  • 5Yue Jun, Li Zhenbo, Liu Lu, et al. Contentbased image retrieval using color and texture fused features[J]. Mathematical and Computer Modelling, 2011,54(3-4):1121-1127.
  • 6刘广海,吴璟莉.基于颜色体积直方图的图像检索[J].计算机科学,2012,39(1):273-275. 被引量:14
  • 7刘峤,秦志光,程红蓉,钱伟中.基于颜色和边缘特征直方图的图像型垃圾邮件分类模型[J].计算机应用研究,2010,27(7):2608-2610. 被引量:5
  • 8Burger W, Burge M J. SIFTScaleInvariant local features[J]. Principles of Digital Image Processing, 2013:229-296.
  • 9Fan Jianping, Shen Yi, Zhou Ning, et al. Harvesting largescale weaklytagged image databases from the Web[C]// Proceedings of 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2010:802-809.
  • 10Fan Jianping, Daniel A K, Gao Yuli, et al. JustClick: Personalized image recommendation via exploratory search from largescale Flickr image collections[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2009,19(2):273-28.

二级参考文献51

  • 1孔锐,张国宣,施泽生,郭立.基于核的K-均值聚类[J].计算机工程,2004,30(11):12-13. 被引量:46
  • 2余力,刘鲁.电子商务个性化推荐研究[J].计算机集成制造系统,2004,10(10):1306-1313. 被引量:104
  • 3VAPNIK V. The nature of statistical learning theory[ M]. New York: Springer-Verlag, 1995.
  • 4邓乃阳,田英杰.数据挖掘中的新方法:支持向量机[M].北京:科学出版社,2004.
  • 5LIN H T, LIN C J. A study on sigmoid kernels for SVM and the training of non-PSD kernels by SMO-type methods[ EB/OL]. [2009 -8 -17]. http:// www. csie. ntu. edu. tw /- cjlin/papers /tanh. pdf.
  • 6BACH F R, LANCKRIET G R G, JORDAN M I. Multiple kernel learning, conic duality, and the SMO algorithm[ C]// Proceedings of the 21st International Conference on Machine-Learning. Banff. New York: ACM, 2004, 69:41 -48.
  • 7KEERTHI S S, SHEVADE S K, BHATTACHARYYA C, et al. Improvements to Platt's SMO algorithm for SVM classifier design [ J]. Neural Computation, 2001, 13(3) : 637 - 649.
  • 8SMITS G F, JORDAAN E M. Improved SVM regression using mixtures of kernels [ C]// Proceedings of the 2002 International Joint Conference on Neural Networks. Washington, DC: IEEE, 2002, 3: 2785 - 2790.
  • 9ZHOU S S, LIU H W, YE F. Variant of gaussian kernel and parameter setting method for nonlinear SVM[ J]. Neurocomputing, 2009, 72( 13 - 15) : 2931 - 2937.
  • 10STEINWART I. On the optimal parameter choice for support vector machines[ J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2003, 25(10) : 1274 - 1284.

共引文献64

同被引文献11

引证文献1

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部