
External Support Vector Machine Near-Duplicate Image Clustering Algorithm Based on Greedy Tree (cited by: 1)
Abstract Accurately detecting near-duplicate images is important for redundancy removal and copyright infringement detection. To improve the performance of Uniform Splitting based Support Vector Machine External Clustering (US-SVMEC), this paper proposes a near-duplicate image clustering algorithm that combines a greedy tree with SVMEC (GT-SVMEC). The method first applies SVMEC to partition the dataset into two clusters, then uses a greedy tree growing algorithm to choose the "best" cluster to split, and repeats this procedure until no further improvement can be achieved and the clusters can no longer be split. In addition, to overcome the synonymy problem of image visual words, a Probabilistic Latent Semantic Analysis (PLSA) model is used to map co-occurring visual words to the same direction in the latent semantic space. Experimental results show that, compared with SVM-Internal Clustering (SVMIC) and US-SVMEC, the proposed approach clearly improves clustering performance.
Source Journal of Signal Processing (《信号处理》), CSCD, Peking University Core Journal, 2012, No. 4, pp. 601-606 (6 pages)
Funding National Natural Science Foundation of China (Grant No. 60872142)
Keywords clustering; greedy tree; support vector machine; probabilistic latent semantic analysis
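
The split-and-select loop summarized in the abstract can be illustrated with a minimal sketch. Several parts are assumptions not taken from the paper: the two-way split uses plain 2-means as a stand-in for SVMEC, the "best" cluster is taken to be the one whose split gives the largest drop in within-cluster squared error, and the names `binary_split`, `greedy_tree_cluster`, `min_gain` and `max_clusters` are hypothetical. The PLSA mapping step is not covered.

```python
from math import dist  # Python 3.8+


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def sse(points):
    """Sum of squared distances from each point to the cluster centroid."""
    c = centroid(points)
    return sum(dist(p, c) ** 2 for p in points)


def binary_split(points, iters=20):
    """Two-way split. ASSUMPTION: plain 2-means stands in for SVMEC here."""
    # Deterministic seeding: two points that are far apart.
    b = max(points, key=lambda p: dist(p, points[0]))
    a = max(points, key=lambda p: dist(p, b))
    left, right = list(points), []
    for _ in range(iters):
        left = [p for p in points if dist(p, a) <= dist(p, b)]
        right = [p for p in points if dist(p, a) > dist(p, b)]
        if not left or not right:
            break  # degenerate split, e.g. all points identical
        new_a, new_b = centroid(left), centroid(right)
        if (new_a, new_b) == (a, b):
            break  # assignments have converged
        a, b = new_a, new_b
    return left, right


def greedy_tree_cluster(points, min_gain=1.0, max_clusters=None):
    """Grow a tree of clusters by always splitting the most profitable leaf.

    ASSUMPTION: "profit" is the SSE reduction of the candidate split, and
    growth stops once the best reduction is at most `min_gain`.
    """
    leaves = [list(points)]
    while max_clusters is None or len(leaves) < max_clusters:
        best = None
        for i, leaf in enumerate(leaves):
            if len(leaf) < 2:
                continue
            left, right = binary_split(leaf)
            if not left or not right:
                continue
            gain = sse(leaf) - sse(left) - sse(right)
            if best is None or gain > best[0]:
                best = (gain, i, left, right)
        if best is None or best[0] <= min_gain:
            break  # no leaf can be split with a worthwhile improvement
        _, i, left, right = best
        leaves[i:i + 1] = [left, right]  # replace the leaf by its two children
    return leaves
```

Calling `greedy_tree_cluster(points)` on a list of coordinate tuples returns a list of clusters. Unlike uniform splitting, which divides every leaf at each level, only the single most profitable leaf is divided per step, so well-formed clusters are left intact while loose ones keep being refined.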


