External Support Vector Machine Near-Duplicate Image Clustering Algorithm Based on Greedy Tree (cited by: 1)

Abstract: Accurate detection of near-duplicate images is important for redundancy removal and copyright infringement detection. To improve the performance of Uniform Splitting based Support Vector Machine External Clustering (US-SVMEC), this paper proposes a near-duplicate image clustering algorithm that combines a greedy tree with SVMEC (GT-SVMEC). First, SVMEC is applied to cluster the dataset into two clusters. Then a greedy tree growing algorithm chooses the "best" cluster to split. This procedure is repeated until no further split yields an improvement. In addition, to overcome the problem of visual word synonymy, a Probabilistic Latent Semantic Analysis (PLSA) model is adopted to map co-occurring image visual words to the same direction in the latent semantic space. Experimental results show that, compared with SVM-Internal Clustering (SVMIC) and US-SVMEC, the proposed approach clearly improves clustering performance.
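The split-and-select loop described in the abstract can be illustrated with a minimal Python sketch. Several parts are assumptions not given in the abstract: the binary splitter here is plain 2-means, used only as a stand-in for the SVMEC step; the "best" cluster is taken to be the one whose split most reduces the sum of squared error; and the stopping threshold `min_gain` and all function names are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Point = Tuple[float, ...]
Cluster = List[Point]
Splitter = Callable[[Sequence[Point]], Tuple[Cluster, Cluster]]


def dist2(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def centroid(points: Sequence[Point]) -> Point:
    """Component-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(p[d] for p in points) / n for d in range(len(points[0])))


def sse(points: Sequence[Point]) -> float:
    """Sum of squared distances to the centroid (0 for an empty cluster)."""
    if not points:
        return 0.0
    c = centroid(points)
    return sum(dist2(p, c) for p in points)


def two_means_split(points: Sequence[Point], iters: int = 20) -> Tuple[Cluster, Cluster]:
    """Stand-in binary splitter (plain 2-means); NOT the paper's SVMEC step."""
    # Deterministic seeds: the first point and the point farthest from it.
    a = points[0]
    b = max(points, key=lambda p: dist2(p, a))
    left, right = list(points), []
    for _ in range(iters):
        left = [p for p in points if dist2(p, a) <= dist2(p, b)]
        right = [p for p in points if dist2(p, a) > dist2(p, b)]
        if not left or not right:
            break  # degenerate split, e.g. all points identical
        a, b = centroid(left), centroid(right)
    return left, right


def greedy_tree_clustering(points: Sequence[Point],
                           split: Splitter = two_means_split,
                           min_gain: float = 1.0) -> List[Cluster]:
    """Grow a binary tree greedily; the leaves are the final clusters."""
    leaves: List[Cluster] = [list(points)]
    while True:
        best = None  # (gain, leaf index, left child, right child)
        for i, leaf in enumerate(leaves):
            if len(leaf) < 2:
                continue
            left, right = split(leaf)
            if not left or not right:
                continue
            # Assumed criterion: reduction in total squared error.
            gain = sse(leaf) - sse(left) - sse(right)
            if best is None or gain > best[0]:
                best = (gain, i, left, right)
        # Stop when no leaf can be split with a worthwhile gain.
        if best is None or best[0] <= min_gain:
            return leaves
        _, i, left, right = best
        leaves[i:i + 1] = [left, right]


if __name__ == "__main__":
    data = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
            (5.0, 5.0), (5.1, 5.0), (5.0, 5.1),
            (10.0, 0.0), (10.1, 0.0), (10.0, 0.1)]
    for cluster in greedy_tree_clustering(data, min_gain=1.0):
        print(cluster)
```

On this toy data the loop should stop at three clusters, because any further split gains far less than `min_gain`. Passing a different `split` callable is where an SVM-based bisection would plug in; the greedy selection loop itself does not depend on how each leaf is divided.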

Authors: Lin Boyu (蔺博宇), Li Bicheng (李弼程), Gao Haolin (高毫林), Hu Wenbo (胡文博)

Affiliation: Institute of Information Engineering, PLA Information Engineering University (解放军信息工程大学信息工程学院)

Source: Journal of Signal Processing (《信号处理》), indexed in CSCD and the Peking University Core Journals list, 2012, No. 4, pp. 601-606 (6 pages)

Funding: National Natural Science Foundation of China (60872142)

Keywords: clustering; greedy tree; support vector machine; probabilistic latent semantic analysis

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
