期刊文献+

基于改进视觉词袋模型的图像标注方法 被引量:5

Image Annotation Method Based on Improved BoVW Model
下载PDF
导出
摘要 针对传统视觉词袋模型对图像尺度变化较为敏感的缺点,提出一种基于改进视觉词袋模型的图像标注方法。该方法引入图像的多尺度空间信息,对图像进行多尺度变换并构建多尺度视觉词汇表,将图像表示为不同尺度特征,结合多核学习的方法优化各尺度特征的相应权重,获取特征表示。实验结果验证了该方法的有效性,其标注准确率比传统BoVW模型提高17.8%-25.7%。 Aiming at overcoming the traditional Bag of Visual Word(BoVW) model's sensitivity to image scale's variation, this paper proposes an image annotation method based on improved BoVW model. It incorporates with multiple spaces information and transfers original images into multiple scale spaces and constructs multiple scale vocabularies. Images are represented as a family of feature histograms with different scale. Multiple kernel learning is introduced to optimize the histograms weights of different scale in order to acquire discriminative classifying power. Experimental results prove the validity of the method, it outperforms BoVW on image annotation precision ranged from 17.8% to 25.7%.
作者 霍华 赵刚
出处 《计算机工程》 CAS CSCD 2012年第22期276-278,282,共4页 Computer Engineering
基金 国家自然科学基金资助项目(60743008) 河南省国际科技合作计划基金资助项目(104300510063)
关键词 图像标注 视觉词袋模型 多尺度空间 多尺度视觉词 多核学习 权重优化 image annotation Bag of Visual Word(BoVW) model multiple scale space multiple scale visual word multiple kernel learning weight optimization
  • 相关文献

参考文献9

  • 1Sivic J. Video Google: A Text Retrieval Approach to Object Matching in Videos[C]//Proc. of the International Conf. on Computer Vision. Nice, France: IEEE Press, 2003.
  • 2程蕾,吴秀清.局部特征几何结构用于目标识别[J].计算机工程与应用,2010,46(26):191-193. 被引量:3
  • 3Lopez-Sastre R J, Tuytelaars T, Acevedo-Rodriguez F J, et al. Towards a More Discriminative and Semantic Visual Voca-bulary[J]. Computer Vision and Image Understanding, 2010, 115(3): 415-425.
  • 4Elsayad I, Martinet J, Urruty T, et al. A New Spatial Weighting Scheme for Bag-of-visual-words[C]//Proc. of IEEE International Workshop on Content-Based Multimedia Indexing. Grenoble, France: IEEE Press, 2010.
  • 5Ding Guiguang, Wang Jianmin, Qin Kai. A Visual Word Weighting Scheme Based on Emerging Itemset for Video Annotatio[J]. Information Processing Letters, 2010, 110(16): 692-696.
  • 6Sonnenburg S. Large Scale Multiple Kernel Learning[J]. Journal of Machine Learning Research, 2006, 7(1): 1531-1565.
  • 7Chang Chih-Chung, Lin Chih-Jen. LIBSVM: A Library for Support Vector Machines[EB/OL]. (2011-11-05). http://www.csie. ntu.edu.tw/cjlin/.
  • 8van Gemert J C, Veenman C J, Smeulders A W M. Visual Word Ambiguity[J]. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2009, 32(7): 1271-1283.
  • 9Yang Jun, Jiang Yugang, Hauptmann A G, et al. Evaluating Bag-of-Visual-Words Representations in Scene Classification[C]// Proc. of ACM SIGMM International Workshop on Multimedia Information Retrieval. New York, USA: ACM Press, 2007.

二级参考文献8

  • 1Lowe D.Distinctive image features from scale-invariant keypoints[J].International Journal of Computer Vision, 2004,60 (2) : 91-110.
  • 2Bay H,Tuytelaars T,van Gool L.SURF:Speeded up robust features[C]//European Conference on Computer Vision, Graz, Austria, 2006 : 404-417.
  • 3Csurka G, Dance C, Lixin F, et al.Visual categorization with bags of keypoints[C]//European Conference on Computer Vision,Prague,Czech Republic,2004: 59-74.
  • 4Nister D, Stewenius H.Scalable recognition with a vocabulary tree[C]//IEEE Proceedings of the International Conference on Computer Vision and Pattern Recognition, 2006: 2161-2168.
  • 5Fergus R, Perona P, Zisserman A.Weakly supervised scale-invariant learning of models for visual recognition[J].Intemational Journal of Computer Vision, 2007,71 (3) : 273-303.
  • 6Fergus R,Perona P,Zisserman A.A sparse object category model for efficient learning and exhaustive reeognition[C]//IEEE Proceedings of the International Conference on Computer Vision and Pattern Recognition,2005:380-387.
  • 7Fergus R,Perona P,Zisserman A.Object class recognition by unsupervised scale-invariant leaming[C]//IEEE Proceedings of the International Conference on Computer Vision and Pattern Recognition, 2003 : 264-271.
  • 8Mikolajcayk K,Schmid C.A performance evaluation of local descriptors[J].IEEE Trans on Pattern Analysis and Machine Intelligence,2005,27(10) : 1615-1630.

共引文献2

同被引文献28

  • 1杨桄,张柏,王宗明,刘岩鹤.基于阴影搜索法的飞机目标遥感图像分割研究[J].地理与地理信息科学,2006,22(1):48-50. 被引量:5
  • 2徐大琦,倪国强,许廷发.中高分辨力遥感图像中飞机目标自动识别算法研究[J].光学技术,2006,32(6):855-858. 被引量:9
  • 3蔡红苹,耿振伟,粟毅.遥感图像飞机检测新方法——圆周频率滤波法[J].信号处理,2007,23(4):539-543. 被引量:9
  • 4L6pez-Sastre R J, Tuytelaars T, Aeevedo-Rodriguez F J, et al. Towards a more discriminative and se- mantic visual vocabulary[J]. Computer Vision and Image Understanding, 2011, 115(3): 415-425.
  • 5Elsayad I, Martinet J, Urruty T, et al. A new spa- tial weighting scheme for bag-of-visual-words[C]// 2010 International Workshop on Content-Based Mul-timedia Indexing (CBMI). [S. 1.]:IEEE, 2010.. 1-6.
  • 6Lowe D G. Distinctive image features from scale-in- variant key points[J]. International Journal of Com- puter Vision, 2004, 60(2):91-110.
  • 7MacQueen J. Some methods for classification and a- nalysis of multivariate observations[C]//Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. [S. 1] ; University of Calif. Press, 1967,(1) : 281-297.
  • 8Dash M, Liu H. Feature selection for classification [J]. Intelligent Data Analysis, 1997, 1(3).. 131- 156.
  • 9Jurie F, Triggs B. Creating efficient codebooks for visual recognition [C]//Tenth IEEE International Conference on Computer Vision, ICCV 2005. [S. I. ]..IEEE, 2005, 1: 604-610.
  • 10Wang L. Toward a discriminative codebook: code- word selection across multi-resolution [ C]//IEEE Conference on Computer Vision and Pattern Recogni- tion, CVPR'07. [S. 1.]:IEEE, 2007:1-8.

引证文献5

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部