期刊文献+

采用上下文金字塔特征的场景分类 被引量:14

Scene Classification with Context Pyramid Features
下载PDF
导出
摘要 为了能有效地表述场景图像的语义特性,提出一种基于图像块上下文信息的场景图像分类框架.首先用规则网格将图像分块,并提取每个块的SIFT特征;然后用K均值算法对训练图像的块特征聚类,形成块类型的码本;再根据此码本对图像块进行量化,得到图像的视觉词汇表示,形成视觉词汇图,并在其上建立2类视觉词汇模型:相邻共现的不同视觉词汇对模型和连续共现的相同视觉词汇群模型;最后应用空间金字塔匹配建立视觉词汇的上下文金字塔特征,并采用SVM分类器进行分类.实验结果证明,在常用的场景图像库上,文中方法比已有的典型方法具有更好的场景分类性能. To describe the semantic characteristic of scene images efficiently, this paper proposes a scene image classification framework based on image patch context information. First, the patches of images are got by a regular grid, and their SIFT (scale invariant feature transform) features are extracted. Then the SIFT features of training images are clustered with the K-means algorithm to form a codebook of the patches. We quantize the patches of images according to this codebook and get the visual word representation of the image, which forms a visual word map. In the map, two kinds of visual word models are set up: one is visual word pair with different words and the other is visual word group that consists of the same and consecutive words. Finally by applying spatial pyramid matching, the context pyramid features of visual words are obtained and classified with SVM. Experiments in frequently used scene image databases show that our method has got better performance than the existing typical methods in classifying scene images.
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2010年第8期1366-1373,共8页 Journal of Computer-Aided Design & Computer Graphics
基金 国家自然科学基金(40971245)
关键词 场景分类 上下文信息 空间金字塔匹配 图像块 scene classification context information spatial pyramid matching image patch
  • 相关文献

参考文献20

  • 1Vailaya A, Figueiredo M, Jain A, et al. Content-based hierarchical classification of vacation images [C]//Proceedings of IEEE International Conference on Multimedia Computing and Systems, Florence, 1999, 1:9518-9523.
  • 2Szummer M, Picard R W. Indoor-outdoor image classification [C] //Proceedings of IEEE International Workshop on Content-Based Access of Image and Video Databases, Bombay, 1998:42-52.
  • 3Oliva A, Torralba A. Modeling the shape of the scene: a holistic representation of the spatial envelope [J]. International Journal of Computer Vision, 2001, 42(3) : 145- 175.
  • 4Carson C, Thomas M, Belongie S, et al. Blobworld: a system for region-based image indexing and retrieval [C] // Proceedings of International Conference on Visual Information Systems, Amsterdam, 1999:660-672.
  • 5Sivic J, Zisserman A. Video Google: a text retrieval approach to object matching in videos [C]//Proceedings of the 9th IEEE International Conference on Computer Vision, Nice, 2003, 2: 1470-1478.
  • 6Lazebnik S, Schmid C, Ponce J. Beyond bags of features: spatial pyramid matching for recognizing natural categories [C]//Proceedings of IEEE Computer Sociely Conference on Computer Vision and Pattern Recognition, New York, 2006:2169-2178.
  • 7Quelhas P, Monay F, Odobez J M, et al. A thousand words in a scene [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007, 29(9): 1575-1589.
  • 8Bosch A, Zisserman A, Mufioz X. Scene classification using a hybrid generative/discriminative approach [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(4): 712-727.
  • 9Hofmann T. Unsupervised learning by probabilistic latent semantic analysis [J]. Machine Learning, 2001, 42 (1/2) : 177-196.
  • 10BleiDM, Ng A Y, Jordan M I. Latent Dirichlet allocation [J]. Journal of Machine Learning Research, 2003, 3:993- 1022.

二级参考文献90

  • 1吴洪,卢汉清,马颂德.基于内容图像检索中相关反馈技术的回顾[J].计算机学报,2005,28(12):1969-1979. 被引量:52
  • 2王润生.图像理解[M].长沙:国防科技大学出版社,1994..
  • 3施智平,李清勇,史俊,史忠植.集成视觉特征和语义信息的相关反馈方法[J].计算机辅助设计与图形学学报,2007,19(9):1138-1142. 被引量:4
  • 4Ulaby F, Kouyate F, Brisco B,Williams L.Textural information in SAR images [J]. IEEE Trans Geoscience and Remote Sensing, 1986, GE-24:235 - 245.
  • 5Canny J. A computational approach to edge detection [ J ]. IEEE Trans on Pattern Analysis Machine Intelligence, 1986,8( 11 ) :679 - 698.
  • 6Dainty J. Laser Speckle and Related Phenomena ( Vol. 9 ) [ M ]. New York : Springer-Verlag Berlin Heidelberg, 1975.
  • 7Touzi R, Lopes A, Bousquet P. A statistical and geometrical edge detector for SAR images [J]. IEEE Trans on Geoscience and Remote Sensing, 1988,26(6) :764 - 773.
  • 8Skingley J, Rye A. The Hough transform applied to SAR images for thin line detection [J]. Pattern Recognit Lett, 1987,6(3) :61 - 67.
  • 9Burns B. Extracting straight lines [J] .IEEE Trans Part Anal Machine Intell, 1986, PAMI-8(4) :425 -455.
  • 10Merlet N, Zerubia J. New prospects in line detection by dynamic programming [J]. IEEE Trans Pattern Anal Machine Intell, 1996, 18(4) :426 - 431.

共引文献66

同被引文献173

引证文献14

二级引证文献65

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部