期刊文献+

一种视觉词软直方图的图像表示方法 被引量:2

Visual Word Soft-Histogram for Image Representation
下载PDF
导出
摘要 基于视觉词的统计建模和判别学习,提出一种视觉词软直方图的图像表示方法.假设属于同一视觉词的图像局部特征服从高斯混合分布,利用最大-最小后验伪概率判别学习方法从样本中估计该分布,计算局部特征与视觉词的相似度.累加图像中每个视觉词与对应局部特征的相似度,在全部视觉词集合上进行结果的归一化,得到图像的视觉词软直方图.讨论了两种具体实现方法:一种是基于分类的软直方图方法,该方法根据相似度最大原则建立局部特征与视觉词的对应关系;另一种是完全软直方图方法,该方法将每个局部特征匹配到所有视觉词.在数据库Caltech-4和PASCAL VOC 2006上的实验结果表明,该方法是有效的. This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models (GMM) to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.
出处 《软件学报》 EI CSCD 北大核心 2012年第7期1787-1795,共9页 Journal of Software
基金 国家自然科学基金(60973059 90920009)
关键词 视觉词 软直方图 图像表示 高斯混合模型 判别学习 visual word soft-histogram image representation Gaussian mixture model discriminative learning
  • 相关文献

参考文献1

二级参考文献16

  • 1Csurka G,Dance C,Fan L,Willamowski J,Bray C.Visual categorization with bags of keypoints//Proceedings of the 2004 ECCV International Workshop on Statistical Learning in Computer Vision.Prague,Czech Republic,2004:59-74
  • 2Sivic J,Russell B,Efros A,Zisserman A,Freeman W.Discovering objects and their localization in images//Proceedings of the 10th International Conference on Computer Vision(ICCV'05).Beijing,China,2005,1:370-377
  • 3Winn J,Criminisi A,Minks T.Object categorization by learned universal visual dictionary//Proceedings of the 10th International Conference on Computer Vision (ICCV' 05).Beijing,China,2005,2:1800-1807
  • 4Li Fei-Fei,Fergus R,Perona P.One-shot learning of object categories.IEEE Transactions on Pattern Analysis and Machine Intelligence,2006,28(4):594-611
  • 5Burl M,Weber M,Perona P.A probabilistic approach to object recognition using local photometry and global geometry//Proceedings of the 5th European Conference on Computer Vision.Freiburg,Germany,1998,2:628-641
  • 6Weber M,Welling M,Perona P.Unsupervised learning of models for recognition//Proceedings of the 6th European Conference on Computer Vision.Dublin,Ireland,2000,1:18-32
  • 7Fergus R,Perona P,Zisserman A.Object class recognition by unsupervised scale-invariant learning//Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'03).Madison,Wisconsin,USA,2003,2:264-271
  • 8Agarwal S,Roth D.Learning a sparse representation for object detection//Proceedings of the 7th European Conference on Computer Vision.Copenhagen,Denmark,2002,4:113-130
  • 9Felzenszwalb P,Huttenlocher D.Pictorial structures for object recognition.International Journal of Computer Vision,2005,61(1):55-79
  • 10Crandall D,Felzenszwalb P,Huttenlocher D.Spatial priors for part-based recognition using statistical models//Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).San Diego,California,USA,2005,1:10-17

共引文献4

同被引文献17

  • 1Sivic J, Zisserman. Video Google: A Text Retrieval Approach to Object Matching in Videos [ C ]//ICCV, Vol. 2, 2003 : 1470 - 1477.
  • 2Philbin J, Chum O, Isard M, et al. Lost in Quantization: Improving Particular Object Retrieval in Large Scale Image Databases [ C ]// CVPR, 2008.
  • 3Nisiar D, Stew6nius H. Scalable Recognition with a Vocabulary Tree [C]//CVPR, Vol. 2, 2006:2161-2168.
  • 4Sehindler G, Brown M, Szeliski R. City-Scale Location Recognition [ C]//CVPR, 2007.
  • 5Liefu Ai, Junqlng Yu, Tao Guan. Spherical Soft Assignment: Impro- ving Image Representation in Content-Based Image Retrieval [ C ]// 13^th Pacific-Rim Conference on Multimedia, 2012:801 -810.
  • 6Herve Jegou, Matthijs Douze, Cordelia Sehmid, et al. Improving Bag- of-Features for Large Scale Image Search [ J]. International Journal of Computer Vision, 2010,87 (3) : 316 - 336.
  • 7Phil.hi.n J, Chum O, Lsard M, et al. Object Retrieval with Large Vo- cabularies and Fast Spatial Matching[ C ]//CVPR, 2007:1 -8.
  • 8Mikolajezyk K, Leibe B, Schiele B. Multiple Object Class Detection with a Generative Model[C]//CVPR, Vol. 1,2006:26-36.
  • 9Lowe D G. Distinctive Image Features from Scale-Invariant Keypoints [J]. IJCV, 2004, 60(2):91-110.
  • 10Gionis A, Indyky P, Motwaniz R. Similarity Search in High Dimen- sions via Hashing[ C]//VLDB, 1999, 518-529.

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部