一种视觉词软直方图的图像表示方法被引量：2

Visual Word Soft-Histogram for Image Representation

下载PDF

导出

摘要基于视觉词的统计建模和判别学习,提出一种视觉词软直方图的图像表示方法.假设属于同一视觉词的图像局部特征服从高斯混合分布,利用最大-最小后验伪概率判别学习方法从样本中估计该分布,计算局部特征与视觉词的相似度.累加图像中每个视觉词与对应局部特征的相似度,在全部视觉词集合上进行结果的归一化,得到图像的视觉词软直方图.讨论了两种具体实现方法:一种是基于分类的软直方图方法,该方法根据相似度最大原则建立局部特征与视觉词的对应关系;另一种是完全软直方图方法,该方法将每个局部特征匹配到所有视觉词.在数据库Caltech-4和PASCAL VOC 2006上的实验结果表明,该方法是有效的. This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models （GMM） to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.

作者王彦杰刘峡壁贾云得

机构地区北京理工大学计算机学院智能信息技术北京市重点实验室 [

出处《软件学报》 EI CSCD 北大核心 2012年第7期1787-1795,共9页 Journal of Software

基金国家自然科学基金(60973059 90920009)

关键词视觉词软直方图图像表示高斯混合模型判别学习 visual word soft-histogram image representation Gaussian mixture model discriminative learning

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1韩东峰,李文辉,郭武.基于潜在局部区域空间关系学习的物体分类算法[J].计算机学报,2007,30(8):1286-1294. 被引量：5

二级参考文献16

1Csurka G,Dance C,Fan L,Willamowski J,Bray C.Visual categorization with bags of keypoints//Proceedings of the 2004 ECCV International Workshop on Statistical Learning in Computer Vision.Prague,Czech Republic,2004:59-74
2Sivic J,Russell B,Efros A,Zisserman A,Freeman W.Discovering objects and their localization in images//Proceedings of the 10th International Conference on Computer Vision(ICCV'05).Beijing,China,2005,1:370-377
3Winn J,Criminisi A,Minks T.Object categorization by learned universal visual dictionary//Proceedings of the 10th International Conference on Computer Vision (ICCV' 05).Beijing,China,2005,2:1800-1807
4Li Fei-Fei,Fergus R,Perona P.One-shot learning of object categories.IEEE Transactions on Pattern Analysis and Machine Intelligence,2006,28(4):594-611
5Burl M,Weber M,Perona P.A probabilistic approach to object recognition using local photometry and global geometry//Proceedings of the 5th European Conference on Computer Vision.Freiburg,Germany,1998,2:628-641
6Weber M,Welling M,Perona P.Unsupervised learning of models for recognition//Proceedings of the 6th European Conference on Computer Vision.Dublin,Ireland,2000,1:18-32
7Fergus R,Perona P,Zisserman A.Object class recognition by unsupervised scale-invariant learning//Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'03).Madison,Wisconsin,USA,2003,2:264-271
8Agarwal S,Roth D.Learning a sparse representation for object detection//Proceedings of the 7th European Conference on Computer Vision.Copenhagen,Denmark,2002,4:113-130
9Felzenszwalb P,Huttenlocher D.Pictorial structures for object recognition.International Journal of Computer Vision,2005,61(1):55-79
10Crandall D,Felzenszwalb P,Huttenlocher D.Spatial priors for part-based recognition using statistical models//Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).San Diego,California,USA,2005,1:10-17

共引文献4

1韩东峰,朱志良,李文辉.图像分类的随机半监督采样方法[J].计算机辅助设计与图形学学报,2009,21(9):1333-1338. 被引量：3
2陈慧中,陈永光,景宁,陈荦,王钧.基于显著区域的月球影像内容特征研究[J].电子学报,2012,40(5):911-919. 被引量：3
3赵仲秋,季海峰,高隽,胡东辉,吴信东.基于稀疏编码多尺度空间潜在语义分析的图像分类[J].计算机学报,2014,37(6):1251-1260. 被引量：25
4李伟生,陈曦.一种结合显著性检测与词袋模型的目标识别方法[J].计算机工程与科学,2017,39(9):1706-1713. 被引量：1

同被引文献17

1Sivic J, Zisserman. Video Google: A Text Retrieval Approach to Object Matching in Videos [ C ]//ICCV, Vol. 2, 2003 : 1470 - 1477.
2Philbin J, Chum O, Isard M, et al. Lost in Quantization: Improving Particular Object Retrieval in Large Scale Image Databases [ C ]// CVPR, 2008.
3Nisiar D, Stew6nius H. Scalable Recognition with a Vocabulary Tree [C]//CVPR, Vol. 2, 2006:2161-2168.
4Sehindler G, Brown M, Szeliski R. City-Scale Location Recognition [ C]//CVPR, 2007.
5Liefu Ai, Junqlng Yu, Tao Guan. Spherical Soft Assignment: Impro- ving Image Representation in Content-Based Image Retrieval [ C ]// 13^th Pacific-Rim Conference on Multimedia, 2012:801 -810.
6Herve Jegou, Matthijs Douze, Cordelia Sehmid, et al. Improving Bag- of-Features for Large Scale Image Search [ J]. International Journal of Computer Vision, 2010,87 (3) : 316 - 336.
7Phil.hi.n J, Chum O, Lsard M, et al. Object Retrieval with Large Vo- cabularies and Fast Spatial Matching[ C ]//CVPR, 2007:1 -8.
8Mikolajezyk K, Leibe B, Schiele B. Multiple Object Class Detection with a Generative Model[C]//CVPR, Vol. 1,2006:26-36.
9Lowe D G. Distinctive Image Features from Scale-Invariant Keypoints [J]. IJCV, 2004, 60(2):91-110.
10Gionis A, Indyky P, Motwaniz R. Similarity Search in High Dimen- sions via Hashing[ C]//VLDB, 1999, 518-529.

引证文献2

1Jian CAO,Dian-hui MAO,Qiang CAI,Hai-sheng LI,Jun-ping DU.A review of object representation based on local features[J].Journal of Zhejiang University-Science C(Computers and Electronics),2013,14(7):495-504. 被引量：4
2赵嵩,焦阳,曹海旺,杨恒.随机维哈希量化视词字典的目标检索方法[J].计算机应用与软件,2015,32(9):149-151. 被引量：1

二级引证文献5

1蔡强,刘亚奇,曹健,毛典辉,牛群.图像目标类别检测综述[J].计算机科学与探索,2015,9(3):257-265. 被引量：13
2祝晓斌,刘亚奇,蔡强,曹健.基于内容的图像检索技术研究[J].计算机仿真,2015,32(5):1-4. 被引量：20
3曹健,魏星,李海生,蔡强.基于局部特征的图像分类方法[J].电子科技大学学报,2017,46(1):69-74. 被引量：8
4韩冰,肖红章,娄亮杰,孔宪刚.基于C/S架构检测实验室样品管理系统的设计与实现[J].实验室科学,2018,21(6):57-60. 被引量：5
5许明英,杜军平,梁美玉,薛哲,李昂.面向科技大数据的科研团队精准立体画像生成方法[J].工程管理科技前沿,2022,41(3):15-22. 被引量：3

1王彦博.文档加图像互转均无忧[J].软件指南,2010(12):21-23.
2沈明峰.帮你的衣服添字加图——Photoshop制作实例[J].计算机应用文摘,2003(11):30-30.
3薛辉,高倩倩,罗欣.基于区域合并的图像分割方法[J].环球人文地理,2014,0(4X):27-27.
4韩纪庆.基于最小分类错误准则的判别学习方法[J].电子工程师,2001,27(2):1-3. 被引量：2
5高妍方,王继伟.贝叶斯网络生成学习和判别学习对比研究[J].山东建筑大学学报,2013,28(4):328-334.
6许宪东,王亚东,运海红.统计学习在图像分类中的应用研究综述[J].黑龙江科技信息,2012(32):108-108. 被引量：1
7韩纪庆,高文,张磊,王承发.基于MCE/GPD的语音识别及其一种Robust应用中初始参数的选择[J].高技术通讯,2000,10(7):41-44. 被引量：3
8赵桂儒,李卫东,刘典婷,吴敏,崔满丰.EM算法的改进及其在行为识别中的应用[J].电视技术,2014,38(13):196-199. 被引量：3
9李亚克,田青,高航.结合类标签关联度的有序核判别回归学习[J].数据采集与处理,2016,31(3):532-540. 被引量：2
10张炜,刘伟,普杰信.一种基于SIFT和区域选择的图像拼接方法[J].微电子学与计算机,2010,27(6):205-207. 被引量：8

软件学报

2012年第7期

浏览历史

内容加载中请稍等...

一种视觉词软直方图的图像表示方法被引量：2

参考文献1

二级参考文献16

共引文献4

同被引文献17

引证文献2

二级引证文献5

相关作者

相关机构

相关主题

浏览历史

一种视觉词软直方图的图像表示方法 被引量：2

参考文献1

二级参考文献16

共引文献4

同被引文献17

引证文献2

二级引证文献5

相关作者

相关机构

相关主题

浏览历史

一种视觉词软直方图的图像表示方法被引量：2