Image Annotation Method Based on Improved BoVW Model (cited by: 5)

Abstract: To address the traditional Bag of Visual Words (BoVW) model's sensitivity to changes in image scale, this paper proposes an image annotation method based on an improved BoVW model. The method incorporates multi-scale spatial information: it transforms each image into multiple scale spaces and constructs a visual vocabulary for each scale, so that an image is represented as a family of feature histograms at different scales. Multiple kernel learning is then used to optimize the weights of the per-scale histograms, yielding a more discriminative feature representation. Experimental results support the effectiveness of the method: its annotation accuracy improves on the traditional BoVW model by 17.8% to 25.7%.
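The pipeline summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: local descriptors are taken as already extracted at each scale, the per-scale vocabularies are given rather than learned, the base kernel is histogram intersection, and the scale weights are fixed by hand, whereas the paper obtains them through multiple kernel learning. All function names and toy values are hypothetical.

```python
from math import dist


def bovw_histogram(descriptors, vocabulary):
    """L1-normalized histogram of nearest-visual-word assignments."""
    counts = [0] * len(vocabulary)
    for d in descriptors:
        # Hard assignment: each descriptor votes for its closest visual word.
        nearest = min(range(len(vocabulary)), key=lambda k: dist(d, vocabulary[k]))
        counts[nearest] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(counts)


def intersection_kernel(h1, h2):
    """Histogram intersection similarity (an assumed base kernel)."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def combined_kernel(hists_a, hists_b, weights):
    """Weighted sum of per-scale kernels, the form that MKL optimizes."""
    return sum(
        w * intersection_kernel(ha, hb)
        for w, ha, hb in zip(weights, hists_a, hists_b)
    )


# Toy data: two scales, each with its own two-word vocabulary (hypothetical).
vocabularies = [
    [(0.0, 0.0), (1.0, 1.0)],  # scale 0
    [(0.0, 0.0), (2.0, 2.0)],  # scale 1
]
image_a = [
    [(0.1, 0.0), (0.9, 1.0), (1.0, 1.1), (0.8, 0.9)],  # descriptors at scale 0
    [(0.0, 0.2), (1.9, 2.0)],                          # descriptors at scale 1
]
image_b = [
    [(0.0, 0.1), (0.1, 0.1)],
    [(2.0, 2.1), (2.1, 1.9), (0.1, 0.0), (1.8, 2.0)],
]

hists_a = [bovw_histogram(d, v) for d, v in zip(image_a, vocabularies)]
hists_b = [bovw_histogram(d, v) for d, v in zip(image_b, vocabularies)]

# Hand-picked weights standing in for the MKL-learned ones.
scale_weights = [0.5, 0.5]
similarity = combined_kernel(hists_a, hists_b, scale_weights)
print(hists_a, hists_b, similarity)
```

In a full system the combined kernel matrix over all training images would be passed to a kernel classifier such as an SVM, and the scale weights would be learned jointly with it instead of being fixed.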

Authors: Huo Hua (霍华), Zhao Gang (赵刚)

Affiliation: College of Electronic Information Engineering, Henan University of Science and Technology

Source: Computer Engineering (《计算机工程》), indexed in CAS and CSCD, 2012, Issue 22, pp. 276-278, 282 (4 pages)

Funding: National Natural Science Foundation of China (60743008); Henan Province International Science and Technology Cooperation Program (104300510063)

Keywords: image annotation; Bag of Visual Words (BoVW) model; multiple scale space; multiple scale visual word; multiple kernel learning; weight optimization

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
