基于视觉单词共生矩阵的图像分类方法被引量：2

Image Categorization Approach Based on Visual Words Co-Occurrence Matrix

下载PDF

导出

摘要针对传统的视觉词袋(bag of visual words,BoVW)模型忽略了视觉单词的空间位置信息的问题,文章提出一种基于视觉单词共生矩阵的图像分类方法。首先对整幅图像进行空间金字塔分解,得到一系列图像块;然后针对每一图像块中的SIFT点,在其空间邻域范围内构建视觉单词共生矩阵(visual words co-occurrence matrix,VWCM)单元,并得到该图像块对应的视觉单词共生矩阵;最后设计出一种新的空间金字塔共生矩阵核(spatial pyramid co-occurrence matrix kernel,SPCMK),并将其用于图像分类。该方法能够有效地刻画视觉单词的绝对和相对位置信息,极大地增强了图像表达的完整度与准确度。实验结果表明,文章方法确实能够大幅度提高图像分类的准确率。 Considering the absence of spatial location information of visual words in the conventional Bag-of-Visual-Words model, this paper presents a novel image categorization approach based on visual words co-occurrence matrix. Firstly, a sequence of image regions is gained using spatial pyramid to partition the whole image. Then, for each SIFT located in the image region, a Visual Words Co- occurrence Matrix （VWCM） unit is constructed based on the spatial context region of this SIFT point, resulting in the VWCM for the image region. Finally, a new matching kernel termed Spatial Pyramid Co-occurrence Matrix Kernel （SPCMK） is designed and used for image categorization. Capturing both the absolute and relative spatial location information of visual words effectively, this new approach vastly enhances the completeness and correctness to understand images. Experimental results indicate that the proposed approach achieves higher classification rates.

作者朱道广李弼程蒋敏刘钦安

机构地区信息工程大学东华大学外语学院河南省军区司令部

出处《信息工程大学学报》 2013年第4期439-446,共8页 Journal of Information Engineering University

基金国家自然科学基金资助项目(60872142)

关键词图像分类视觉单词空间金字塔视觉单词共生矩阵空间金字塔共生矩阵核 image categorization visual words spatial pyramid visual words co-occurrence matrix spatial pyramid co-occurrence matrix kernel

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献17

1Sivic J, Zisserman A. Video Google: a text retrieval approach to object matching in videos[ C ]//Proceedings of 9th IEEE In- ternational Conference on Computer Vision. 2003 : 1470-1477.
2Kersorn K. An Enhanced bag-of-visual word vector space model to represent visual content in athletics images [ J]. IEEE Transactions on Multimedia, 2012, 14 ( 1 ) : 211-222.
3Lazebnik S, Schmid C, Ponce J. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories [ C ]// Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2006:2169-2178.
4Grauman K, Darrel T. The pyramid match kernel: discriminative classification with sets of image features[ C 1// Proceedings of the 10th IEEE International Conference on Computer Vision. 2005 : 1458-1465.
5Qin J, Yung N H C. Scene categorization via contextual visual words[ J]. Pattern Recognition, 2010, 43 (5) : 1874-1888.
6Shotton J, Johnson M, Cipolla R. Semantic texton forests for image categorization and segmentation [ C ]// Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2008 : 1-8.
7Sharma G, Jurie F. Learning discriminative spatial representation for image classification [ C ]// Proceedings of the 22nd British Machine Vision Conference. 2011: 1-11.
8Zhou Ge, Wang Zhiyong, Wang Jiajun, et al. Spatial context for visual vocabulary construction[ C ]// Proceedings of the In- ternational Conference on Image Analysis and Signal Processing. 2010: 176-181.
9Tang Wenbin, Cai Rui, Li Zhiwei, et al. Contextual synonym dictionary for visual object retrieval[ C ]// Proceedings of the 19th ACM International Conference on Multimedia. 2011. 503-512.
10Wang Xiaoyu, Yang Ming, Cour T, et al. Contextual weighting for vocabulary tree based image retrieval[ C ]//Proceedings of 13th IEEE International Conference on Computer Vision. 2011: 209-216.

同被引文献8

1王宇新,郭禾,何昌钦,冯振,贾棋.用于图像场景分类的空间视觉词袋模型[J].计算机科学,2011,38(8):265-268. 被引量：17
2张琳波,王春恒,肖柏华,邵允学.基于Bag-of-phrases的图像表示方法[J].自动化学报,2012,38(1):46-54. 被引量：25
3范新南,丁朋华,刘俊定,郑庆元.基于序列图像的人体跟踪算法研究综述[J].计算机工程与设计,2012,33(1):278-281. 被引量：5
4王孟月,宋彦,戴礼荣.一种用于图像分类的多视觉短语学习方法[J].小型微型计算机系统,2012,33(2):298-302. 被引量：5
5温向兵,满君丰,李倩倩,李长云.视频监控中针对拥挤人群的人体分割与跟踪[J].小型微型计算机系统,2012,33(4):891-895. 被引量：7
6张晓伟,刘弘,孙玉灵.一种多特征自适应融合的球员跟踪算法[J].计算机工程,2012,38(17):214-217. 被引量：4
7张宇.一种图像确认目标的多目标跟踪方法[J].电子测量与仪器学报,2014,28(6):617-624. 被引量：11
8闵军,孟朝晖.基于SIFT的足球运动员检测和跟踪[J].信息技术,2016,40(5):195-198. 被引量：1

引证文献2

1生海迪,段会川,孔超.基于语义短语的空间金字塔词袋模型图像分类方法[J].小型微型计算机系统,2015,36(4):877-881. 被引量：8
2秦海玉,廖志武.基于比赛环境特征的多目标运动员跟踪方法[J].计算机工程与设计,2017,38(11):3173-3178. 被引量：4

二级引证文献12

1彭天强,栗芳.哈希编码结合空间金字塔的图像分类[J].中国图象图形学报,2016,21(9):1138-1146. 被引量：8
2彭天强,栗芳.基于二进制哈希与空间金字塔的视觉词袋模型生成方法[J].计算机工程,2016,42(12):164-170. 被引量：1
3刘尚旺,胡剑兰,崔艳萌.改进HFT模型及其在图像分类中的应用[J].小型微型计算机系统,2017,38(5):1111-1115. 被引量：1
4万源,史莹,陈晓丽.非负局部Laplacian稀疏编码和上下文信息的图像分类[J].中国图象图形学报,2017,22(6):731-740. 被引量：3
5田广强,张岐山.邻居匹配与局部约束线性编码的图像分类方法[J].计算机工程与设计,2017,38(8):2217-2221. 被引量：2
6朱杰,吴树芳,谢博鋆,马丽艳.基于颜色的压缩层次图像表示方法[J].计算机应用,2017,37(11):3238-3243.
7张睿萍,马宗梅.基于Hadoop平台的大数据图像分类机制[J].吉林大学学报（理学版）,2018,56(5):1206-1212. 被引量：7
8刘帅.利用多特征融合的运动员人体姿势识别算法[J].信息技术,2019,43(8):17-19. 被引量：3
9张泽晨,巨志勇.基于BoF模型的多特征融合果蔬图像分类方法[J].电子科技,2020,33(7):41-45. 被引量：3
10牛程程,鲁大营,郑亚淼.基于堆叠式长短期记忆网络的篮球运动员微动作评价方法[J].湖南科技大学学报（自然科学版）,2022,37(2):95-103. 被引量：2

1Fu-xiang LU,Jun HUANG.Beyond bag of latent topics: spatial pyramid matching for scene category recognition[J].Frontiers of Information Technology & Electronic Engineering,2015,16(10):817-828. 被引量：2
2郭素敏.基于视觉的仿人机器人运动规划研究[J].科学中国人,2016(1X):2-3.
3罗立宏.基于图像表达复杂物体的方法研究[J].电脑开发与应用,2007,20(6):33-34.
4赵鑫,黄凯奇,谭铁牛.基于可变性分析的紧致图像表达[J].中国科学技术大学学报,2014,44(2):128-137.
5许洪飞,杨红梅.彩色数字图像水印算法抵抗几何攻击[J].河南科技,2013,32(12):2-3.
6许诺,王璐,金程,皮赛男.W-系统在多聚焦图像融合中的应用[J].软件导刊,2016,15(5):193-196.
7王万同,韩志刚,刘鹏飞.基于SIFT点特征和Canny边缘特征匹配的多源遥感影像配准研究[J].计算机科学,2011,38(7):287-289. 被引量：11
8赵骞,李敏,赵晓杰,陈雪勇.基于感受野学习的特征词袋模型简化算法[J].智能系统学报,2016,11(5):663-669.
9崔丽群.BP网络模型的优化及仿真[J].电脑知识与技术,2009,5(7):5263-5264.
10崔丽群,刘万军,包明宇.BP网络模型的联合优化[J].辽宁工程技术大学学报（自然科学版）,2004,23(z1):81-82.

信息工程大学学报

2013年第4期

浏览历史

内容加载中请稍等...

基于视觉单词共生矩阵的图像分类方法被引量：2

参考文献17

同被引文献8

引证文献2

二级引证文献12

相关作者

相关机构

相关主题

浏览历史

基于视觉单词共生矩阵的图像分类方法 被引量：2

参考文献17

同被引文献8

引证文献2

二级引证文献12

相关作者

相关机构

相关主题

浏览历史

基于视觉单词共生矩阵的图像分类方法被引量：2