基于视觉字典容量自动获取的LDA场景分类研究

Research on Scene Classification of LDA Automatically Obtained by Visual Dictionary Capacity

下载PDF

导出

摘要提出了一种高效获取词包模型中视觉字典容量的方法,并研究了该方法与隐狄利克雷分配模型(Latent Dirichlet Allocation,LDA)相结合情况下的场景分类性能.在用SIFT特征构建场景图像数据集特征矩阵的基础上,首先采用吸引子传播方法获取场景图像集特征矩阵的合理聚类数目族,并将其中的最小聚类数目作为视觉字典容量,进而生成视觉字典;然后利用所构建视觉字典中的单词描述场景图像训练集和测试集;最后采用LDA模型对场景图像测试集进行场景分类实验.实验结果表明,提出的方法不仅保持了较高场景分类准确率,同时显著提高了场景分类的效率. An approach is proposed to obtain the dictionary capacity of bag of words（BoW） model efficiently, which is combined with The Latent Dirichlet Allocation （LDA） model to analyze the performance of scene category. Based on the feature matrix of scene image data sets constructed by SIFT feature, the affinity propagation method is firstly employed to obtain the clustering numbers, and to take the minimal clustering number as the visual dictionary capacity before generating a visual dictionary. Secondly, the scene training and testing sets are described by these visual words. Finally, the LDA model is employed to classify the testing data set. The experiments show that the proposed approach in this paper maintains higher accuracy of scene classification and can improve efficiency greatly.

作者张艺钟映春陈俊彬

机构地区广东工业大学自动化学院

出处《广东工业大学学报》 CAS 2015年第4期150-154,共5页 Journal of Guangdong University of Technology

基金广东省科技计划项目(2010A030500006)

关键词词包模型视觉单词视觉字典隐狄利克雷分配模型 bag of words visual words visual dictionary LDA model

分类号 TP311.11 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献16

1谢昭,高隽.基于高斯统计模型的场景分类及约束机制新方法[J].电子学报,2009,37(4):733-738. 被引量：11
2王中锋,王志海,解文杰.基于树型贝叶斯网络的场景分类引擎训练算法[J].仪器仪表学报,2012,33(4):863-869. 被引量：4
3唐颖军,须德,解文杰,薄一航.一种基于类主题空间的图像场景分类方法[J].中国图象图形学报,2010,15(7):1067-1073. 被引量：14
4刘硕研,须德,冯松鹤,刘镝,裘正定.一种基于上下文语义信息的图像块视觉单词生成算法[J].电子学报,2010,38(5):1156-1161. 被引量：41
5Lowe D G. Distinctive image features from scale-invariant keypoints [ J ]. International Journal of Computer Vision, 2004,60(2) :91-110.
6刘洪伟,石雅强,梁周扬,肖岳.面向聚类挖掘的局部旋转扰动隐私保护算法[J].广东工业大学学报,2012,29(3):28-34. 被引量：7
7Blei D, Ng A, Jordan M. Latent dirichlet allocation [ J ]. Journal of Machine Learning Research,2003,3:993-1022.
8Heinrich G. Parameter estimation for text analysis [ EB! OL]. [ 2012-07-25 ]. http://www, arbylon, net/publica- tions/text-est, pdf.
9Rasiwasia N, Vasconcelos N. Latent dirichlet allocation models for image classification [ J ]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013,35 ( 11 ) : 2665 -2679.
10Corina V , Inge G , Mihai D. Latent dirichlet allocation for spatial analysis of satellite images [ J]. Geoscience and Re- mote Sensing, 2013,51 ( 5 ) : 2770-2786.

二级参考文献101

1李广彪,张剑云,毛云祥.盲源分离的发展及研究现状[J].航天电子对抗,2004,33(6):13-16. 被引量：5
2A Oliva, A Torralba. Modeling the shape of the scene: a holistic representation of the spatial envelope [ J]. International Journal of Computer Vision,2001,42(3) :145- 175.
3A Oliva, A Torralba. Building the gist of a scene: the role of global image features in recognition[ J] .Progress in Brain Research,2006, 155:23 - 36.
4T Serre, A Oliva, T A Poggio. A feedforward architecture accounts for rapid categorization[ A ]. In Proceedings of the National Academy of Science [ C ]. New York: PNAS, 2007. 6424 - 6429.
5A Vailaya, M Figueiredo, A Jain, H Zhang. Image classification for content-based indexing [ J ].IEEE. Transactions on Image Processing, 2001,10(1) : 117 - 130.
6E B Sudderth,A Torralba,W T Freeman,A S Willsky. Learning hierarchical models of scenes,objects and parts[A] .In Proceedings of Tenth IEEE International Conference on Vision[C]. USA: IEEE., 2005.1331 - 1338.
7L F Fei, P Perona. A bayesian hierarchical model for learning natural scene categodes[ A]. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition[ C]. USA: IEEE., 2001,2.524 - 531.
8T Serre, L Wolf, T Poggio. Object recognition with features inspired by visual cortex [ A ]. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition[ C]. USA: IEEE, 2005,2.994 - 1000.
9Y W Teh. Bethe free energy and contrastive divergence approximations for undirected graphical models[ D]. Toronto, Canada: University of Toronto, 2003,7.
10Y W Teh,M Welling. The unified propagation and scaling algorithm[ J]. Advances in Neural Information Processing Systems,2001,14(2) :953 - 960.

共引文献85

1高隽,谢昭,张骏,吴克伟.图像语义分析与理解综述[J].模式识别与人工智能,2010,23(2):191-202. 被引量：20
2胡正平,戎怡.基于EILBP视觉描述子结合PLSA的场景分类算法[J].光电工程,2010,37(11):128-134. 被引量：2
3胡正平,戎怡.基于EICS-LBP与统计边缘主色对的场景分类算法[J].系统工程与电子技术,2011,33(4):919-924.
4唐颖军.类别约束下自适应主题建模的图像场景分类[J].小型微型计算机系统,2011,32(5):958-963. 被引量：2
5高常鑫,桑农.整合局部特征和滤波器特征的空间金字塔匹配模型[J].电子学报,2011,39(9):2034-2038. 被引量：9
6胡正平,涂潇蕾.多方向上下文特征结合空间金字塔模型的场景分类[J].信号处理,2011,27(10):1536-1542. 被引量：5
7赵旭东,刘鹏,刘家锋,唐降龙.一种图像序列平稳性和相关性检验的天气场景分类方法[J].计算机研究与发展,2011,48(11):1973-1982. 被引量：2
8亓晓振,王庆.一种基于稀疏编码的多核学习图像分类方法[J].电子学报,2012,40(4):773-779. 被引量：31
9张素兰,郭平,张继福,胡立华.图像语义自动标注及其粒度分析方法[J].自动化学报,2012,38(5):688-697. 被引量：20
10刁蒙蒙,张菁,卓力,隋磊.一种基于视觉单词的图像检索方法[J].测控技术,2012,31(5):17-20. 被引量：1

1程全,樊宇,刘玉春,程朋.特征显著性的车辆目标检测算法[J].河南科技大学学报（自然科学版）,2017,38(1):48-51. 被引量：3
2吴京辉,唐林波,赵保军,邓宸伟,李嘉桐.基于视觉字典的在线多示例目标跟踪[J].系统工程与电子技术,2015,37(2):428-435. 被引量：2
3钟映春,谭志,孙伟,连伟烯.视觉字典合理容量的自动获取研究[J].计算机工程与设计,2014,35(9):3279-3283.
4崔大成,曾连荪.基于视觉字典的移动机器人闭环检测方法研究[J].微型机与应用,2015,34(9):85-88. 被引量：4
5李明骏.美光25nm固态硬盘出击个人电脑与工业应用市场[J].集成电路应用,2011(3):24-24.
6何思霖.浅论互联网与传统行业的结合[J].科技创新与应用,2016,6(3):1-1. 被引量：4
7何率天,达飞鹏,尤伟.时滞控制理论的研究进展[J].南京理工大学学报（社会科学版）,2005,18(S1):138-142.
8魏剑啸.电力系统自动化和计算机技术的结合应用[J].电子技术与软件工程,2015(16):170-170. 被引量：4
9刘博元,范文慧,肖田元.决策支持系统研究现状分析[J].系统仿真学报,2011,23(B07):241-244. 被引量：35
10李庆江.基于Android系统的团队协作管理系统设计与实现[J].山东工业技术,2013(9):32-33.

广东工业大学学报

2015年第4期

浏览历史

内容加载中请稍等...

基于视觉字典容量自动获取的LDA场景分类研究

参考文献16

二级参考文献101

共引文献85

相关作者

相关机构

相关主题

浏览历史