基于随机化视觉词典组和上下文语义信息的目标检索方法被引量：5

Object Retrieval Method Based on Randomized Visual Dictionaries and Contextual Semantic Information

下载PDF

导出

摘要传统的视觉词典法(Bag ofVisual Words,BoVW)具有时间效率低、内存消耗大以及视觉单词同义性和歧义性的问题,且当目标区域所包含的信息不能正确或不足以表达用户检索意图时就得不到理想的检索结果.针对这些问题,本文提出了基于随机化视觉词典组和上下文语义信息的目标检索方法.首先,该方法采用精确欧氏位置敏感哈希(Exact Euclidean Locality Sensitive Hashing,E2LSH)对局部特征点进行聚类,生成一组支持动态扩充的随机化视觉词典组;然后,利用查询目标及其周围的视觉单元构造包含上下文语义信息的目标模型;最后,引入K-L散度(Kullback-Leibler divergence)进行相似性度量完成目标检索.实验结果表明,新方法较好地提高了目标对象的可区分性,有效地提高了检索性能. There are several problems existing in the conventional bag of visual words methods,such as low time efficiency and large memory consumption, the synonymy and polysemy of visual words, furthermore, they may fail to return satisfactory results if the object region is inaccurate or if the captured object is too small to be represented with discriminative features. An object re- trieval method based on randomized visual dictionaries and contextual semantic information is proposed for the above problems. Firstly, E2LSH （Exact Euclidean Locality Sensitive Hashing） is used, and a group of scalable random visual vocabularies is generat- ed; then, a new object model consisting of contextual semantic information is devised, which is drawn from the visual dements sur- rounding the query object; finally, the Kullback-Leibler divergence is introduced as a similarity measurement to accomplish object re- trieval. Experimental results indicate that the distinguishability of objects is effectively improved and the object retrieval performance method is substantially boosted compared with the traditional methods.

作者赵永威郭志刚李弼程高毫林陈刚

机构地区解放军信息工程大学信息系统工程学院

出处《电子学报》 EI CAS CSCD 北大核心 2012年第12期2472-2480,共9页 Acta Electronica Sinica

基金国家自然科学基金(No.60872142) 全军军事学研究生课题资助项目

关键词目标检索上下文语义信息精确欧氏位置敏感哈希随机化视觉词典组 K-L散度 object retrieval contextual semantic information exact Euclidean locality sensitive hashing randomized visual vocabularies Kullback-Leibler divergence

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献28

1Sivic J,Zisserman A. Video Google:A text retrieval approach to object matching in videos[A].Nice:IEEE Press,2003.1470-1477.
2Jurie F,Triggs B. Creating efficient codebooks for visual recognition[A].Proceedings of International Conference on Computer Vision[A].Beijing:Springer,2005.604-610.
3Nister D,Stewenius H. Scalable recognition with a vocsbulary tree[A].Proceeding of IEEE Conference on Computer Vision and Pattern Recognition[A].New York:IEEE Press,2006.2161-2168.
4Philbin J,Chum O,Isard M. Object retrieval with large vocabularies and fast spatial matching[A].Minneapolis:IEEE Press,2007.1-8.
5Cao Yang,Wang Chang-hu,Li Zhi-wei. Spatial-bag-of-features[A].San Francisco:IEEE Press,2010.3352-3359.
6Rapha(e)l Marée,Philippe Denis,Louis Wehenkel. Incremental indexing and distributed image search using shared randomized vocabularies[A].Philadelphia:ACM Press,2010.91-100.
7刘硕研,须德,冯松鹤,刘镝,裘正定.一种基于上下文语义信息的图像块视觉单词生成算法[J].电子学报,2010,38(5):1156-1161. 被引量：41
8Philbin J,Chum O,Isard M. Lost in quantization:improving particular object retrieval in large scale image databases[A].Anchorage:IEEE Press,2009.278-286.
9Van G J C,Veenman C J,Smeulders A W M. Visual word ambiguity[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2010,(32):1271-1283.
10Wang Jing-yan,Li Yong-ping,Zhang Ying. Bag-of-features based medical image retrieval via multiple assignment and visual words weighting[J].IEEE Transactions on Medical Imaging,2011,(11):1-17.

二级参考文献74

1张利彪,周春光,马铭,刘小华.基于粒子群算法求解多目标优化问题[J].计算机研究与发展,2004,41(7):1286-1291. 被引量：225
2吴洪,卢汉清,马颂德.基于内容图像检索中相关反馈技术的回顾[J].计算机学报,2005,28(12):1969-1979. 被引量：52
3于林森,张田文.基于视觉与标注相关信息的图像聚类算法[J].电子学报,2006,34(7):1265-1269. 被引量：6
4Oliva A, Tonalba A. Modeling the shape of the scene:A holistic representation of the spatial envelope[J].International Journal of Computer Vision,2001,42(3) : 145 - 175.
5Vogel J, Schiele B. Semantic modeling of natural scenes for content-based image retrieval[ J]. International Journal of Computer Vision,2007,72(2):133 - 157.
6Nowak E, Jurie F, Triggs B. Sampling strategies for bag-of-features image classification[A]. Proc of European Conference on Computer Vision (ECCV'06) [ C]. Austria: Springer, 2006.490 - 503.
7Van Gemert J, G-eusebroek J, Veenman C, Snoek C, Smeulders A. Robust scene categorization by learning image statistics in context[A]. Proc of Int. Conf. on Computer Vision and Pattern Recognition Workshop (CVPRW'06)[C]. USA. IEEE Computer Society,2006. 105 - 122.
8Fei-Fei L,Perona P.A Bayesian hierarchical model for learning natural scene categories [ A]. Proc. of IEEE Int. Conf. on Computer Vision and Pattern Reeosnition (CVPR'05) [ C]. USA: IEEE Computer Society,2005.524- 531.
9Bosch A,Zisserman A. Scene classification using a hybrid generative/discriminative approach [J].IEEE Trans on Pattern Analysis and Machine Intelligence,2008,30(4) :712 - 727.
10Jingen L, Mubarak S. Scene Modeling Using Co-Clustering [ A ]. Proc of IEEE Int. Conf on Computer Vsion ( ICCV'07) [ C ]. Brazil: IEEE Computer Society 2007.1 - 7.

共引文献95

1李生,赵铁军.Chinese Information Processing and Its Prospects[J].Journal of Computer Science & Technology,2006,21(5):838-846. 被引量：1
2王斌.一种用于形状描述的拱高半径复函数[J].电子学报,2011,39(4):831-836. 被引量：11
3杨文涛,司应硕.微粒群算法在图像检索中的应用[J].华北水利水电学院学报,2011,32(2):90-92. 被引量：1
4冯纪强,谢维信,徐晨.T-S模糊粒子群优化建模及稳定性分析[J].电子学报,2011,39(5):1150-1153. 被引量：2
5梁洪,李金,鲍佩华.基于组合特征双重加权的相关反馈算法[J].信息与电子工程,2011,9(4):491-496. 被引量：1
6张静,曲晓杰,冀中,苏育挺.基于内容的图像和视频搜索重排序技术综述[J].计算机工程与应用,2011,47(29):171-174. 被引量：7
7高常鑫,桑农.整合局部特征和滤波器特征的空间金字塔匹配模型[J].电子学报,2011,39(9):2034-2038. 被引量：9
8胡正平,涂潇蕾.多方向上下文特征结合空间金字塔模型的场景分类[J].信号处理,2011,27(10):1536-1542. 被引量：5
9鲁珂,赵继东,丁正明,吴跃.一种基于近邻保留的相关反馈图像检索算法[J].计算机科学,2012,39(1):281-284. 被引量：5
10王斌.一种不变的基于傅立叶变换的区域形状描述子[J].电子学报,2012,40(1):84-88. 被引量：13

同被引文献67

1Gao Huilin,Dou Lihua,Chen Wenjie.Image Classification with Bag-of-Words Model Based on Improved SIFT Algorithm[C]//Control Conference (ASCC).2013:1-6.
2Jiang Y,Meng J,Yuan J.Randomized visual phrases for object search[C]//Computer Vision and Pattern Recognition (CVPR).2012:3100-3107.
3Hofmann T.Probabilistic latent semantic indexing[C]//Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval.1999:50-57.
4Blei D M,Ng A Y,Jordan M I.Latent dirichlet allocation[J].Journal of machine Learning research,2003,3:993-1022.
5Wu Lei,Li Mingjing.Visual language modeling for image classification[C]//Proc.of 9th ACM SIGMM International Workshop on Multimedia Information Retrieval.2007:115-124.
6Wu Lei,Hu Yang.Scale-Invariant Visual Language Modeling for Object Categorization[C]//IEEE Transactions on multimedia.2009:286-294.
7Pham T T,Maisonnasse L,Mulhem P,et al.Visual Language Model for Scene Recognition[C]//Proceedings of the Singaporean-French Ipal Symposium.2009:76-85.
8Narayanaswamy S,Barbu S,Siskind J M.A Visual Language Model for Estimating Object Pose and Structure in a Generative Visual Domain[C]//IEEE International Conference on Robotics and Automation.2011:4854-4860.
9Li Mingjing,Ma Weiying.Visual language modeling for image classification.United States Patent.US008126274B2[P].2012.
10Katz S.Estimation of Probabilities from sparse data for the language model component of a speech recognizer[C]//IEEE Tansaction on Acoustics Speech and Signal Proeessing.1997:400-401.

引证文献5

1王挺进,赵永威,李弼程.N步长距离视觉语言模型的图像分类方法[J].信息工程大学学报,2014,15(4):453-458.
2桂振文,刘越,陈靖,王涌天,徐志伟.一种适用于智能手机的图像识别算法[J].电子学报,2014,42(8):1487-1494. 被引量：10
3王挺进,赵永威,李弼程.基于显著图加权视觉语言模型的图像分类方法[J].计算机工程,2015,41(3):204-210.
4王忠伟,陈叶芳,钱江波,陈华辉.基于LSH的高维大数据k近邻搜索算法[J].电子学报,2016,44(4):906-912. 被引量：4
5王挺进,赵永威,李弼程.一种基于自适应软分配的图像分类方法[J].太赫兹科学与电子信息学报,2015,13(1):154-159.

二级引证文献14

1黄学沛,张燕,项炬,张佳峰,汤岚钦.基于云架构的自适应聚类图像识别技术的研究与实现[J].电脑与电信,2016(5):30-32. 被引量：2
2张泳,王旭.基于图像配准的医用转运车稳定控制的研究[J].军械工程学院学报,2016,28(4):74-78.
3任民宏,鲁秋菊.基于高斯混合模型自适应肤色识别算法[J].陕西理工学院学报（自然科学版）,2016,32(6):53-56. 被引量：3
4王强.用Sift算子实现铁路扣件状态检测[J].黑龙江科技信息,2016(25):169-170.
5袁姮,王志宏,姜文涛.基于复合梯度向量的指纹匹配算法[J].电子学报,2017,45(4):912-921. 被引量：10
6李秀华,姚佳.归一化积相关Brisk图像配准算法[J].长春工业大学学报,2017,38(2):167-173. 被引量：1
7马敏燕,王梅.印刷图像网点增大在线检测系统的开发[J].包装工程,2018,39(3):22-27.
8李松,胡晏铭,郝晓红,张丽平,郝忠孝.基于维度分组降维的高维数据近似k近邻查询[J].计算机研究与发展,2021,58(3):609-623. 被引量：6
9李腾腾,蔡梦杰,高永峰,王佳东,王海滨.基于智能手机视频处理技术的智能电表故障检测[J].电气技术,2021,22(6):124-127. 被引量：1
10刘浩阳,林耀进,刘景华,吴镒潾,毛煜,李绍滋.由粗到细的分层特征选择[J].电子学报,2022,50(11):2778-2789. 被引量：4

1赵永威,李弼程,高毫林.一种基于精确欧氏位置敏感哈希的目标检索方法[J].应用科学学报,2012,30(4):349-355. 被引量：3
2赵永威,李弼程,彭天强,高毫林.一种基于随机化视觉词典组和查询扩展的目标检索方法[J].电子与信息学报,2012,34(5):1154-1161. 被引量：9
3赵永威,李弼程,彭天强,唐永旺.基于E^2LSH过滤与空间一致性度量的目标检索方法[J].四川大学学报（工程科学版）,2016,48(2):169-175.
4赵永威,周苑,李弼程.基于词典优化与空间一致性度量的目标检索[J].计算机研究与发展,2016,53(5):1043-1052. 被引量：1
5吴国栋,刘政怡,王小帅.基于OpenCV的视频目标检索[J].计算机技术与发展,2014,24(11):210-213. 被引量：1
6张云彬,张永生.基于图像纹理特征的目标快速检索[J].高技术通讯,2004,14(8):11-14. 被引量：6
7刘辛,杨素锦,杨俊.一种基于压缩Fisher向量的目标检索方法[J].火力与指挥控制,2015,40(7):37-42.
8温光玉,唐雁,吴梦蝶,黄智兴.基于图像上下文语义信息的场景分类方法[J].四川大学学报（自然科学版）,2013,50(6):1223-1229. 被引量：3
9张瑞杰,李弼程,魏福山.基于多尺度上下文语义信息的图像场景分类算法[J].电子学报,2014,42(4):646-652. 被引量：14
10赵嵩,焦阳,曹海旺,杨恒.随机维哈希量化视词字典的目标检索方法[J].计算机应用与软件,2015,32(9):149-151. 被引量：1

电子学报

2012年第12期

浏览历史

内容加载中请稍等...

基于随机化视觉词典组和上下文语义信息的目标检索方法被引量：5

参考文献28

二级参考文献74

共引文献95

同被引文献67

引证文献5

二级引证文献14

相关作者

相关机构

相关主题

浏览历史

基于随机化视觉词典组和上下文语义信息的目标检索方法 被引量：5

参考文献28

二级参考文献74

共引文献95

同被引文献67

引证文献5

二级引证文献14

相关作者

相关机构

相关主题

浏览历史

基于随机化视觉词典组和上下文语义信息的目标检索方法被引量：5