期刊文献+

基于距离约束稀疏/组稀疏编码的自动图像标注 被引量:4

Distance Constraint Sparse/Group Sparse Coding for Automatic Image Labeling
下载PDF
导出
摘要 为解决图像自动标注中的语义鸿沟问题,有效选择并利用图像特征,提出基于距离约束稀疏/组稀疏编码(distance constraint sparse/group sparse coding,DCSC/DCGSC)的2种特征选择算法,并分别应用到图像自动标注任务中。考虑到不同特征基相似性对图像语义相似性的贡献不同,定义了度量二者相关性的距离约束正则项。将该正则项分别集成到稀疏/组稀疏编码的特征选择模型中,使选择的特征在保证稀疏性/组稀疏性的同时,优先选择与语义相似性描述最接近的视觉特征基。利用在训练图像集中学习的特征权值,寻找测试图像的K最近邻(Knearest neighbor,KNN)图像,并通过标签转移实现图像标注。在Corel5K图像库上测试标注性能,集成多特征的DCGSC查准率、查全率和标注正确的关键词个数可达32%、34%和151,优于其他相关标注算法。而对于单特征图像,使用DCSC也能改善标注性能。可见,距离约束对特征选择和图像标注是有效的。 In order to bridge the semantic gap in automatic image labeling, and effectively leverage image features, two feature selection algorithms based on distance constraint sparse/group sparse coding (DCSC/DCGSC) were presented to solve the problem of image se- mantic labeling. Considering that feature atoms similarity may have different contribution to the serhantie similarity between images, a distance constraint regularization was defined and integrated with sparse/group sparse coding for feature selection, which encourages the feature atoms with sparsity/group sparsity and more Similar to the semantic discrimination to be enforced. Given a test image, the K-Nea- rest Neighbors (KNN) can be found using the learned feature weights from the training images and labels can be transfered. Experimental results on Corel5K showed that DCGSC outperforms other related method with the average precision of 32% , average recall of 34% , and the numbers of total labels recalled of 151. For images represented with single type of feature, DCSC also helps to improve the annotation performance, which validates the effectivity of distance constraint for image labeling.
出处 《四川大学学报(工程科学版)》 EI CAS CSCD 北大核心 2016年第5期78-83,共6页 Journal of Sichuan University (Engineering Science Edition)
基金 国家自然科学基金资助项目(61371143) 北京市自然科学基金资助项目(4132026) 北京市教委科研计划面上项目资助(KM201410009006)
关键词 自动图像标注 距离约束 稀疏编码 组稀疏编码 特征选择 K最近邻 automatic image labeling distance constraint sparse coding group sparse coding feature selection K-nearest neighbor
  • 相关文献

参考文献18

  • 1Chang E, Goh K, Sychay G, et al. CBSA: Content-based soft annotation for multimodal image retrieval using hayes point machines [ J]. IEEE Transactions on Circuits and Systems for Video Technology,2003,13 ( 1 ) :26 - 38.
  • 2Cusano C, Ciocca G, Schettini R. Image annotation using SVM[ C]//International Society for Optics and Photon- ics. San Jose : SPIE ,2003:330 - 338.
  • 3Carneiro G, Chan A B, MorenoP J,et al. Supervised learn- ing of semantic classes for image annotation and retrieval [ J]. IEEE Transactions on Pattern Analysis and Machine Intelligence ,2007,29 (3) :394 - 410.
  • 4Duygulu P,Barnard K,Freitas J F G, et al. Object recog- nition as machine translation: Laming a lexicon for a fixed image vocabulary[C]//Proceedings of the European Conference on Computer Vision--Part IV. Berlin: Spring- er-Verlag, 2002 : 97 - 112.
  • 5Feng S L, Manmatha R, Lavrenko V. Multiple bernoulli relevance models for image and video annotation [ C ]/! Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Piscataway: IEEE ,2004 : 1002 - 1009.
  • 6Carneiro G, Chan A B, Moreno P J, et al. Supervised learning of semantic classes for image annotation and re- trieval[ J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007,29 ( 3 ) : 394 - 410.
  • 7. Makadia A,Pavlovic V,Kumar S. A new baseline for im- age annotation[C]//Proceedings of the European Confer- ence on Computer Vision. Berlin: Springer, 2008, 5304 (3) :316 -329.
  • 8Guillaumin M,Mensink M,Verbeek J,et al. Tagprop- Dis- criminative metric learning in nearest neighbor models for image auto-annotation[ C]//Proceedings of the IEEE In- ternational Conference on Computer Vision. New York: IEEE ,2009:309 - 316.
  • 9Wang Mei ,Li Feng,Wang Meng. Collaborative visual mod- eling for automatic image annotatto "via sparse model coding[ J]. Nerocomputing,2012,95 (14) :22 - 28.
  • 10Zhang Wenbo, Tian Dongpng, Hu Hong, et al. Automatic image annotation via local sparse coding[ C]//Proceed- ings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. New York: IEEE, 2013: 1661 - 1665.

共引文献4

同被引文献21

引证文献4

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部