期刊文献+

Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval 被引量:2

Understanding visual-auditory correlation from heterogeneous features for cross-media retrieval
下载PDF
导出
摘要 Cross-media retrieval is an interesting research topic,which seeks to remove the barriers among different modalities.To enable cross-media retrieval,it is needed to find the correlation measures between heterogeneous low-level features and to judge the semantic similarity.This paper presents a novel approach to learn cross-media correlation between visual features and auditory features for image-audio retrieval.A semi-supervised correlation preserving mapping(SSCPM)method is described to construct the isomorphic SSCPM subspace where canonical correlations between the original visual and auditory features are further preserved.Subspace optimization algorithm is proposed to improve the local image cluster and audio cluster quality in an interactive way.A unique relevance feedback strategy is developed to update the knowledge of cross-media correlation by learning from user behaviors,so retrieval performance is enhanced in a progressive manner.Experimental results show that the performance of our approach is effective. Cross-media retrieval is an interesting research topic, which seeks to remove the barriers among different modalities. To enable cross-media retrieval, it is needed to find the correlation measures between heterogeneous low-level features and to judge the semantic similarity. This paper presents a novel approach to learn cross-media correlation between visual features and auditory features for image-audio retrieval. A semi-supervised correlation preserving mapping (SSCPM) method is described to construct the isomorphic SSCPM subspace where canonical correlations between the original visual and auditory features are further preserved. Subspace optimization algorithm is proposed to improve the local image cluster and audio cluster quality in an interactive way. A unique relevance feedback strategy is developed to update the knowledge of cross-media correlation by learning from user behaviors, so retrieval performance is enhanced in a progressive manner. Experimental results show that the performance of our approach is effective.
出处 《Journal of Zhejiang University-Science A(Applied Physics & Engineering)》 SCIE EI CAS CSCD 2008年第2期241-249,共9页 浙江大学学报(英文版)A辑(应用物理与工程)
基金 Project supported by the National Natural Science Foundation of China (Nos. 60533090 and 60773051) the Natural Science Foundation of Zhejiang Province (No. Y105395),China
关键词 HETEROGENEITY Cross-media retrieval Subspace optimization Dynamic correlation update 信息处理 多媒体技术 动力相关分析 子空间
  • 相关文献

参考文献11

  • 1Zhengyou Zhang,Zicheng Liu,Dennis Adler,Michael F. Cohen,Erik Hanson,Ying Shan.Robust and Rapid Generation of Animated Faces from Video Images: A Model-Based Modeling Approach[J].International Journal of Computer Vision.2004(2)
  • 2Chen, Z,Liu, W.Y,Zhang, F,Li, M.J,Zhang, H.J.Web mining for web image retrieval[].J Amer Soc Inf Sci & Tech.2001
  • 3Duygulu, P,Barnard, K,de Freitas, J.F.G,Forsyth, D.A.Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary[].Proc th European Conf on Computer Vision.2002
  • 4Hotelling,H.Relations between two sets of variables[].Biometrika.1936
  • 5Jeon, J,Lavrenko, V,Manmatha, R.Automatic Image Annotation and Retrieval using Cross-media Relevance Models[].Proc Int ACM Conf on Research and Devel-opment in Information Retrieval.2003
  • 6Lu, T.C,Chang, C.C.Color image retrieval technique based on color features and image bitmap[].Int J Inf Processing and Management.2007
  • 7Slaney M,Covell, M.2000. FaceSync: A Linear Operator foMeasuring Synchronization of Video Facial Images andAudio Tracks[].Proc Neural Information Processing Systems.
  • 8Snoek, C,Worring, M,Smeulders, A.W.M.Early versus Late Fusion in Semantic Video Analysis[].Proc ACM Multimedia.2005
  • 9Tan B,Shen, X.H,Zhai, C.X.Mining Long-termSearch History to Improve Search Accuracy[].Proc IntConf on Knowledge Discovery and Data Mining.2006
  • 10Wang, X.J,Ma, W.Y,Xue, G.R,Li, X.Multi-model Similarity Propagation and its Applications for Web Image Retrieval[].Proc ACM Multimedia.2004

同被引文献6

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部