期刊文献+

基于多模态子空间相关性传递的视频语义挖掘 被引量:12

Video Semantics Mining Using Multi-Modality Subspace Correlation Propagation
下载PDF
导出
摘要 在视频语义信息理解和挖掘中,充分利用图像、音频和文本等多模态媒质之间的交互关联是非常重要的研究方向.考虑到视频的多模态和时序关联共生特性,提出了一种基于多模态子空间相关性传递的语义概念检测方法来挖掘视频的语义信息.该方法对所提取视频镜头的多模态底层特征,根据共生数据嵌入(co-occurrence data embedding)和相似度融合(Si mFusion)进行多模态子空间相关性传递而得到镜头之间的相似度关系,接着通过局部不变投影(locality preserving projections)对原始数据进行降维以获得低维语义空间内的坐标,再利用标注信息训练分类模型,从而可对训练集外的测试数据进行语义概念检测,实现视频语义信息挖掘.实验表明该方法有较高的准确率. Research on content-based multimedia retrieval is motivated by a growing amount of digital multimedia content in which video data is a big part. Interaction and integration of multi-modality media types such as visual, audio and textual data in video are the essence of video content analysis. Although any uni-modality type partially expresses limited semantics less or more, video semantics are fully manifested only by interaction and integration of any unimodal. Video data comprises plentiful semantics, such as people, scene, object, event and story, etc. A great deal of research has been focused on utilizing multi-modality features for better understanding of video semantics. Proposed in this paper is a new approach to detect semantic concepts in video using co-occurrence data embedding (CODE), SimFusion, and locality preserving projections (LPP) from temporal associated cooccurring multimodal media data in video. The authors' experiments show that by employing these key techniques, the performance of video semantic concept detection can be improved and better video semantics mining results can be obtained.
出处 《计算机研究与发展》 EI CSCD 北大核心 2009年第1期1-8,共8页 Journal of Computer Research and Development
基金 国家自然科学基金项目(60603096 60533090) 国家"八六三"高技术研究发展计划重点基金项目(2006AA010107) 长江学者和创新团队发展计划基金项目(IRT0652)~~
关键词 视频语义挖掘 多模态 语义概念检测 子空间相关性传递 时序关联共生特性 video semantics mining multi-modality propagation temporal associated co-occurrence semantic concept detection subspace correlation
  • 相关文献

参考文献23

  • 1中国国家统计局2004年统计数据[OL].[2005-02-15].http://www.stats.gov.cn/tjsj/ndsj/2005/indexch.htm.
  • 2Peter Lyman, Varian Hal R. How much information [OL]. [2007-01-03]. http://www. sims. berkeley. edu/how muchinfo-2003
  • 3Liu Y N, Wu F. Video semantic concept detection using multi modality subspace correlation propagation [C]//Proc of the I3th Int Multimedia Modeling Conference. Berlin: Springer, 2007:527-534
  • 4Babaguchi N, Kawai Y, Kitahashi T. Event based indexing of broadcast sports video by intermodal collaboration [J]. IEEE Trans on Multimedia, 2002, 4(1): 68-75
  • 5Snoek C G M, Worring M. Multimedia event-based video indexing using time intervals [J]. IEEE Trans on Multimedia, 2005, 7(4): 638-647
  • 6代科学,付畅俭,武德峰,李国辉.视频挖掘:概念、技术与应用[J].计算机应用研究,2006,23(1):1-4. 被引量:8
  • 7Snoek C G M, Worring M, Smeulders A W M. Early versus late fusion in semantic video analysis [C] //Proc of the 13th Annual ACM Int Conf on Multimedia. New York: ACM, 2005 : 399-402
  • 8Hotellin H. The most predictable criterion [J]. Journal of Educational Psychology, 1935, 26:139-142
  • 9张鸿,吴飞,庄越挺.跨媒体相关性推理与检索研究[J].计算机研究与发展,2008,45(5):869-876. 被引量:20
  • 10Zhang H, Zhuang Y T, Wu F. Cross modal correlation learning for clustering on image-audio dataset [C] //Proc of ACM Int Conf on Multimedia. New York: ACM, 2007: 273-276

二级参考文献46

  • 1赵丕锡,王秀坤,李国辉,田宏.视频概要的分类与综合评价方法[J].计算机应用研究,2004,21(11):5-7. 被引量:3
  • 2张静,路红,薛向阳.基于索引结构的高效运动视频检索[J].计算机研究与发展,2006,43(11):1953-1958. 被引量:3
  • 3庄毅,庄越挺,吴飞.Composite Distance Transformation for Indexing and κ-Nearest-Neighbor Searching in High-Dimensional Spaces[J].Journal of Computer Science & Technology,2007,22(2):208-217. 被引量:3
  • 4付畅俭 李国辉 武德峰.基于直方图高阶差分聚类的视频结构挖掘[C]..宁波:第十三届全国多媒体技术学术会议[C].,2004.12-15.
  • 5李国辉 武德峰 付畅俭.多媒体知识及其获取[C]..宁波:第十三届全国多媒体技术学术会议[C].,2004.451-456.
  • 6A Hampapur, L Brown, J Connell, et al. Smart Surveillance: Applications Technologies and Implications [ C ]. Singapore: IEEE Pacific-Rim Conference on Multimedia, 2003.
  • 7Aslandogan Y, Yu C. Techniques and Systems for Image and Video Retrieval [J]. IEEE Transactions on Knowledge and Data Engineering, 1999,11(1):56-63.
  • 8柳崎峰 楼建光.智能视觉监控技术[EB/OL].http://www.seiencetimes.net/snw/2002—23/snw28.htm,2004—12—24.
  • 9Jiawei Han, Micheline Kambr. Data Mining: Concepts and Techniques[M]. Morgan Kaufmann Publishers,2002.
  • 10Mitsuru Kakimoto, Chie Morita, Hiroshi Tsukimoto. Data Mining from Brain Images[J]. Bosteon:ACM SIGKDD Conference, 2000.

共引文献26

同被引文献140

引证文献12

二级引证文献55

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部