期刊文献+

面向视频的细粒度多模态实体链接

Fine-grained Multimodal Entity Linking for Videos
下载PDF
导出
摘要 随着互联网和大数据的飞速发展,数据规模越来越大,种类也越来越多.视频作为其中重要的一种信息方式,随着近期短视频的发展,占比越来越大.如何对这些大规模视频进行理解分析,成为学界关注的热点.实体链接作为一种背景知识补全方式,可以提供丰富的外部知识.视频上的实体链接可以有效地帮助理解视频内容,从而实现对视频内容的分类、检索、推荐等.但是现有的视频链接数据集和方法的粒度过粗,因此提出面向视频的细粒度实体链接,并立足于直播场景,构建了细粒度视频实体链接数据集.此外,依据细粒度视频链接任务的难点,提出利用大模型抽取视频中的实体及其属性,并利用对比学习得到视频和对应实体的更好表示.实验结果表明,该方法能够有效地处理视频上的细粒度实体链接任务. With the rapid development of the Internet and big data,the scale and variety of data are increasing.Video,as an important form of information,is becoming increasingly prevalent,particularly with the recent growth of short videos.Understanding and analyzing large-scale videos has become a hot topic of research.Entity linking,as a way of enriching background knowledge,can provide a wealth of external information.Entity linking in videos can effectively assist in understanding the content of video,enabling classification,retrieval,and recommendation of video content.However,the granularity of existing video linking datasets and methods is too coarse.Therefore,this study proposes a video-based fine-grained entity linking approach,focusing on live streaming scenarios,and constructs a fine-grained video entity linking dataset.Additionally,based on the challenges of fine-grained video linking tasks,this study proposes the use of large models to extract entities and their attributes from videos,as well as utilizing contrastive learning to obtain better representations of videos and their corresponding entities.The results demonstrate that the proposed method can effectively handle fine-grained entity linking tasks in videos.
作者 赵海全 王续武 李金亮 李直旭 肖仰华 ZHAO Hai-Quan;WANG Xu-Wu;LI Jin-Liang;LI Zhi-Xu;XIAO Yang-Hua(School of Computer Science,Fudan University,Shanghai 201203,China;Shanghai Key Laboratory of Data Science,(Fudan University),Shanghai 201203,China;School of Computer Science and Technology,Soochow University,Suzhou 215006,China)
出处 《软件学报》 EI CSCD 北大核心 2024年第3期1140-1153,共14页 Journal of Software
基金 国家重点研发计划(2020AAA0109302) 国家自然科学基金(62072323,62102095) 上海市科技创新行动计划(22511105902,22511104700) 上海市科技重大专项(2021SHZDZX0103) 上海市科学技术委员会资助项目(22511105902)。
关键词 细粒度 视频实体链接 数据集 大语言模型 对比学习 fine-grained video entity linking dataset large language model contrastive learning
  • 相关文献

参考文献3

二级参考文献6

共引文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部