实用语音识别的场景标记辅助系统

Multimedia scene labeling based on speech

下载PDF

导出

摘要标引是通过给音频-视频数据加入标记,对其内容进行描述,以便于信息的检索和查询。语音标引在媒体资产管理中扮演了很重要的脚色。介绍了一种基于EBF网络的语音标引辅助系统,该系统可自动识别标引员所说的短语,辅助标引员在视频媒体上实现标引。系统从语句中将这些短语分割出来,并通过EBF神经网络进行建模。实验结果证明,该系统具有实用性,在媒体资产管理方面有良好的应用前景。 The main objective of the indexing process is to assign labels to the audio-visual data in order to describe its content. Audio indexing plays a key role in this process. In this paper, a speech-based man-machine labeling system for media asset management is presented. The system recognizes the phrases spoken by the human annotator automatically and assists him to mark up shots of subjects in a video sequence. The phrases are segmented from short sentences and modeled by the elliptical basis function （EBF） networks. Experimental results indicate that the speech-based labeling system is practical and has great promise for media asset management.

作者杨庆涛李昕郑宇张芸

机构地区上海大学机电工程与自动化学院上海大学计算机学院

出处《声学技术》 CSCD 北大核心 2006年第5期478-481,共4页 Technical Acoustics

基金上海市教委青年基金(04AB72) 上海市科委启明星计划资助(04QMX1441)

关键词媒体资产管理语音标引 EBF网络 media asset management speech-based label EBF neural network.

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1Han Mei,Gong Yihong.Baseball scene classification using multimedia features[A].2002 IEEE International Conference on Multimedia and Expo[C].2002,1:821-824.
2Li Baoxin,Pan Hao,Sezan I.A general framework for sports video summarization with its application to soccer[A].2003 IEEE International Conference on Acoustics[C].Speech,and Signal Processing (ICASSP'03),2003,3:169-172.
3Chang Y L,Chang W Zeng,I Kamel,et al,Integrated image and speech analysis for content-based video indexing[A].Proceedings of the Third IEEE International Conference on 17-23 June 1996[C].1996,306-313.
4Mak M W,Kung S Y.Estimation of elliptical basis function parameters by the EM algorithm with application to speaker verification[A].IEEE Trans.on Neural Networks[C].2000,11(4),961-969.

1李昕,郑宇,江芳泽.基于EBF网络的话者识别研究和软件开发[J].计算机应用与软件,2003,20(9):3-5.
2李昕,费敏锐.用改进RPCL和EM算法确定EBF网络结构和参数的策略及其应用[J].模式识别与人工智能,2003,16(2):204-207. 被引量：2
3文竹.基于互联网的软件开发和过程分析的研究[J].电脑编程技巧与维护,2017(6):16-18.
4张文龙.学生成绩管理系统[J].科技创新导报,2009,6(27):136-137. 被引量：3
5李昕,郑宇,费敏锐.基于EBF网络的非线性特征映射器及其在鲁棒话者识别中的应用[J].信号处理,2003,19(3):256-261.
6高培,赵鑫,王士同.基于有效神经元的自组织模糊神经网络算法[J].计算机工程与应用,2012,48(35):50-56. 被引量：2
7唐达,刘丹妮.一种工作流时间截止期限的动态验证方法[J].计算机集成制造系统,2004,10(9):1154-1159. 被引量：8
8DINTEK以服务业精神提供布线精品——鼎志十周年记[J].计算机网络世界,2000,10(11):87-88.
9王金花,朱怡安.基于AODV且考虑拓扑信息的节能路由算法[J].微电子学与计算机,2009,26(5):117-121. 被引量：1
10龙猫.其实它是个人——从《金刚》看动作捕捉技术[J].大众软件,2006(8):33-33.

声学技术

2006年第5期

浏览历史

内容加载中请稍等...

实用语音识别的场景标记辅助系统

参考文献4

相关作者

相关机构

相关主题

浏览历史