基于音频的数字媒体内容分析及其可视化

Audio-based digital multimedia content analysis and its visualization

下载PDF

导出

摘要为了对音视频内容进行更加有效地分析,将信息可视化方法引入数字媒体信息处理领域。设计并实现了集多媒体信号采集、大词表连续语音识别、文本检索和音频检索为一身的多媒体内容可视化分析平台,取得了较理想的效果,充实了信息可视化理论并对其具体应用进行了有益尝试。 To facilitate the content analysis of audio and video, information visualization methods are applied to digital multimedia processing. A multimedia content visualization analysis system is designed and constructed including multimedia signal collection, large vocabulary continuous speech recognition, text retrieval and audio retrieval, which is a supplement to the theory of information visualization and beneficial to its application. The experimental results are rather good.

作者张田李嵩高畅邱荣发李海峰

机构地区哈尔滨工业大学计算机科学与技术学院

出处《燕山大学学报》 CAS 2010年第2期100-105,共6页 Journal of Yanshan University

基金国家自然科学基金资助项目(60772076) 语言语音教育部-微软重点实验室开放基金资助项目(HIT.KLOF.2009015)

关键词数字媒体内容信息可视化语音识别文本检索音频检索 digital multimedia content information visualization speech recognition text retrieval audio retrieval

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献14

1Allen,B,窦平安.图书情报学研究中的内容分析法[J].国外情报科学,1993,11(1):27-30. 被引量：16
2Robertson G,Card S K,Mackinlay J D.The cognitive coprocessor architecture for interactive user interfaces[C] //Proceedings of the 2nd Annual ACM SIGGRAPH Symposium on User Interface Software and Technology,Williamsburg,Virginia,United States,1989:10-18.
3Huang XD,Acero A,Hon H W.Spoken language processing:a guide to theory,algorithm and system development[M].New Jersey:Prentice Hall PTR,2001.
4倪崇嘉,刘文举,徐波.汉语大词汇量连续语音识别系统研究进展[J].中文信息学报,2009,23(1):112-123. 被引量：39
5Reynolds D A.Automatic speaker recognition using Gaussian mixture speaker models[J].MIT Lincoln Laboratory Journal,1995,8 (2):173-191.
6Rabiner L R.A tutorial on hidden Markov models and selected applications in speech recognition[C] //Procedings of IEEE,1989,77 (2):257-286.
7Huang C,Shi Y,Zhou J L,et al..Segmental tonal modeling for phone set design in Mandarin LVCSR[C] //Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing,Montreal,Quebec,Canada,2004:901-904.
8Young S,Russell NH,Thornton J H S.Token passing:a simple conceptual model for connected speech recognition systems[R].Cambridge:Cambridge University Engineering Departmet,1989.
9Sakoe H,Chiba S.A similarity evaluation of speech patterns by dynamic programming[C] //the Dig.1970 Nat.Meeting,Institute of Electronic Communications Engineering of Japan,1970:136.
10Young S,Evermann G,Hain T,et al..The HTK Book (for HTK 3.3)[M/OL].Cambridge University Engineering Department,2005.

二级参考文献87

1孔燕,葛列众.突显及其工效学研究[J].人类工效学,1999,5(3):40-42. 被引量：10
2钱跃良,林守勋,刘群,刘宏.2005年度863计划中文信息处理与智能人机接口技术评测回顾[J].中文信息学报,2006,20(B03):1-6. 被引量：4
3Zhang, B., S. Matsoukas and R. Schwartz. Discrimina tively trained region dependent teature transforms for speech recognition [C]// Proc. ICASSP, Vol. 1-13, 2006: 313-316.
4Beyerlein, P., et al., Large vocabulary continuous speech recognition of Broadcast News - The Philips/ RWTH approach[J]. Speech Communication, 2002, 37(1-2): 109- 131.
5Hain, T., et al., Automatic transcription of conversational telephone speech [C]// IEEE Transactions on Speech and Audio Processing, 2005, 13(6): 1173-1185.
6Zhang, B. and S. Matsoukas, Minimum phoneme error based heteroscedastic linear discriminant analy sis for speech recognition[C]// Proc. ICASSP, Vol. 1-5, 2005: 1925-1928.
7Hirsimaki, T., et al., Unlimited vocabulary speech recognition with morph language models applied to Finnish[J]. Computer Speech and Language, 2006, 20(4) : 515-541.
8Odell, J.J., The Use of Context in Large Vocabulary Speech Recognition[D]. 1995, University of Cambridge :Cambridge
9Young, S.J., J.J. Odell, and P. C. Woodland. Tree-Based State Tying for High Accuracy Modelling [C]// Proceedings ARPA Workshop on Human Language Technology. 1994.
10Xu, B., et al., Integrating tone information in continuous Mandarin recognition[C]// Proc. ISSPIS, 1999.

共引文献63

1张韶瑾.微博健康传播议题设置特点及问题研究——以2022年微博健康知识相关热搜话题为例[J].新媒体研究,2023,9(4):104-109. 被引量：1
2赵蓉英,邹菲.内容分析法学科基本理论问题探讨[J].图书情报工作,2005,49(6):14-18. 被引量：21
3张岌秋.论网络环境下情报学研究方法的演化[J].图书情报工作,2005,49(10):33-36. 被引量：11
4王岚霞,李高峰.内容分析法在图书情报领域中的应用与展望[J].新世纪图书馆,2007(1):16-18. 被引量：6
5张少龙,吴佳鑫.语音信息的内容分析技术研究综述[J].现代图书情报技术,2007(4):28-31. 被引量：2
6金燕.WWW信息导航可视化研究[J].图书馆理论与实践,2007(4):49-51. 被引量：1
7葛列众,王琦君,王哲.采用Focus+Context技术改进网页注册界面可用性的实证研究[J].人类工效学,2007,13(3):28-30. 被引量：2
8黄桂晶,张进宝,罗李.基于“扎根理论”对微软“携手助学”信息技术教师培训现状的研究[J].电化教育研究,2008,29(5):86-89. 被引量：2
9张威.口译语料库的开发与建设:理论与实践的若干问题[J].中国翻译,2009,30(3):54-59. 被引量：48
10孟莎,刘加.汉语语音检索的集外词问题与两阶段检索方法[J].中文信息学报,2009,23(6):91-97. 被引量：8

1龚菁菁.可视化理论在地图学中的应用[J].河南测绘,2001(3):6-8.
2夏威夷,张迪,朱立谷.基于决策树的可视化分析平台的设计与实现[J].中国传媒大学学报（自然科学版）,2015,22(1):51-56. 被引量：1
3朱耀华,郝文宁,陈刚.可视化技术简述[J].电脑知识与技术,2012,8(2X):1402-1407. 被引量：9
4邵杰.以可视化分析平台挖掘数据背后的价值[J].现代制造,2016,0(26):34-34.
5鹿玉红,白灵,邢丽莉,王金峰,刘颖,李忠.可视化技术在进程管理模拟中的应用[J].科技通报,2014,30(9):177-180. 被引量：1
6毛爱萍.Scratch让程序教学成为创作之旅[J].中国信息技术教育,2013(10):15-17. 被引量：3
7孙凯,刘玉华,张成海,王长波.基于网络数据的企业知识图谱可视化[J].东华大学学报（自然科学版）,2016,42(4):473-477. 被引量：2
8Teradata增强Presto企业级功能[J].金融电子化,2016,0(7):95-95.
9吴玲达,杨超,于荣欢.基于并行的电子信息装备效能可视化分析研究[J].装备学院学报,2014,25(2):1-5. 被引量：3
10曲朝阳,谢光强,赵晓彤.面向服务结构的发电设备可靠性厂级监控信息系统的开发[J].电网技术,2004,28(20):23-27. 被引量：5

燕山大学学报

2010年第2期

浏览历史

内容加载中请稍等...

基于音频的数字媒体内容分析及其可视化

参考文献14

二级参考文献87

共引文献63

相关作者

相关机构

相关主题

浏览历史