
Multi-Model Behavior Synchronizing Prosody Model in Sign Language Synthesis (Cited by: 9)
Abstract: This paper proposes a multi-modal behavior synchronizing prosody model and its application to Chinese sign language synthesis. Trained on a large corpus of real multi-modal behavior data, the approach learns a prosody model for each single-channel behavior together with a synchronization model relating all the channels, and presents a framework for multi-modal synchronization in virtual-human synthesis covering sign language, speech, facial expression, lip movement, and so on. A formal description of the multi-modal prosody model is given in detail. Compared with traditional rule-based approaches, this learning-based approach better captures the complexity of the cross-modal synchronization relationship and synthesizes the virtual human's multi-modal behavior more realistically. As an example, a synchronizing prosody model that fuses sign-language prosody parameters with speech prosody parameters is presented, along with a method for computing the prosody parameters, which is applied to synchronous control of the virtual human's multi-modal behavior. Experiments on Coss (the "863" speech material library) and a Chinese sign language library show that the model works well, raising the recognition rate of synthetic sign language by 5.94%.
Source: Chinese Journal of Computers (《计算机学报》; EI, CSCD, PKU Core), 2006, Issue 5, pp. 822-827 (6 pages).
Funding: Jointly supported by the National Natural Science Foundation of China (60303018, 60403037), the Beijing Nova Program (2005B54), and the Open Project Fund of the Multimedia and Intelligent Software Technology Laboratory, Beijing University of Technology.
Keywords: sign language synthesis; multi-modal; prosody model
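The abstract describes fusing sign-language prosody parameters with speech prosody features to drive synchronized multi-modal control. A minimal, hypothetical sketch of that fusion idea is below; the function name `fuse_prosody`, the linear per-phrase weighting, and all numbers are illustrative assumptions, not the paper's actual model or parameters.

```python
# Hypothetical sketch: per-phrase durations from the sign channel and the
# speech channel are fused into one synchronized timeline that both
# modalities are then rendered against. Illustrative only.

def fuse_prosody(sign_durs, speech_durs, w=0.5):
    """Return fused per-phrase durations (seconds).

    sign_durs / speech_durs: phrase-aligned duration lists, one entry
    per phrase in each modality; w: weight on the speech channel (0..1).
    """
    if len(sign_durs) != len(speech_durs):
        raise ValueError("channels must be phrase-aligned")
    # Linear interpolation between the two channels' timing, per phrase.
    return [w * sp + (1.0 - w) * sg
            for sg, sp in zip(sign_durs, speech_durs)]

# Example: two phrases, equal weighting of the two channels.
fused = fuse_prosody([0.5, 1.0], [0.75, 0.5], w=0.5)
```

In a real system the fused timeline would then retime both the gesture animation and the synthesized speech so the channels stay synchronized phrase by phrase.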

