
A New Multimodal, User-Friendly Voice Interaction System (新型多模态人性化语音交互系统) Cited by: 2

New Multi-modeling Voice Interactive System
Abstract: To meet the voice-interaction needs of service robots, this paper studies a new multimodal, user-friendly voice interaction system. The system combines keyword spotting in continuous speech streams, speaker recognition, microphone-array sound source localization, and dialogue management to support natural human-machine voice interaction. Used together, these techniques let the system determine who issued which instruction, when, and where. With this system, a robot can learn the needs of a specific user from speech and provide the corresponding service.
Authors: Han Chao (韩超), Liu Jia (刘加)
Source: Audio Engineering (《电声技术》), 2009, No. 8, pp. 78-80, 85 (4 pages)
Funding: Joint project of the National Natural Science Foundation of China and Microsoft Research Asia (60776800); National High Technology Research and Development Program of China (863 Program) projects (2006AA010101, 2007AA04Z223, 2008AA02Z414)
Keywords: keyword spotting; speaker recognition; microphone array; sound localization; dialogue management system
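The abstract names microphone-array localization as the component that answers "where", but it does not say how the position is computed. As an illustration only, the sketch below shows one common approach for a two-microphone pair: estimate the time difference of arrival by cross-correlation, then convert it to a far-field bearing. The function names, the speed-of-sound constant, and the two-microphone geometry are assumptions made for this example, not details taken from the paper.

```python
import math
from typing import List

# Assumed value: approximate speed of sound in air at room temperature (m/s).
SPEED_OF_SOUND = 343.0


def estimate_delay_samples(ref: List[float], other: List[float], max_lag: int) -> int:
    """Return the lag (in samples) at which `other` best matches `ref`.

    A positive result means the sound reached the `ref` microphone first.
    Hypothetical helper: plain time-domain cross-correlation, searched over
    lags in [-max_lag, max_lag].
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(len(ref)):
            j = i + lag
            if 0 <= j < len(other):
                score += ref[i] * other[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def bearing_degrees(delay_samples: int, sample_rate_hz: float, mic_spacing_m: float) -> float:
    """Convert a delay into a far-field bearing relative to the array broadside.

    Uses sin(theta) = c * tau / d, clamped to [-1, 1] so that noisy delay
    estimates cannot push asin() out of its domain.
    """
    tau = delay_samples / sample_rate_hz
    ratio = max(-1.0, min(1.0, SPEED_OF_SOUND * tau / mic_spacing_m))
    return math.degrees(math.asin(ratio))


def make_pulse(length: int, start: int) -> List[float]:
    """Build a short synthetic pulse for demonstration purposes."""
    signal = [0.0] * length
    for offset, value in enumerate([1.0, 2.0, 3.0, 1.0]):
        signal[start + offset] = value
    return signal
```

For example, `estimate_delay_samples(make_pulse(64, 10), make_pulse(64, 13), 8)` recovers the three-sample offset between the two synthetic channels, and `bearing_degrees` turns that delay into an angle for a given sample rate and microphone spacing. A real system in a noisy, reverberant room would need a more robust delay estimator than plain cross-correlation.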

