
Application and Implementation of Dynamically-Adjustable Histogram Pruning for PDA Voice Dialing (cited by: 1)
Abstract: Using a personal digital assistant (PDA) running the embedded operating system Pocket PC as the experimental platform, this paper studies a user-definable voice dialer based on speaker-independent voice command recognition. Memory and speed are the two constraints that must be faced when bringing a voice dialing recognizer to such a device. To control the search space strictly and speed up decoding without sacrificing performance, the paper proposes a dynamically adjusted histogram pruning strategy, in which the pruning width is adjusted in real time according to the score differences between search (token) paths, and a lookup-table method for accelerating likelihood computation. After experimental verification, it also adopts lower-dimensional features and acoustic modeling based on Extended Initials/Finals (XIF), which proves suitable for embedded speech recognition. Experiments on a real PDA device show that, with a vocabulary of 200 personal names, the system reaches a recognition accuracy of 98.70%, runs about 80 times faster than a reference system that uses standard algorithms, and saves about 30% of the search memory.
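The abstract states only that the pruning width is adjusted in real time from the score differences between search paths; it does not give the rule itself. The following is a minimal sketch of one way such a scheme could look, not the authors' method. The function name `dynamic_histogram_prune`, the parameters `min_width`, `max_width`, and `gap_threshold`, and the widening rule are all assumptions made for illustration. A full sort stands in for the score histogram that a real decoder would use to find the cutoff cheaply.

```python
from typing import List


def dynamic_histogram_prune(
    scores: List[float],
    min_width: int = 2,
    max_width: int = 8,
    gap_threshold: float = 10.0,
) -> List[int]:
    """Return the indices of the surviving paths, best first.

    scores holds the log-likelihood score of each active path in the
    current frame (higher is better).

    Hypothetical adaptation rule (an assumption, not taken from the paper):
    keep at least min_width paths, then keep widening, up to max_width,
    for as long as the next-best path scores within gap_threshold of the
    best path. A clear winner therefore yields a narrow beam, and a close
    contest yields a wide one.
    """
    if not scores:
        return []

    # Rank paths by score; a real decoder would bin scores into a
    # histogram instead of sorting every frame.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    best = scores[order[0]]

    limit = min(max_width, len(order))
    width = min(min_width, len(order))
    while width < limit and best - scores[order[width]] <= gap_threshold:
        width += 1

    return order[:width]
```

With these assumed defaults, a frame in which one path clearly leads keeps only `min_width` paths, while a frame with many near-tied paths keeps up to `max_width`, which is the kind of trade-off between search space and accuracy that the abstract describes.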
Source: Audio Engineering (《电声技术》), 2005, No. 12, pp. 38-43 (6 pages)
Keywords: speech recognition; voice dialing; personal digital assistant (PDA); user-definable vocabulary; dynamically-adjustable histogram pruning
