期刊文献+

可配置语音识别输出概率计算协处理器的设计

Design of a configurable output probability calculation coprocessor for speech recognition
原文传递
导出
摘要 在基于连续隐含Markov模型的嵌入式语音识别系统中,为提升计算效率、降低系统功耗,将算法中计算消耗最大的输出概率计算模块作为协处理器实现。通过先入先出队列电路隔离输出概率计算中的Markov距离和对数加法的数据通路使得系统参数可以灵活配置,并根据输出概率计算所需参数的地址产生规则设计了地址产生单元。采用Xilinx Virtex-5系列FPGA实现了该输出概率协处理器,并通过S3C44B0X微控制器验证了该设计。在配置参数为3维Gauss混合分量、27维特征矢量的条件下,对358个状态,协处理器工作在27MHz的时钟频率时计算输出概率的处理速度达到了0.13倍实时。 The most time-consuming output probability in embedded speech recognition systems based on the continuous hidden Markov model was computed using a configurable co-processor to promote the computation efficiency and lower the system power consumption. The output probability calculation (OPC) includes the Mahalanobis distance and the add-log modules with a FIFO used to separate these two circuits,therefore,making the designed system more configurable. The address generation unit was also specially designed for the OPC. The coprocessor was implemented on the Xilinx Virtex-5 and verified by using S3C44B0X as a host controller. Experiments show that the coprocessor costs 0.13 real-time to calculate 358 states' output probabilities with 3-D Gaussian mixtures and 27-D speech feature vectors and with clock of 27 MHz.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2010年第4期636-639,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家“八六三”高技术研究发展计划重点项目(2008AA010700)
关键词 语音识别 输出概率计算 并行计算 FIFO speech recognition output probability calculation parallel computing FIFO
  • 相关文献

参考文献5

  • 1DONG Ming, LIU Jia, LIU Runsheng. Speech interface ASIC of SOC architecture for embedded application [C]// ICSP'02. Piscataway, NJ: IEEE Press, 2002.. 402-405.
  • 2董明,刘加,刘润生.高性能汉语数码语音识别芯片系统[J].清华大学学报(自然科学版),2003,43(9):1257-1260. 被引量:5
  • 3施妙根.科学与工程计算基础[M].北京:清华大学出版社,1999,8:140-142.
  • 4LI Peng, TANG Huang, LIANG Weiqian. Low power embedded speech recognition system based on an mcu and coprocessor [C]// ICASSP'09. Taipei, China: IEEE Press, 2009, 625-628.
  • 5李鹏,智强,董明,梁维谦,刘润生.嵌入式语音识别Mahalanobis距离计算模块[J].清华大学学报(自然科学版),2008,48(7):1202-1204. 被引量:2

二级参考文献6

  • 1DONG Ming, LIU Jia, LIU Runsheng. Speech interface ASIC of SOC architecture for embedded application [C]// ICSP' 02. Piscataway, NJ : IEEE Press, 2002 : 402 - 405.
  • 2HUANG Xuedong. Spohen Language Processing: A Guide to Theory, Algorithm, and System Development [M]. Upper Saddle River, NJ: Prentice Hall PTR, 2001.
  • 3Hennessy J L, Patterson D A. Computer Architecture: A Quantitative Approach [M]. Third Edition. Burlington, MA, USA: Elsevier Science, 2004.
  • 4Ciletti M D. Advanced Digital Design With the Verilog HDL [M]. Boston, MA, USA: Prentice Hall, 2005.
  • 5顾良,刘润生.汉语数码语音识别:发展现状、难点分析与方法比较[J].电路与系统学报,1997,2(4):32-39. 被引量:12
  • 6董明,刘加,刘润生.高性能汉语数码语音识别芯片系统[J].清华大学学报(自然科学版),2003,43(9):1257-1260. 被引量:5

共引文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部