
Design and Implementation of Front End Speech Processing System for Intelligent Voice Robot

Cited by: 2
Abstract: Front-end speech processing is a key technology for intelligent voice robots. Traditional front-end speech processing is mostly implemented on a DSP, which greatly increases system complexity and development cost. This paper presents a front-end speech processing system that addresses these problems by building on the WebRTC speech library and using the SRP-PHAT sound source localization algorithm. The system is implemented in C++ and can be deployed directly on an ordinary general-purpose processor or on an embedded ARM processor. Functional and performance tests show that the system meets the front-end speech processing requirements of an intelligent voice robot.
Author: LIU Sheng (刘生), Nanjing Panda Electronic Equipment Co., Ltd., Nanjing 210000
Source: Modern Computer (《现代计算机》), 2021, No. 3, pp. 106-110 (5 pages)
Keywords: front-end speech; WebRTC; sound source localization; voice robot
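
The abstract names SRP-PHAT (steered response power with phase transform) as the localization algorithm but this record gives no implementation details. The following is only a minimal C++ sketch of the general technique, not the paper's code and not a WebRTC API. It assumes a far-field source, a linear microphone array, one frame per microphone, and a naive DFT in place of an FFT. The names `dft` and `srp_phat_angle_deg`, the 1-degree search grid, and the default sound speed of 343 m/s are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <complex>
#include <cstddef>
#include <vector>

using Spectrum = std::vector<std::complex<double>>;

constexpr double kPi = 3.14159265358979323846;

// Naive O(N^2) DFT, used here only for clarity; a real system would use an FFT.
Spectrum dft(const std::vector<double>& x) {
    const std::size_t n = x.size();
    Spectrum out(n);
    for (std::size_t k = 0; k < n; ++k) {
        std::complex<double> acc(0.0, 0.0);
        for (std::size_t t = 0; t < n; ++t) {
            const double phase =
                -2.0 * kPi * static_cast<double>(k * t % n) / static_cast<double>(n);
            acc += x[t] * std::polar(1.0, phase);
        }
        out[k] = acc;
    }
    return out;
}

// Hypothetical helper: returns the candidate angle in degrees (0..180, measured
// from the array axis) that maximizes the PHAT-weighted steered response power.
//   frames[i]    : one frame of samples from microphone i (equal lengths)
//   mic_pos_m[i] : position of microphone i along the array axis, in meters
double srp_phat_angle_deg(const std::vector<std::vector<double>>& frames,
                          const std::vector<double>& mic_pos_m,
                          double sample_rate_hz,
                          double sound_speed_mps = 343.0) {
    assert(frames.size() >= 2 && frames.size() == mic_pos_m.size());
    const std::size_t n = frames[0].size();
    std::vector<Spectrum> spectra;
    for (const auto& frame : frames) {
        assert(frame.size() == n);
        spectra.push_back(dft(frame));
    }

    double best_angle = 0.0;
    double best_power = -1e300;
    for (int deg = 0; deg <= 180; ++deg) {
        const double cos_theta = std::cos(deg * kPi / 180.0);
        double power = 0.0;
        // Sum the PHAT-weighted cross-spectra over all microphone pairs.
        for (std::size_t i = 0; i < frames.size(); ++i) {
            for (std::size_t j = i + 1; j < frames.size(); ++j) {
                // Far-field delay of mic i relative to mic j, in samples.
                const double tau = -(mic_pos_m[i] - mic_pos_m[j]) * cos_theta *
                                   sample_rate_hz / sound_speed_mps;
                // Positive-frequency bins are sufficient for real-valued input.
                for (std::size_t k = 1; k < n / 2; ++k) {
                    const std::complex<double> cross =
                        spectra[i][k] * std::conj(spectra[j][k]);
                    // PHAT weighting: keep only the phase of the cross-spectrum.
                    const std::complex<double> phat = cross / (std::abs(cross) + 1e-12);
                    const double steer =
                        2.0 * kPi * static_cast<double>(k) * tau / static_cast<double>(n);
                    power += std::real(phat * std::polar(1.0, steer));
                }
            }
        }
        if (power > best_power) {
            best_power = power;
            best_angle = deg;
        }
    }
    return best_angle;
}
```

A caller would pass one frame per microphone together with the microphone positions, for example `srp_phat_angle_deg(frames, {0.0, 0.05, 0.10}, 16000.0)` for a hypothetical three-microphone array with 5 cm spacing sampled at 16 kHz. The PHAT weighting discards magnitude and keeps only phase, which is why the method is commonly described as robust to reverberation; the grid search over angles is the "steered response" part.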
