期刊文献+

基于上下文敏感区块的模糊语音准确识别方法

Accurate recognition of fuzzy speech based on context sensitive block
下载PDF
导出
摘要 为对语音信号进行良性切分,实现有目的性的声源重组,提出一种基于上下文敏感区块的模糊语音准确识别方法。在区块组织的频谱特征中,确定模糊语音的Gabor滤波传输条件,并对Delta描述算子进行定向计算,完成上下文敏感区块模糊语音的特征参数分析。在此基础上,利用深度识别神经网络,对模糊语音的特征线索进行有效分离,并对其识别端点进行逐一排查,完成新型语音准确识别方法的构建。对比实验数据显示,与基础语音识别方法相比,基于上下文敏感区块的模糊语音准确识别方法既可将最大信号切分率提升至95%左右,也能保持声源信号的最大深度不超过4.50×10^-7μm,达到重组声源的目的。 In order to segment speech signal benignly and achieve purposeful source reorganization,an accurate recognition method based on context-sensitive blocks for fuzzy speech is proposed. In the spectrum characteristics of the block organization,the Gabor filter transmission condition of the fuzzy speech is determined,and the Delta descriptor is calculated in orientation to complete the analysis of the characteristic parameters of the context-sensitive block fuzzy speech. On this basis,the deep recognition neural network is used to effectively separate the feature clues of the fuzzy speech,and the recognition endpoints are checked one by one to complete the construction of a new accurate speech recognition method. The experimental results show that compared with the basic speech recognition method,the context-sensitive block-based fuzzy speech recognition method can not only increase the maximum signal segmentation rate to about 95%,but also maintain the maximum depth of the source signal not more than 4.50*10^-7μm,so as to achieve the purpose of recombining the source.
作者 全龙翔 阿不力克木·吾甫尔 马超 武江波 QUAN Long⁃xiang;Abulikemu·Wupuer;MA Chao;WU Jiang⁃bo(State Grid Xinjiang Electeic Power Research Institute CO.,LTD,Urumqi 830000,China)
出处 《电子设计工程》 2020年第1期32-35,44,共5页 Electronic Design Engineering
基金 江苏省科技厅项目(CGYKJQQ00000019)
关键词 敏感区块 模糊语音 频谱特征 GABOR滤波 Delta描述子 sensitive blocks fuzzy speech spectrum features Gabor filtering Delta descriptor
  • 相关文献

参考文献16

二级参考文献94

  • 1刘梓溪,张航.基于QPSO算法优化的RBF神经网络设计[J].中南大学学报(自然科学版),2013,44(S1):27-30. 被引量:3
  • 2汤韩杰,袁晓.子波分析中尺度与波长的关系[J].电子科技大学学报,2006,35(1):13-16. 被引量:6
  • 3杨艺,李建勋,柯熙政.小波方差在信号特征提取中的应用[J].传感器世界,2006,12(1):33-35. 被引量:11
  • 4陈理,袁晓,汤韩杰,帅晓飞.金融时间序列结构波动的子波变换分析[J].四川大学学报(自然科学版),2007,44(2):293-298. 被引量:1
  • 5Torres-Carrasquillo P A, Singer E, Kohler M A., et al. Approachesto language identification using gaussian mixture models andshifted delta cepstral features [C]//Proc ICSLP. 2002: 33-36.
  • 6Mohamed A, Dahl G, Hinton G. Acoustic modeling using deepbelief networks [J]. IEEE Transactions on Audio, Speech, andLanguage Processing, 2012, 20(1): 14-22.
  • 7Dahl G E, Sainath T N, Hinton G E. Improving deep neural networksfor lvcsr using rectified linear units and dropout[C]//ICASSP,2013.
  • 8Hinton G, Srivastava N, Krizhevsky A, et al. Improving neuralnetworks by preventing co-adaptation of feature detectors[J]. TheComputing Research Repository, abs/1207.0580, 2012.
  • 9Vinod Nair, Geoffrey G, Hinton. rectified linear units improverestricted boltzmann machines[C]//ICML-10.2010.
  • 10Zeiler M D, Ranzato M, Monga R., et al. On Rectified LinearUnits for Speech Processing[C]//ICASSP, 2013.

共引文献86

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部