期刊文献+

嵌入深度信念网络的点过程模型用于关键词检出 被引量:5

Point process models embedded with deep belief networks for spotting Key words
下载PDF
导出
摘要 基于点过程模型的关键词检出系统是一种新颖的连续语音关键词检出系统,虽然该系统具有对样本数要求不高、计算速度快等优点,但其检出性能比较依赖于前端音素探测器的准确度,而目前广泛用于音素探测器的高斯混合模型存在表征和建模能力不强的问题。针对这一缺陷,本文提出了一种嵌入深度信念网络的点过程模型并将其应用于关键词检出,该模型采用表征能力强的深度信念网络来建立音素探测器,改进了高斯混合模型在表征能力上的不足。实验结果表明该方法能够获得比原模型更高的检出率,并且降低了计算复杂度,更适用于需要实时检测关键词的场合。 The keywords spotting system based on point process model is a novel keyword spotting system in continuous speech.Although this system has the advantage of less demanding on samples number and fast calculation,but its performance is mostly depends on the accuracy of the front phoneme detector.However,the Gaussian mixture model which is widely used in the phoneme detector has weaknesses in representation and modeling.To solve this problem,this paper proposes a point process model embedded with deep belief networks and use it for Key words spotting.This model establishes a phoneme detector using deep belief networks,which has a prominent capability to represent features,to overcome GMM's shortage in feature representation.Experimental results show that this method can obtain a higher detection rate than the original model and reduce the computational complexity,and it can meet the real-time requirement of spotting Key words preferably.
出处 《信号处理》 CSCD 北大核心 2013年第7期865-872,共8页 Journal of Signal Processing
基金 国家自然科学基金(No.61272333)
关键词 关键词检出 点过程模型 深度信念网络 Key words spotting point processes models deep belief network
  • 相关文献

参考文献12

  • 1Jansen A, Niyogi P. Point Process Models for Spotting Keywords in Continuous Speech[ J ]. IEEE Transactions on Audio, Speech, and 'Language Processing. 2009, 17 (8) : 1457-1470.
  • 2Jansen A. Whole Word Discriminative Point Process Mod- els[ C ]. IEEE International Conference on Acoustics, Speech and Signal Processing, 2011:5180-5183.
  • 3Deng L. An Overview of Deep-Structured Learning for Infor- marion Processing: APSIPA ASC 2011[C]. Xi'an: 2011.
  • 4Mohamed A, Dahl G E, Hinton G. Acoustic Modeling Using Deep Belief Networks[ J ]. IEEE Transactions on Audio, Speech, and Language Processing. 2012, 20 (1): 14-22.
  • 5许友亮,张连海,张文林,李永彬.基于语速调整和音位属性后验概率的音素识别[J].信号处理,2012,28(2):295-300. 被引量:5
  • 6Himon G E, Osindero S, Teh Y. A Fast Learning Algo- rithm for Deep Belief Nets [ J ]. Neural Computation. 2006, 18: 1527-1554.
  • 7Hinron G E, Salakhutdinov R. Reducing the Dimension- ality of Data with Neural Networks[ J]. Science. 2006, 313(5786) : 504-507.
  • 8Mostafa A. Salanm, Aboul Ella Hassanien, Aly A. Fahmy. Deep Belief Network for Clustering and Classification of a Continuous Data[ J]. IEEE Inlemational Symposium on Sig- nal Processing and Ibformation Technology, 2010: 473-477.
  • 9Mohamed A, Sainath T, Dahl G. Deep belief networks using discriminative features for phone recognition [ C ]. IEEE International Conference on Acoustics, Speech and Signal Processing, 2011: 5060-5063.
  • 10Pan J, Liu C, Wang Z, Hu Y, Jiang H. Investigalion of Deep Neural Networks (DNN) for Large Vocabulary Con-tinuous Speech Recognition Why DNN Surpasses GMMs in Acoustic Modeling. In Proceedings of International Sympo- sium on Chinese Spoken Language Processing 2012, un- published.

二级参考文献23

  • 1Chin-Hui Lee,Mark A.Clements,Sorin Dusan.An Overview on Automatic Speech Attribute Transcription(ASAT) [C]// Conference on the International Speech Communication Association.Antwerp,Belgium;InterSpeech Express, 2007.1825-1828.
  • 2S.King,P.Taylor.Detection of phonological features in continuous speech recognition using neural networks[J]. Computer,Speech and Language,2000,14(4):333-353.
  • 3M.A.Siegler,R.M.Stern.On the effects of speech rate in large vocabulary speech recognition systems[C]// International Conference on Acoustics,Speech,and Signal Processing. Detroit,MI:ICASSP express,1995.612-615.
  • 4V.R.Gadde,K.Sonmez,H.Franco.Multirate ASR Models for Phone-class Dependent N-best List Rescoring [C]//IEEE Workshop on Automatic Speech Recognition and Understanding(ASRU ).San Juan:IEEE express, 2005.157-161.
  • 5S.Dimopoulos,A.Potamianos,E.-F.Lussier,L.Chin-Hui. Multiple time resolution analysis of speech signal using MCE training with application to speech recognition [C]// International Conference on Acoustics,Speech, and Signal Processing.Tai Bei:IEEE express,2009. 3801-3804.
  • 6I-F Chen,Hsin-Min Wang.Articulatory Feature Asynchrony Analysis and Compensation in Detection-Based ASR//.International Speech Communication Association, Brighton United Kingdom,2009:3059-3062.
  • 7Zoltan Tuske,Christian Plahl,Ralf Schluter.A study on Speaker Normalized MLP Features in LVCSR[C]//Conference on the International Speech Communication Association. Florence,Italy,2011:1089-1092.
  • 8N.Strom,.“The NICO Artificial Neural Network Toolkit”, http://nico.nikkostrom.com.
  • 9Frantisek Grezl.Trap-Based Probabilistic Features For Automatic Speech Recognition[D].Brno,CZ:Brno University of Technology,2007.
  • 10Afsaneh Asaei,Benjamin Picart,Herve Bourlard.Analysis of Phone Posterior Feature space Exploiting Class-Specific Sparsity And MLP-Based Similarity Measure[C]// International Conference on ICASSP.Dallas,TX:2010. 4886-4889.

共引文献7

同被引文献49

  • 1熊伟丽,徐保国.基于PSO的SVR参数优化选择方法研究[J].系统仿真学报,2006,18(9):2442-2445. 被引量:65
  • 2谢经明,徐小凤,陈冰,陈幼平,艾武.基于模拟退火遗传算法的电动汽车网络优化调度[J].中国机械工程,2007,18(14):1697-1700. 被引量:7
  • 3Hinton G E,Salakhutdinov R R.Reducing the dimensionality of data with neural networks[J]. Science,2006,313:504- 507.
  • 4D.C.Park,M.A.El -Sharkawi,RJ.Marks,etal.Electric Load Forecasting Using an Artificial Neural Network[J].IEEE Trans On Power System,1991,6(2):442-449.
  • 5Hinton G E.A Practical Guid to Training Restricted Boltzman Machines[R].UMTL Tech Report 2010-003. Toronto,Canada: Univ of Toronto,2010.
  • 6张超,吕玉琴,侯宾,陈小军,俎云霄.基于BP神经网络短期电力负荷预测研究[J/OL].(2013-4-22)[2014-4-12].http://www.paper.edu.cn.
  • 7赵立强,张晓华,高振波,张洪亮.基于BP神经网络的主分量分析人脸识别算法[J].计算机工程与应用,2007,43(36):226-229. 被引量:12
  • 8VEDAM H, VENKATASUBRAMANIAN V. PCA-SDG based process monitoring and fault diagnosis[ J]. Control Engineering Practice, 1999, 7 (7) :903-917.
  • 9GHATE V N, DUDUL S V. Optimal MLP neural network classifier for fault detection of three phase in- duction motor[ J]. Expert Systems with Application, 2010, 37(4) : 3468-3481.
  • 10SALAMA M A, HASSANIEN A E, FAHMY A A. Deep belief network for clustering and classification of a contin- uous data [ J 1. IEEE International Symposium on Signal Processing and Information Technology, 2010:473- 477.

引证文献5

二级引证文献35

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部