Point Process Models Embedded with Deep Belief Networks for Keyword Spotting (嵌入深度信念网络的点过程模型用于关键词检出)

Cited by: 5

Abstract: Keyword spotting based on the point process model (PPM) is a recent approach to detecting keywords in continuous speech. It needs relatively few training samples and is computationally fast, but its detection performance depends heavily on the accuracy of the front-end phoneme detectors. The Gaussian mixture model (GMM), currently the most widely used phoneme detector, has limited representation and modeling capacity. To address this weakness, this paper proposes a point process model embedded with deep belief networks (DBNs) and applies it to keyword spotting. The model builds its phoneme detectors from DBNs, whose strong representational power compensates for the GMM's limitations. Experimental results show that the method achieves a higher detection rate than the original model while reducing computational complexity, making it better suited to applications that require real-time keyword spotting.
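This record does not give the paper's event definition or thresholds. As a minimal sketch of how a phoneme detector can feed a point process model, the code below assumes a common front end: each detector outputs a posterior trajectory over frames, and an event is emitted wherever that trajectory has a local maximum above a threshold. The function name `posteriors_to_events` and the parameters `threshold` and `frame_shift` are hypothetical, and the posteriors could come from any detector (GMM or DBN).

```python
import numpy as np


def posteriors_to_events(posteriors, threshold=0.5, frame_shift=0.01):
    """Turn a (frames x phones) posterior matrix into per-phone event times.

    Assumption (not taken from the paper): an event is a frame whose
    posterior exceeds `threshold` and is a local maximum of that phone's
    trajectory. The first and last frames are skipped because they have
    only one neighbour. Times are in seconds, using `frame_shift`.
    """
    posteriors = np.asarray(posteriors, dtype=float)
    n_frames, n_phones = posteriors.shape
    events = []
    for p in range(n_phones):
        traj = posteriors[:, p]
        times = []
        for t in range(1, n_frames - 1):
            is_peak = traj[t] >= traj[t - 1] and traj[t] > traj[t + 1]
            if is_peak and traj[t] > threshold:
                times.append(t * frame_shift)
        events.append(times)
    return events
```

The result is one list of event times per phone, a sparse representation that a keyword model could then score. A more accurate detector yields cleaner event streams, which is the dependency the abstract describes.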

Authors: Lu Jun (陆俊), Zhang Qiong (张琼), Yang Jun'an (杨俊安), Wang Yi (王一), Liu Hui (刘辉)

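Affiliations: Electronic Engineering Institute (电子工程学院); Key Laboratory of Electronic Restriction, Anhui Province (电子制约技术安徽省重点实验室); Research Institute of China Electronic Equipment System Engineering Company (中国电子设备系统工程公司研究所)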

Source: Journal of Signal Processing (《信号处理》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 7, pp. 865-872 (8 pages)

Funding: National Natural Science Foundation of China (No. 61272333)

Keywords: keyword spotting; point process model; deep belief network

Classification code: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]


