
A New DBN-Based Acoustic Feature Extraction Method

A New Feature Extraction Method Based on Bottleneck Deep Belief Network
Abstract  In large-vocabulary continuous speech recognition (LVCSR) systems, to further strengthen network robustness and improve the recognition accuracy of the deep belief network, a feature extraction method based on a discriminatively trained, ODLR-adapted bottleneck deep belief network is proposed. The method first uses a bottleneck deep belief network, chosen for its robustness, to perform preliminary feature extraction; discriminative training is then applied so that the network becomes more discriminative and achieves higher recognition accuracy; on this basis, speaker adaptation is introduced to adjust the network and further improve the robustness of the model. The proposed acoustic features were tested on several public continuous-speech databases with strong noise and casual speaking styles; the recognition results improved by 22.2%, with a relative improvement of 6.9% in recognition accuracy. The experimental results demonstrate the effectiveness of the proposed feature extraction method compared with the conventional approach.
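As a rough illustration of the bottleneck idea summarized in the abstract, the sketch below shows how frame-level features can be read off a narrow hidden layer of a feed-forward network. It is a minimal sketch under stated assumptions only: the class name, sigmoid units, layer sizes, and the 42-dimensional bottleneck are illustrative choices rather than the paper's configuration, and the DBN pre-training, discriminative training, and ODLR speaker adaptation steps of the proposed method are not implemented here.

    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    class BottleneckFeatureExtractor:
        """Feed-forward network with one narrow ('bottleneck') hidden layer.

        The activations of the bottleneck layer are taken as the acoustic
        feature vector for each input frame. All sizes are illustrative.
        """

        def __init__(self, layer_sizes, bottleneck_index, seed=0):
            rng = np.random.default_rng(seed)
            # Random weights stand in for a trained network; in practice the
            # weights would come from pre-training and fine-tuning.
            self.weights = [rng.normal(0.0, 0.1, size=(m, n))
                            for m, n in zip(layer_sizes[:-1], layer_sizes[1:])]
            self.biases = [np.zeros(n) for n in layer_sizes[1:]]
            self.bottleneck_index = bottleneck_index  # index into self.weights

        def extract(self, frames):
            """Propagate frames forward and return bottleneck activations."""
            h = frames
            for i, (w, b) in enumerate(zip(self.weights, self.biases)):
                h = sigmoid(h @ w + b)
                if i == self.bottleneck_index:
                    return h  # stop once the bottleneck layer is reached
            return h

    if __name__ == "__main__":
        # 39-dimensional input frames (e.g. MFCC-like), 42-unit bottleneck.
        extractor = BottleneckFeatureExtractor(
            layer_sizes=[39, 1024, 1024, 42, 1024, 2000],
            bottleneck_index=2,   # weights[2] maps into the 42-unit layer
        )
        frames = np.random.randn(100, 39)   # 100 frames of dummy input
        features = extractor.extract(frames)
        print(features.shape)               # (100, 42)

In a typical bottleneck-feature pipeline, such a network would first be trained on a phonetic classification target, and the bottleneck activations would then replace or augment the original spectral features fed to the recognizer's acoustic model.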
Source  Radio Communications Technology (《无线电通信技术》), 2015, No. 6, pp. 41-45 (5 pages)
Funding  National Natural Science Foundation of China (Grant No. 60872113)
Keywords  Continuous Speech Recognition; Bottleneck Deep Belief Network; Discriminative Training; ODLR
