DNN Speech Recognition Research Based on a Multi-GPU Parallel Framework (cited by: 1)

Speech Recognition Research on Multi-GPU Parallel Framework of Deep Neural Network


Abstract: This paper proposes a multi-GPU parallel framework for deep neural networks (DNNs) and describes its implementation and performance optimizations. By drawing on the cooperative parallel computing power of multiple GPUs and exploiting data parallelism, the framework trains deep neural networks quickly and efficiently. In a speech recognition application, it improves both convergence speed and model quality: it achieves a 4.6x speedup over a single GPU, training on billions of samples converges within a few days, and the character error rate falls by about 10%.
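The data-parallel idea named in the abstract can be illustrated with a small sketch. This is not the paper's framework: it assumes simulated workers in place of GPUs, a one-dimensional linear model in place of a DNN, and hypothetical helper names (`gradient`, `split_batch`, `data_parallel_gradient`, `train`). It only shows the general pattern in which each worker computes a gradient on its own shard of the mini-batch and the shard gradients are averaged before a single shared update.

```python
# Minimal sketch of data-parallel gradient averaging (illustrative only).
# Assumptions: simulated workers stand in for GPUs, and a 1-D linear model
# y = w*x + b stands in for a deep neural network.

def gradient(w, b, batch):
    """Gradient of the mean squared error of y = w*x + b over one batch."""
    n = len(batch)
    gw = sum(2 * (w * x + b - y) * x for x, y in batch) / n
    gb = sum(2 * (w * x + b - y) for x, y in batch) / n
    return gw, gb

def split_batch(batch, num_workers):
    """Split a mini-batch into equal-sized shards, one per worker."""
    if len(batch) % num_workers != 0:
        raise ValueError("batch size must be divisible by num_workers")
    size = len(batch) // num_workers
    return [batch[i * size:(i + 1) * size] for i in range(num_workers)]

def data_parallel_gradient(w, b, batch, num_workers):
    """Each worker computes a shard gradient; the results are averaged."""
    shards = split_batch(batch, num_workers)
    # On real hardware these per-shard computations would run concurrently.
    grads = [gradient(w, b, shard) for shard in shards]
    gw = sum(g[0] for g in grads) / num_workers
    gb = sum(g[1] for g in grads) / num_workers
    return gw, gb

def train(data, num_workers, lr=0.1, steps=1000):
    """Gradient descent where every step uses the averaged gradient."""
    w, b = 0.0, 0.0
    for _ in range(steps):
        gw, gb = data_parallel_gradient(w, b, data, num_workers)
        w -= lr * gw
        b -= lr * gb
    return w, b

# Toy data drawn from y = 3x + 1 (hypothetical, for illustration).
data = [(x / 4.0, 3 * (x / 4.0) + 1) for x in range(8)]
w, b = train(data, num_workers=4)
```

With equal-sized shards, the averaged shard gradient equals the gradient over the whole mini-batch, so the update is the same as on a single device while the per-step work is divided among workers. Any speedup in practice depends on communication and synchronization costs, which this sketch does not model.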

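Author: Yang Ning (杨宁)

Affiliation: School of Mathematics and Information Technology, Nanjing Xiaozhuang University

Source: Microelectronics & Computer (《微电子学与计算机》), CSCD, Peking University Core Journal, 2015, No. 7, pp. 6-10 (5 pages)

Funding: National Natural Science Foundation of China, Young Scientists Fund (61202136)

Keywords: deep neural network; speech recognition; graphics processing unit; parallel framework

Classification code: TP183 [Automation and Computer Technology: Control Theory and Control Engineering]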





