A Hybrid Approach for Phoneme Recognition Based on Combination of Improved CP Neural Network and HMM

Cited by: 1

Abstract: This paper proposes a hybrid approach to phoneme recognition that combines an improved counter propagation (CP) neural network with a hidden Markov model (HMM). Its distinguishing feature is that the codebook of a discrete-HMM phoneme recognition system is generated by a modified CP network with several improvements, including supervised learning vector quantization (LVQ) and dynamic node allocation. The codebook in the resulting hybrid system is therefore, in effect, a high-performance classifier with strong discriminating power, trained by the supervised LVQ algorithm. Consequently, before an HMM is used to model the speech signal, the observation sequence produced by the codebook already carries highly discriminative information, which greatly improves HMM recognition performance at the phoneme level. In addition, because training is carried out on the improved CP network, LVQ learning proceeds automatically in supervised mode, learning is faster, convergence is better, and classification is more accurate. At the same time, the codebook size is effectively reduced, which makes HMM parameter estimation easier.
Finally, in two speaker-dependent phoneme recognition experiments, the hybrid approach is compared with the traditional VQ-HMM phoneme recognition approach, which uses a codebook generated by K-means clustering. The results show that the hybrid system achieves a recognition rate of 98% to 99%, and that its error rate is 4 to 6 times lower than that of a VQ-HMM system using a K-means codebook of the same size.
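
The codebook idea in the abstract can be illustrated with a minimal sketch. This is not the authors' improved CP network: it is plain LVQ1 with a simple, assumed node-allocation rule (a new codeword is added when a labeled frame has no same-class codeword within a squared distance `new_node_dist`). The function names, the threshold, the learning-rate schedule, and the toy 2-D "frames" are all hypothetical and chosen only for illustration. The resulting codeword indices are the kind of discrete observation sequence a discrete HMM would then model.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(codebook, x):
    """Index of the codeword closest to x."""
    return min(range(len(codebook)), key=lambda i: sq_dist(codebook[i][0], x))

def train_lvq1(samples, labels, lr=0.1, epochs=20, new_node_dist=4.0, seed=0):
    """Supervised LVQ1 with an assumed dynamic node-allocation rule.

    Each codebook entry is [vector, class_label].
    """
    rng = random.Random(seed)
    codebook = []
    order = list(range(len(samples)))
    for epoch in range(epochs):
        rng.shuffle(order)
        rate = lr * (1 - epoch / epochs)  # linearly decaying learning rate
        for i in order:
            x, y = samples[i], labels[i]
            same = [k for k, (_, lab) in enumerate(codebook) if lab == y]
            # Assumed rule: allocate a node when the class has no nearby codeword.
            if not same or min(sq_dist(codebook[k][0], x) for k in same) > new_node_dist:
                codebook.append([list(x), y])
                continue
            w = nearest(codebook, x)
            vec, lab = codebook[w]
            # LVQ1: attract the winner if its label matches, repel it otherwise.
            sign = 1.0 if lab == y else -1.0
            codebook[w][0] = [v + sign * rate * (xi - v) for v, xi in zip(vec, x)]
    return codebook

def quantize(codebook, frames):
    """Map each frame to its nearest codeword index (discrete HMM observations)."""
    return [nearest(codebook, f) for f in frames]

# Toy labeled frames for two hypothetical phoneme classes "a" and "i".
samples = [(0.0, 0.0), (0.2, 0.1), (-0.1, 0.3), (0.1, -0.2),
           (5.0, 5.0), (5.2, 4.9), (4.8, 5.1), (5.1, 5.3)]
labels = ["a", "a", "a", "a", "i", "i", "i", "i"]

codebook = train_lvq1(samples, labels)
observations = quantize(codebook, [(0.1, 0.1), (4.9, 5.0), (0.0, 0.2)])
observed_labels = [codebook[k][1] for k in observations]
```

Because every codeword carries a class label, the observation indices already encode class information before any HMM is trained, which is the property the abstract attributes to the supervised codebook. A K-means codebook, by contrast, is built without labels.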

Authors: Deng Wei (邓伟), Zhao Rongchun (赵荣椿)

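Affiliations: Department of Computer Engineering, Soochow University; Department of Computer Science and Engineering, Northwestern Polytechnical University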

Source: Journal of Data Acquisition and Processing (《数据采集与处理》), CSCD, 2000, No. 1, pp. 6-11 (6 pages)

Funding: Aeronautical Basic Science Foundation (航空基础科学基金)

Keywords: hidden Markov model; phoneme recognition; CP network; speech recognition; neural network; hybrid; counter propagation; learning vector quantization

Classification: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]

