基于动态时间规整和隐马尔可夫统一模型的无端点检测的汉语识别算法

A Recognition Algorithm without the Ending Point Detection of Chinese Based on DTW and HMM Unified Model

下载PDF

导出

摘要根据汉语语音的特点，提出了一种无端点检测的语音识别算法。在识别过程中，该算法无需确定语音信号起止点位置，而是从寂静段开始，直接按帧提取特征（帧长２０ｍｓ，帧间重叠５０％），特征向量由１５阶倒谱系数和帧平均能量组成。在动态时间规整（ＤＴＷ）和隐马尔可夫（ＨＭＭ）统一模型（ＤＨＵＭ）中，引进寂静段自环，并用ＤＨＵＭ实现了该算法。对９９个相似汉语单字的识别实验表明：无端点检测的识别器正识率为９４．９５％，正识率下降很少，但不作端点检测却降低了算法的复杂程度。该算法中，若特征向量采用一种听觉模型特征，识别器具有更好的鲁棒性，识别率会略有提高。 Describes a characteristic of Chinese speech, and proposes a recognition algorithm without the ending point detection. Compared with the traditional method, in this algorithm, it is not necessary to decide the ending point of speech signals. From the stationary segment on, feature vectors consisting of 15 order Cepstrum coefficients and the average energy of each frame, are extracted in frames(length of each frame is 20 ms and the overlapping between two frames is 50%). By introducing the self loop of the stationary segment of the DTW and HMM Unified Model(DHUM), this algorithm is successfully implemented. In recognition of 99 similar Chinese words, a first candidate recognition rate of 94.95% is obtained. If an auditory feature is accepted for feature vectors, the robustness of the algorithm will be better.

作者张杰张焱黄志同

机构地区南京理工大学自动化系

出处《数据采集与处理》 CSCD 1998年第3期220-223,共4页 Journal of Data Acquisition and Processing

基金江苏省自然科学基金国防科工委预研基金

关键词语音识别端点检测汉语语音隐马尔可夫模型 speech recognition detection ending point detection hidden Markov model dynamic time warping

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

1刘彤.噪声环境下的汉语语音识别技术[J].情报指挥控制系统与仿真技术,2001(9):44-50. 被引量：2
2赵珀璋.汉语语音信息处理系统体系结构设计[J].电子学报,1989,17(3):19-23.
3沈泉波,韩慧莲.基于HMM的语音识别系统的Matlab仿真[J].电声技术,2012,36(10):56-57. 被引量：3
4李冠宇.隐马尔可夫模型及其在语音识别中的应用[J].科技风,2011(23):89-90.
5杜利民,侯自强.汉语语音识别研究面临的一些科学问题[J].电子学报,1995,23(10):110-116. 被引量：21
6冯丽娟,吾守尔.斯拉木.维吾尔语连续语音识别技术研究[J].现代计算机,2010,16(1):4-7. 被引量：2
7王巍,王成友.说话人识别技术综述[J].湖南通信技术,1999(4):19-21.
8檀蕊莲.基于DTW的说话人识别技术研究[J].黑龙江科技信息,2010(13):42-42. 被引量：1
9黄见峰.基于马尔可夫的软件可信评估模型研究[J].电子世界,2014(16):374-374. 被引量：1
10李苇营,易克初,胡征.神经网络与HMM构成的混合网络在语音识别中应用的研究[J].电子学报,1994,22(10):73-80. 被引量：8

数据采集与处理

1998年第3期

浏览历史

内容加载中请稍等...

基于动态时间规整和隐马尔可夫统一模型的无端点检测的汉语识别算法

相关作者

相关机构

相关主题

浏览历史