
Dialect Identification Based on Dynamic Time Warping and Neural Networks (cited by: 9)
Abstract: Research on Chinese dialect identification not only helps improve the efficiency of dialect speech recognition systems but also has important applications in areas such as criminal investigation by public security departments. Taking Hunan dialects as the object of study, this paper examines in depth the differences between the features of various dialects and how to choose appropriate feature parameters for dialect identification. Because speech signals are highly random while the input structure of a neural network is relatively fixed, a dialect identification method based on dynamic time warping and a neural network is proposed, implemented as a hybrid cascade of a time alignment network and a BP neural network. Experimental results show that, with the same feature parameters, the identification rate differs across dialect categories and across tones.
Source: Computer Engineering and Applications (《计算机工程与应用》), CSCD, Peking University Core Journal, 2008, No. 10, pp. 211-213 (3 pages)
Funding: Research Project of the Department of Education of Hunan Province, China (Grant No. 06C517)
Keywords: dialect identification; speech characteristics; Dynamic Time Warping (DTW); neural network
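The abstract motivates dynamic time warping by the mismatch between variable-length speech and the fixed input size of a neural network. The sketch below illustrates that general idea only and is not the paper's implementation: it aligns a feature sequence of any length to a reference template with textbook DTW, then averages the frames mapped to each template position so that the result always has the template's length. The function names (`dtw_path`, `warp_to_reference`, `flatten`), the Euclidean frame distance, and the averaging step are all assumptions made for illustration; the abstract does not specify the features, the template, or the network layout.

```python
from typing import List, Sequence, Tuple

Frame = Sequence[float]


def frame_distance(a: Frame, b: Frame) -> float:
    """Euclidean distance between two feature frames (assumed metric)."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def dtw_path(seq: Sequence[Frame], ref: Sequence[Frame]) -> Tuple[float, List[Tuple[int, int]]]:
    """Classic DTW: return the total alignment cost and the warping path."""
    n, m = len(seq), len(ref)
    inf = float("inf")
    # cost[i][j] = best cost of aligning seq[:i] with ref[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq[i - 1], ref[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1], cost[i - 1][j], cost[i][j - 1])
    # Backtrack from the end, always stepping to the cheapest predecessor.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return cost[n][m], path


def warp_to_reference(seq: Sequence[Frame], ref: Sequence[Frame]) -> List[List[float]]:
    """Average the frames of seq aligned to each reference frame.

    The output has exactly len(ref) frames regardless of len(seq).
    """
    _, path = dtw_path(seq, ref)
    dim = len(ref[0])
    sums = [[0.0] * dim for _ in ref]
    counts = [0] * len(ref)
    for i, j in path:
        counts[j] += 1
        for k in range(dim):
            sums[j][k] += seq[i][k]
    return [[s / counts[j] for s in sums[j]] for j in range(len(ref))]


def flatten(frames: Sequence[Frame]) -> List[float]:
    """Concatenate frames into one fixed-length vector for a network input layer."""
    return [value for frame in frames for value in frame]
```

With a three-frame template, utterances of six frames and of two frames both come out as three-frame sequences, so a network with a fixed number of input nodes could consume either one after flattening.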
